Bayesian Variable Selection and Inference for Nonparametric Kernel Machine and Functional Models

dc.contributor.authorJin, Phil Geunen
dc.contributor.committeechairKim, Inyoungen
dc.contributor.committeememberFranck, Christopher Thomasen
dc.contributor.committeememberDatta, Jyotishkaen
dc.contributor.committeememberDeng, Xinweien
dc.contributor.departmentStatisticsen
dc.date.accessioned2025-05-21T08:03:18Zen
dc.date.available2025-05-21T08:03:18Zen
dc.date.issued2025-05-20en
dc.description.abstractIn this dissertation, we have focused on developing three methods to address the challenges in highly correlated high-dimensional and functional data. In the first study, the Bayesian variable selection method is developed under a generalized fused multi-kernel machine regression. This method can apply to continuous/binary/ordered categorical response variables. We demonstrate the advantage of our method using bio-photonics Raman spectroscopy to identify which molecular fingerprinting wavenumber is associated with drug dosages of brain tumors. In the second study, we propose a Bayesian inference based on the Bayes factor. Our approach employs a generalized fused multi-kernel machine regression to adjust for multiple tests and identify significant pathways. The advantage of this method is illustrated by using genetic pathway data to test significantly correlated multiple pathways associated with Type II diabetes, estimating nonlinear relationships. Finally, we introduce a testing procedure for the departure of nonlinearity using a functional single index model. This procedure employs a randomly projected empirical process to reduce dimensionality while preserving essential statistical properties. The method is applied to autism brain imaging data to test whether fMRI signals are related to the autism diagnostic observation schedule. Therefore, the proposed three methods advance the field of variable selection and inference by offering innovative solutions to problems associated with correlated high-dimensional and functional data with practical applications across various domains.en
dc.description.abstractgeneralThis study introduces new statistical methods to better understand and analyze complex data, which involves many variables and complex relationships. Our work focuses on improving how we identify important variables and test for various types of data. The first work introduces a new approach to handling complex datasets, such as those obtained from advanced imaging in medical research. This method helps us pinpoint significant variables in data where information is gathered from closely related sources, such as tracking drug effects on brain cancer cells. The second work presents a Bayesian statistics method to identify important functions in high-dimensional data. This approach is beneficial for studying genetic pathways, which are often interconnected. By refining how we detect these pathways, we provide an understanding of their role in diseases like type II diabetes. Lastly, we propose a new test for checking nonlinearity in functional data, which includes data collected over time or space, like brain activity measured by fMRI scans. Our method simplifies complex data while retaining key information, revealing patterns that traditional methods might overlook. This advancement offers a new way to understand brain activity in conditions like autism. Overall, our methods enhance how we analyze and interpret complex, high-dimensional data, improving our ability to identify important variables and relationships in various fields, including medical research and genetics.en
dc.description.degreeDoctor of Philosophyen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:44109en
dc.identifier.urihttps://hdl.handle.net/10919/133541en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectBayes Factoren
dc.subjectFunctional Dataen
dc.subjectFused Lassoen
dc.subjectGeneralized MultiKernel Regressionen
dc.subjectVariable Selectionen
dc.titleBayesian Variable Selection and Inference for Nonparametric Kernel Machine and Functional Modelsen
dc.typeDissertationen
thesis.degree.disciplineStatisticsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.nameDoctor of Philosophyen

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Jin_P_D_2025.pdf
Size:
2.98 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Jin_P_D_2025_support_1.pdf
Size:
2.98 MB
Format:
Adobe Portable Document Format
Description:
Supporting documents