Bayesian Variable Shrinkage and Selection in Compositional Data Regression: Application to Oral Microbiome

dc.contributor.authorDatta, Jyotishkaen
dc.contributor.authorBandyopadhyay, Dipankaren
dc.date.accessioned2025-11-20T15:58:34Zen
dc.date.available2025-11-20T15:58:34Zen
dc.date.issued2024-12-01en
dc.description.abstractMicrobiome studies generate multivariate compositional responses, such as taxa counts, which are strictly non-negative, bounded, residing within a simplex, and subject to unit-sum constraint. In presence of covariates (which can be moderate to high dimensional), they are popularly modeled via the Dirichlet-Multinomial (D-M) regression framework. In this paper, we consider a Bayesian approach for estimation and inference under a D-M compositional framework, and present a comparative evaluation of some state-of-the-art continuous shrinkage priors for efficient variable selection to identify the most significant associations between available covariates, and taxonomic abundance. Specifically, we compare the performances of the horseshoe and horseshoe+ priors (with the benchmark Bayesian lasso), utilizing Hamiltonian Monte Carlo techniques for posterior sampling, and generating posterior credible intervals. Our simulation studies using synthetic data demonstrate excellent recovery and estimation accuracy of sparse parameter regime by the continuous shrinkage priors. We further illustrate our method via application to a motivating oral microbiome data generated from the NYC-Hanes study. RStan implementation of our method is made available at the GitHub link: (https://github.com/dattahub/compshrink).en
dc.description.sponsorshipFoundation for the National Institutes of Health [R21DE031879, R01DE031134]; United States National Institutes of Healthen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1007/s41096-024-00194-9en
dc.identifier.eissn2364-9569en
dc.identifier.issue2en
dc.identifier.pmid39403125en
dc.identifier.urihttps://hdl.handle.net/10919/139700en
dc.identifier.volume25en
dc.language.isoenen
dc.publisherSpringernatureen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectBayesianen
dc.subjectCompositional dataen
dc.subjectGeneralized Dirichleten
dc.subjectDirichleten
dc.subjectLarge pen
dc.subjectShrinkage prioren
dc.subjectSparse probability vectorsen
dc.subjectStick-breakingen
dc.subjectHorseshoeen
dc.titleBayesian Variable Shrinkage and Selection in Compositional Data Regression: Application to Oral Microbiomeen
dc.title.serialJournal of the Indian Society for Probability and Statisticsen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DattaBayesian.pdf
Size:
2.28 MB
Format:
Adobe Portable Document Format
Description:
Published version