Nonparametric Bayesian Functional Clustering with Applications to Racial Disparities in Breast Cancer
dc.contributor.author | Gao, Wenyu | en |
dc.contributor.author | Kim, Inyoung | en |
dc.contributor.author | Nam, Wonil | en |
dc.contributor.author | Ren, Xiang | en |
dc.contributor.author | Zhou, Wei | en |
dc.contributor.author | Agah, Masoud | en |
dc.date.accessioned | 2024-02-22T14:08:23Z | en |
dc.date.available | 2024-02-22T14:08:23Z | en |
dc.date.issued | 2024-01 | en |
dc.description.abstract | As we have easier access to massive data sets, functional analyses have gained more interest. However, such data sets often exhibit substantial heterogeneity, noise, and high dimensionality. When generalizing the analyses from vectors to functions, classical methods might not work directly. This paper considers noisy information reduction in functional analyses from two perspectives: functional clustering, which groups similar observations and thus reduces the sample size, and functional variable selection, which reduces the dimensionality. The complicated data structures and relations can be easily modeled by a Bayesian hierarchical model due to its flexibility. Hence, this paper proposes a nonparametric Bayesian functional clustering and peak point selection method via weighted Dirichlet process mixture (WDPM) modeling that automatically clusters and provides accurate estimations, together with a conditional Laplace prior, which is a conjugate variable selection prior. The proposed method is named WDPM-VS for short, and is able to simultaneously perform the following tasks: (1) Automatically cluster without specifying the number of clusters or cluster centers beforehand; (2) Cluster heterogeneously behaved functions; (3) Select vibrational peak points; and (4) Reduce noisy information from the two perspectives: sample size and dimensionality. The method greatly outperforms its comparison methods in terms of root mean squared error. Based on this proposed method, we are able to identify biological factors that can explain racial disparities in breast cancer. | en |
dc.description.version | Published version | en |
dc.format.extent | 14 page(s) | en |
dc.format.mimetype | application/pdf | en |
dc.identifier | ARTN e11657 (Article number) | en |
dc.identifier.doi | https://doi.org/10.1002/sam.11657 | en |
dc.identifier.eissn | 1932-1872 | en |
dc.identifier.issn | 1932-1864 | en |
dc.identifier.issue | 1 | en |
dc.identifier.orcid | Zhou, Wei [0000-0002-5257-3885] | en |
dc.identifier.orcid | Agah, Masoud [0000-0001-6117-4539] | en |
dc.identifier.uri | https://hdl.handle.net/10919/118109 | en |
dc.identifier.volume | 17 | en |
dc.language.iso | en | en |
dc.publisher | Wiley | en |
dc.rights | Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International | en |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | en |
dc.subject | functional clustering | en |
dc.subject | nonparametric Bayesian model | en |
dc.subject | peak point selection | en |
dc.subject | surface-enhanced Raman spectroscopy | en |
dc.subject | WDPM-VS | en |
dc.subject | weighted Dirichlet process mixture | en |
dc.title | Nonparametric Bayesian Functional Clustering with Applications to Racial Disparities in Breast Cancer | en |
dc.title.serial | Statistical Analysis and Data Mining | en |
dc.type | Article - Refereed | en |
dc.type.dcmitype | Text | en |
dc.type.other | Article | en |
dcterms.dateAccepted | 2023-12-15 | en |
pubs.organisational-group | /Virginia Tech | en |
pubs.organisational-group | /Virginia Tech/Science | en |
pubs.organisational-group | /Virginia Tech/Science/Statistics | en |
pubs.organisational-group | /Virginia Tech/Engineering | en |
pubs.organisational-group | /Virginia Tech/Engineering/Electrical and Computer Engineering | en |
pubs.organisational-group | /Virginia Tech/Faculty of Health Sciences | en |
pubs.organisational-group | /Virginia Tech/All T&R Faculty | en |
pubs.organisational-group | /Virginia Tech/Engineering/COE T&R Faculty | en |
pubs.organisational-group | /Virginia Tech/Science/COS T&R Faculty | en |
Files
Original bundle
- Name: 2024-Nonparametric Bayesian functional clustering with applications to racial disparities in breast cancer.pdf
- Size: 13.38 MB
- Format: Adobe Portable Document Format
- Description: Published version