Linkage Based Dirichlet Processes

dc.contributor.authorSong, Yuhyunen
dc.contributor.committeechairLeman, Scotland C.en
dc.contributor.committeememberTerrell, George R.en
dc.contributor.committeememberHouse, Leanna L.en
dc.contributor.committeememberKim, Inyoungen
dc.contributor.departmentStatisticsen
dc.date.accessioned2017-02-09T09:00:29Zen
dc.date.available2017-02-09T09:00:29Zen
dc.date.issued2017-02-08en
dc.description.abstractWe live in the era of textit{Big Data} with significantly richer computational resources than the last two decades. The concurrence of computation resources and a large volume of data has boosted researchers' desire for developing feasible Markov Chain Monte Carlo (MCMC) algorithms for large parameter spaces. Dirichlet Process Mixture Models (DPMMs) have become a Bayesian mainstay for modeling heterogeneous structures, namely clusters, especially when the quantity of clusters is not known with the established MCMC methods. As opposed to many ad-hoc clustering methods, using Dirichlet Processes (DPs) in models provide a flexible and probabilistic approach for automatically estimating both cluster structure and quantity. While DPs are not fully parameterized, they depend on both a base measure and a concentration parameter that can heavily impact inferences. Determining the concentration parameter is critical and essential, since it adjusts the a-priori cluster expectation, but typical approaches for specifying this parameter are rather cavalier. In this work, we propose a new method for automatically and adaptively determining this parameter, which directly calibrates distances between clusters through an explicit link function within the DP. Furthermore, we extend our method to mixture models with Nested Dirichlet Processes (NDPs) that cluster the multilevel data and depend on the specification of a vector of concentration parameters. In this work, we detail how to incorporate our method in Markov chain Monte Carlo algorithms, and illustrate our findings through a series of comparative simulation studies and applications.en
dc.description.degreePh. D.en
dc.format.mediumETDen
dc.identifier.othervt_gsexam:9752en
dc.identifier.urihttp://hdl.handle.net/10919/74970en
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectconcentration parameteren
dc.subjectDirichlet processesen
dc.subjectnested Dirichlet processesen
dc.titleLinkage Based Dirichlet Processesen
dc.typeDissertationen
thesis.degree.disciplineStatisticsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.namePh. D.en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Song_Y_D_2017.pdf
Size:
2.42 MB
Format:
Adobe Portable Document Format