Bayesian Visual Analytics: Interactive Visualization for High Dimensional Data

dc.contributor.authorHan, Chaoen
dc.contributor.committeechairLeman, Scotland C.en
dc.contributor.committeememberNorth, Christopher L.en
dc.contributor.committeememberSmith, Eric P.en
dc.contributor.committeememberHouse, Leanna L.en
dc.contributor.departmentStatisticsen
dc.date.accessioned2013-02-19T22:36:19Zen
dc.date.available2013-02-19T22:36:19Zen
dc.date.issued2012-12-07en
dc.description.abstractIn light of advancements made in data collection techniques over the past two decades, data mining has become common practice to summarize large, high dimensional datasets, in hopes of discovering noteworthy data structures. However, one concern is that most data mining approaches rely upon strict criteria that may mask information in data that analysts may find useful. We propose a new approach called Bayesian Visual Analytics (BaVA) which merges Bayesian Statistics with Visual Analytics to address this concern. The BaVA framework enables experts to interact with the data and the feature discovery tools by modeling the "sense-making" process using Bayesian Sequential Updating. In this paper, we use BaVA idea to enhance high dimensional visualization techniques such as Probabilistic PCA (PPCA). However, for real-world datasets, important structures can be arbitrarily complex and a single data projection such as PPCA technique may fail to provide useful insights. One way for visualizing such a dataset is to characterize it by a mixture of local models. For example, Tipping and Bishop [Tipping and Bishop, 1999] developed an algorithm called Mixture Probabilistic PCA (MPPCA) that extends PCA to visualize data via a mixture of projectors. Based on MPPCA, we developped a new visualization algorithm called Covariance-Guided MPPCA which group similar covariance structured clusters together to provide more meaningful and cleaner visualizations. Another way to visualize a very complex dataset is using nonlinear projection methods such as  the Generative Topographic Mapping algorithm(GTM). We developped an interactive version of GTM to discover interesting local data structures. We demonstrate the performance of our approaches using both synthetic and real dataset and compare our algorithms with existing ones.en
dc.description.degreePh. D.en
dc.format.mediumETDen
dc.identifier.othervt_gsexam:79en
dc.identifier.urihttp://hdl.handle.net/10919/19210en
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectVisual Analyticsen
dc.subjectBayesian Methodsen
dc.subjectDimension Reductionen
dc.subjectHuman-computer Interactionen
dc.titleBayesian Visual Analytics: Interactive Visualization for High Dimensional Dataen
dc.typeDissertationen
thesis.degree.disciplineStatisticsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.namePh. D.en

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Han_C_D_2012.pdf
Size:
2.99 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Han_C_D_2012_support_1.pdf
Size:
171.41 KB
Format:
Adobe Portable Document Format
Description:
Supporting documents