Computational Dissection of Composite Molecular Signatures and Transcriptional Modules

Gong, Ting

Computational Dissection of Composite Molecular Signatures and Transcriptional Modules

dc.contributor.author	Gong, Ting	en
dc.contributor.committeechair	Xuan, Jianhua Jason	en
dc.contributor.committeemember	Lu, Chang-Tien	en
dc.contributor.committeemember	Wyatt, Christopher L.	en
dc.contributor.committeemember	Midkiff, Scott F.	en
dc.contributor.committeemember	Wang, Yue J.	en
dc.contributor.department	Electrical and Computer Engineering	en
dc.date.accessioned	2017-04-06T15:44:39Z	en
dc.date.adate	2010-01-22	en
dc.date.available	2017-04-06T15:44:39Z	en
dc.date.issued	2009-12-14	en
dc.date.rdate	2016-09-30	en
dc.date.sdate	2009-12-24	en
dc.description.abstract	This dissertation aims to develop a latent variable modeling framework with which to analyze gene expression profiling data for computational dissection of molecular signatures and transcriptional modules. The first part of the dissertation is focused on extracting pure gene expression signals from tissue or cell mixtures. The main goal of gene expression profiling is to identify the pure signatures of different cell types (such as cancer cells, stromal cells and inflammatory cells) and estimate the concentration of each cell type. In order to accomplish this, a new blind source separation method is developed, namely, nonnegative partially independent component analysis (nPICA), for tissue heterogeneity correction (THC). The THC problem is formulated as a constrained optimization problem and solved with a learning algorithm based on geometrical and statistical principles. The second part of the dissertation sought to identify gene modules from gene expression data to uncover important biological processes in different types of cells. A new gene clustering approach, nonnegative independent component analysis (nICA), is developed for gene module identification. The nICA approach is completed with an information-theoretic procedure for input sample selection and a novel stability analysis approach for proper dimension estimation. Experimental results showed that the gene modules identified by the nICA approach appear to be significantly enriched in functional annotations in terms of gene ontology (GO) categories. The third part of the dissertation moves from gene module level down to DNA sequence level to identify gene regulatory programs by integrating gene expression data and protein-DNA binding data. A sparse hidden component model is first developed for this problem, taking into account a well-known biological principle, i.e., a gene is most likely regulated by a few regulators. This is followed by the development of a novel computational approach, motif-guided sparse decomposition (mSD), in order to integrate the binding information and gene expression data. These computational approaches are primarily developed for analyzing high-throughput gene expression profiling data. Nevertheless, the proposed methods should be able to be extended to analyze other types of high-throughput data for biomedical research.	en
dc.description.degree	Ph. D.	en
dc.identifier.other	etd-12242009-093205	en
dc.identifier.sourceurl	http://scholar.lib.vt.edu/theses/available/etd-12242009-093205/	en
dc.identifier.uri	http://hdl.handle.net/10919/77302	en
dc.language.iso	en_US	en
dc.publisher	Virginia Tech	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	Tissue Heterogeneity Correction	en
dc.subject	Blind Source Separation	en
dc.subject	Latent Variable Modeling	en
dc.subject	Microarray	en
dc.subject	Gene Regulation	en
dc.subject	Transcriptional Module	en
dc.title	Computational Dissection of Composite Molecular Signatures and Transcriptional Modules	en
dc.type	Dissertation	en
dc.type.dcmitype	Text	en
thesis.degree.discipline	Electrical and Computer Engineering	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	doctoral	en
thesis.degree.name	Ph. D.	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: etd-12242009-093205_Gong_T_D_2009.pdf
Size:: 6.05 MB
Format:: Adobe Portable Document Format

Download

Collections

Doctoral Dissertations