dc.contributor.author: Jin, Ying [en_US]
dc.date.accessioned: 2014-03-14T21:23:44Z
dc.date.available: 2014-03-14T21:23:44Z
dc.date.issued: 2009-12-08 [en_US]
dc.identifier.other: etd-12302009-142944 [en_US]
dc.identifier.uri: http://hdl.handle.net/10919/40493
dc.description.abstract: Biological network data is plentiful with practically every experimental methodology giving "network views" into cellular function and behavior. Bioinformatic screens that yield network data include, for example, genome-wide deletion screens, protein-protein interaction assays, RNA interference experiments, and methods to probe metabolic pathways. Efficient and comprehensive computational approaches are required to model these screens and gain insight into the nature of biological networks. This thesis presents three new algorithms to model and mine network datasets. First, we present an algorithm that models genome-wide perturbation screens by deriving relations between phenotypes and subsequently using these relations in a local manner to derive gene-phenotype relationships. We show how this algorithm outperforms all previously described algorithms for gene-phenotype modeling. We also present theoretical insight into the convergence and accuracy properties of this approach. Second, we define a new data mining problem, constrained minimal separator mining, and propose algorithms as well as applications to modeling gene perturbation screens by viewing the perturbed genes as a graph separator. Both of these data mining applications are evaluated on network datasets from S. cerevisiae and C. elegans. Finally, we present an approach to model the relationship between metabolic pathways and operon structure in prokaryotic genomes. In this approach, we present a new pattern class, biclusters over domains with supplied partial orders, and present algorithms for systematically detecting such biclusters. Together, our data mining algorithms provide a comprehensive arsenal of techniques for modeling gene perturbation screens and metabolic pathways. [en_US]
dc.publisher: Virginia Tech [en_US]
dc.relation.haspart: Jin_Ying_D_2009.pdf [en_US]
dc.rights: In Copyright [en]
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/ [en]
dc.subject: partial orders [en_US]
dc.subject: biclusters [en_US]
dc.subject: graph separators [en_US]
dc.subject: relative importance methods [en_US]
dc.subject: Biological networks [en_US]
dc.title: New Algorithms for Mining Network Datasets: Applications to Phenotype and Pathway Modeling [en_US]
dc.type: Dissertation [en_US]
dc.contributor.department: Computer Science [en_US]
dc.description.degree: Ph. D. [en_US]
thesis.degree.name: Ph. D. [en_US]
thesis.degree.level: doctoral [en_US]
thesis.degree.grantor: Virginia Polytechnic Institute and State University [en_US]
thesis.degree.discipline: Computer Science [en_US]
dc.contributor.committeechair: Ramakrishnan, Narendran [en_US]
dc.contributor.committeemember: Fox, Edward A. [en_US]
dc.contributor.committeemember: Heath, Lenwood S. [en_US]
dc.contributor.committeemember: Murali, T. M. [en_US]
dc.contributor.committeemember: Helm, Richard Frederick [en_US]
dc.identifier.sourceurl: http://scholar.lib.vt.edu/theses/available/etd-12302009-142944/ [en_US]
dc.date.sdate: 2009-12-30 [en_US]
dc.date.rdate: 2010-01-22
dc.date.adate: 2010-01-22 [en_US]

