DeTangle: A Framework for Interactive Prediction and Visualization of Gene Regulatory Networks

dc.contributor.authorAltarawy, Doaa Abdelsalam Ahmed Mohameden
dc.contributor.committeechairHeath, Lenwood S.en
dc.contributor.committeememberNorth, Christopher L.en
dc.contributor.committeememberGrene, Ruthen
dc.contributor.committeememberShaffer, Clifford A.en
dc.contributor.committeememberIsmail, Mahameden
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2018-10-25T06:00:55Zen
dc.date.available2018-10-25T06:00:55Zen
dc.date.issued2017-05-02en
dc.description.abstractWith the abundance of biological data, computational prediction of gene regulatory networks (GRNs) from gene expression data has become more feasible. Although incorporating other prior knowledge (PK), along with gene expression, greatly improves prediction accuracy, the accuracy remains low. PK in GRN inference can be categorized into noisy and curated. Several algorithms were proposed to incorporate noisy PK, but none address curated PK. Another challenge is that much of the PK is not stored in databases or not in a unified structured format to be accessible by inference algorithms. Moreover, no GRN inference method exists that supports post-prediction PK. This thesis addresses those limitations with three solutions: PEAK algorithm for integrating both curated and noisy PK, Online-PEAK for post-prediction interactive feedback, and DeTangle for visualization and navigation of GRNs. PEAK integrates both curated as well as noisy PK in GRN inference. We introduce a novel method for GRN inference, CurInf, to effectively integrate curated PK, and we use the previous method, Modified Elastic Net, for noisy PK, and we call it NoisInf. Using 100% curated PK, CurInf improves the AUPR accuracy score over NoisInf by 27.3% in synthetic data, 86.5% in E. coli data, and 31.1% in S. cerevisiae data. Moreover, we developed an online algorithm, online-PEAK, that enables the biologist to interact with the inference algorithm, PEAK, through a visual interface to add their domain experience about the structure of the GRN as a feedback to the system. We experimentally verified the ability of online-PEAK to achieve incremental accuracy when PK is added by the user, including true and false PK. Even when the noise in PK is 10 times more than true PK, online-PEAK performs better than inference without any PK. Finally, we present DeTangle, a Web server for interactive GRN prediction and visualization. DeTangle provides a seamless analysis of GRN starting from uploading gene expression, GRN inference, post-prediction feedback using online-PEAK, and visualization and navigation of the predicted GRN. More accurate prediction of GRN can facilitate studying complex molecular interactions, understanding diseases, and aiding drug design.en
dc.description.abstractgeneralProteins are complex molecules that are responsible for most of the functionalities in living organisms. Proteins are produced from genes inside the cells. The production of proteins can be controlled by other proteins causing their production to increase or to decrease. This control is called gene regulation. Studying gene regulation between genes is important because it facilitates understanding diseases and aiding drug design. In this thesis, I developed computational methods, called PEAK and online-PEAK, to predict the interactions between genes using computer algorithms. My test results show that the method PEAK is more accurate than previous existing methods. I have also built a Web application, DeTangle, to help biologists use the algorithm and visualize the results.en
dc.description.degreePh. D.en
dc.format.mediumETDen
dc.identifier.othervt_gsexam:11346en
dc.identifier.urihttp://hdl.handle.net/10919/85504en
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectGene regulationen
dc.subjectprior knowledgeen
dc.subjectgene regulatory network inferenceen
dc.subjectvisualizationen
dc.subjectMachine learningen
dc.titleDeTangle: A Framework for Interactive Prediction and Visualization of Gene Regulatory Networksen
dc.typeDissertationen
thesis.degree.disciplineComputer Science and Applicationsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.namePh. D.en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Altarawy_DA_D_2017.pdf
Size:
1.97 MB
Format:
Adobe Portable Document Format