Machine Learning Approaches for Identifying microRNA Targets and Conserved Protein Complexes

dc.contributor.authorTorkey, Hanaa A.en
dc.contributor.committeechairHeath, Lenwood S.en
dc.contributor.committeememberZhang, Liqingen
dc.contributor.committeememberGrene, Ruthen
dc.contributor.committeememberDeng, Xinweien
dc.contributor.committeememberElHefnawi, Mahmoud M.en
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2017-04-28T09:50:58Zen
dc.date.available2017-04-28T09:50:58Zen
dc.date.issued2017-04-27en
dc.description.abstractMuch research has been directed toward understanding the roles of essential components in the cell, such as proteins, microRNAs, and genes. This dissertation focuses on two interesting problems in bioinformatics research: microRNA-target prediction and the identification of conserved protein complexes across species. We define the two problems and develop novel approaches for solving them. MicroRNAs are short non-coding RNAs that mediate gene expression. The goal is to predict microRNA targets. Existing methods rely on sequence features to predict targets. These features are neither sufficient nor necessary to identify functional target sites and ignore the cellular conditions in which microRNA and mRNA interact. We developed MicroTarget to predict microRNA-mRNA interactions using heterogeneous data sources. MicroTarget uses expression data to learn candidate target set for each microRNA. Then, sequence data is used to provide evidence of direct interactions and ranking the predicted targets. The predicted targets overlap with many of the experimentally validated ones. The results indicate that using expression data helps in predicting microRNA targets accurately. Protein complexes conserved across species specify processes that are core to cell machinery. Methods that have been devised to identify conserved complexes are severely limited by noise in PPI data. Behind PPIs, there are domains interacting physically to perform the necessary functions. Therefore, employing domains and domain interactions gives a better view of the protein interactions and functions. We developed novel strategy for local network alignment, DONA. DONA maps proteins into their domains and uses DDIs to improve the network alignment. We developed novel strategy for constructing an alignment graph and then uses this graph to discover the conserved sub-networks. DONA shows better performance in terms of the overlap with known protein complexes with higher precision and recall rates than existing methods. The result shows better semantic similarity computed with respect to both the biological process and the molecular function of the aligned sub-networks.en
dc.description.degreePh. D.en
dc.format.mediumETDen
dc.identifier.othervt_gsexam:11177en
dc.identifier.urihttp://hdl.handle.net/10919/77536en
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectmicroRNA targeten
dc.subjectMachine learningen
dc.subjectnetwork alignmenten
dc.subjectprotein complex.en
dc.titleMachine Learning Approaches for Identifying microRNA Targets and Conserved Protein Complexesen
dc.typeDissertationen
thesis.degree.disciplineComputer Science and Applicationsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.namePh. D.en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Torkey_HA_D_2017.pdf
Size:
2.34 MB
Format:
Adobe Portable Document Format