Design and Evaluation of Network Algorithms and Deep Learning Models in Systems Biology and Biomedicine
dc.contributor.author | Tasnina, Nure | en |
dc.contributor.committeechair | Murali, T. M. | en |
dc.contributor.committeemember | Crovella, Mark | en |
dc.contributor.committeemember | Zhang, Liqing | en |
dc.contributor.committeemember | Eldardiry, Hoda Mohamed | en |
dc.contributor.committeemember | Heath, Lenwood S. | en |
dc.contributor.department | Computer Science and#38; Applications | en |
dc.date.accessioned | 2025-10-08T08:00:08Z | en |
dc.date.available | 2025-10-08T08:00:08Z | en |
dc.date.issued | 2025-10-07 | en |
dc.description.abstract | We present novel solutions to four interrelated problems in network biology and computational biomedicine. The first two address methodological challenges in network biology, while the latter two focus on applications of network-based computational modeling in therapeutics. (i) We introduce a provenance tracing framework for random walk-based network diffusion algorithms. By quantifying the contribution of individual paths to a node's diffusion score, the framework captures the influence of global network topology and identifies critical mediators of information flow. This approach enhances the interpretability of diffusion-based predictions, a key requirement for their reliable application in biomedical contexts. (ii) Recognizing that the utility of network diffusion and other network-based algorithms depends on the quality of the underlying networks, we present ICoN, an unsupervised coattention-based graph neural network model for integrating heterogeneous protein-protein interaction networks. ICoN learns joint embeddings across multiple networks and captures complementary biological evidence. ICoN surpassed individual networks across three downstream tasks: gene module detection, gene coannotation prediction, and protein function prediction. Compared to existing unsupervised network integration models, ICoN exhibits superior performance across the majority of downstream tasks and shows enhanced robustness against noise. This work introduces a promising approach for effectively integrating diverse protein-protein association networks, aiming to achieve a biologically meaningful representation of proteins. (iii) Shifting our focus towards therapeutic applications of computational models, we systematically evaluate deep learning models for drug synergy prediction, an important challenge in combination therapy design. Using the SynVerse framework, we assess 16 models across diverse feature types and deep learning based architectures and demonstrate that models often exploit dataset-specific shortcuts rather than biologically meaningful drug or cell line features. Robust evaluation, as enabled by SynVerse, may improve the likelihood that model predictions hold up when tested in experimental and clinical settings. (iv) Finally, SynVerse's findings motivate the need to build a database of synergistic drug combinations annotated with their underlying synergy mechanisms so that one may build mechanism-aware predictive models. Hence, in the final project, we construct a structured database of synergy mechanisms from biomedical literature using a large language modelbased pipeline. This approach integrates literature retrieval, information extraction, biomedical entity recognition, and knowledge graph construction to extract both free-form and structured representations of mechanisms. The resulting resource, encompassing around 3,000 drug combinations, lays the foundation for developing mechanism-aware predictive models for drug synergy. | en |
dc.description.abstractgeneral | Cells function through a complex web of interactions among genes, proteins, and other molecules. A central goal of modern biology is to understand how these interactions shape health and disease. To study them, researchers often use a network representation, where molecules are depicted as nodes and their interactions as edges. These networks can then be analyzed using both traditional algorithms and machine learning models to reveal patterns of cellular behavior. In the first part of this thesis, we focus on protein-protein interaction (PPI) networks that represent how proteins connect and work together. We develop a method to explain the predictions of a widely used algorithm: network diffusion. Making these predictions interpretable is critical for building confidence in their use for biomedical research. Recognizing that the effectiveness of network-based methods depends on the quality of the input networks, we further design a deep learning model to integrate PPI networks generated from multiple experimental sources. The integrated network provides richer biological information and outperforms individual source networks in predicting protein function. Building on this foundation, our next two projects addressed a key biomedical application: combination therapy. Combination therapies are a cornerstone of cancer treatment, offering improved efficacy at lower doses and helping to overcome drug resistance. However, experimentally testing the vast space of drug combinations is prohibitively expensive and time-consuming, making computational prediction indispensable. We conduct a large-scale evaluation of existing computational models for drug synergy prediction, and demonstrate that many approaches rely on statistical shortcuts in the data rather than capturing true biological mechanisms. This result underscores the need for more robust and trustworthy predictors. To support the development of such stronger models, we build a pipeline powered by large language models to extract and organize information about the mechanisms, the "why" behind effective drug combinations in cancer treatment. Together, these contributions provide new methods and guidelines for making biological predictions more transparent, reliable, and useful for guiding experiments and therapeutic development. | en |
dc.description.degree | Doctor of Philosophy | en |
dc.format.medium | ETD | en |
dc.identifier.other | vt_gsexam:44705 | en |
dc.identifier.uri | https://hdl.handle.net/10919/138096 | en |
dc.language.iso | en | en |
dc.publisher | Virginia Tech | en |
dc.rights | In Copyright | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
dc.subject | network biology | en |
dc.subject | biomedicine | en |
dc.subject | combination therapy | en |
dc.subject | drug repurposing | en |
dc.subject | graph machine learning | en |
dc.subject | LLM | en |
dc.title | Design and Evaluation of Network Algorithms and Deep Learning Models in Systems Biology and Biomedicine | en |
dc.type | Dissertation | en |
thesis.degree.discipline | Computer Science & Applications | en |
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
thesis.degree.level | doctoral | en |
thesis.degree.name | Doctor of Philosophy | en |
Files
Original bundle
1 - 1 of 1