Design and Evaluation of Network Algorithms and Deep Learning Models in Systems Biology and Biomedicine

dc.contributor.authorTasnina, Nureen
dc.contributor.committeechairMurali, T. M.en
dc.contributor.committeememberCrovella, Marken
dc.contributor.committeememberZhang, Liqingen
dc.contributor.committeememberEldardiry, Hoda Mohameden
dc.contributor.committeememberHeath, Lenwood S.en
dc.contributor.departmentComputer Science and#38; Applicationsen
dc.date.accessioned2025-10-08T08:00:08Zen
dc.date.available2025-10-08T08:00:08Zen
dc.date.issued2025-10-07en
dc.description.abstractWe present novel solutions to four interrelated problems in network biology and computational biomedicine. The first two address methodological challenges in network biology, while the latter two focus on applications of network-based computational modeling in therapeutics. (i) We introduce a provenance tracing framework for random walk-based network diffusion algorithms. By quantifying the contribution of individual paths to a node's diffusion score, the framework captures the influence of global network topology and identifies critical mediators of information flow. This approach enhances the interpretability of diffusion-based predictions, a key requirement for their reliable application in biomedical contexts. (ii) Recognizing that the utility of network diffusion and other network-based algorithms depends on the quality of the underlying networks, we present ICoN, an unsupervised coattention-based graph neural network model for integrating heterogeneous protein-protein interaction networks. ICoN learns joint embeddings across multiple networks and captures complementary biological evidence. ICoN surpassed individual networks across three downstream tasks: gene module detection, gene coannotation prediction, and protein function prediction. Compared to existing unsupervised network integration models, ICoN exhibits superior performance across the majority of downstream tasks and shows enhanced robustness against noise. This work introduces a promising approach for effectively integrating diverse protein-protein association networks, aiming to achieve a biologically meaningful representation of proteins. (iii) Shifting our focus towards therapeutic applications of computational models, we systematically evaluate deep learning models for drug synergy prediction, an important challenge in combination therapy design. Using the SynVerse framework, we assess 16 models across diverse feature types and deep learning based architectures and demonstrate that models often exploit dataset-specific shortcuts rather than biologically meaningful drug or cell line features. Robust evaluation, as enabled by SynVerse, may improve the likelihood that model predictions hold up when tested in experimental and clinical settings. (iv) Finally, SynVerse's findings motivate the need to build a database of synergistic drug combinations annotated with their underlying synergy mechanisms so that one may build mechanism-aware predictive models. Hence, in the final project, we construct a structured database of synergy mechanisms from biomedical literature using a large language modelbased pipeline. This approach integrates literature retrieval, information extraction, biomedical entity recognition, and knowledge graph construction to extract both free-form and structured representations of mechanisms. The resulting resource, encompassing around 3,000 drug combinations, lays the foundation for developing mechanism-aware predictive models for drug synergy.en
dc.description.abstractgeneralCells function through a complex web of interactions among genes, proteins, and other molecules. A central goal of modern biology is to understand how these interactions shape health and disease. To study them, researchers often use a network representation, where molecules are depicted as nodes and their interactions as edges. These networks can then be analyzed using both traditional algorithms and machine learning models to reveal patterns of cellular behavior. In the first part of this thesis, we focus on protein-protein interaction (PPI) networks that represent how proteins connect and work together. We develop a method to explain the predictions of a widely used algorithm: network diffusion. Making these predictions interpretable is critical for building confidence in their use for biomedical research. Recognizing that the effectiveness of network-based methods depends on the quality of the input networks, we further design a deep learning model to integrate PPI networks generated from multiple experimental sources. The integrated network provides richer biological information and outperforms individual source networks in predicting protein function. Building on this foundation, our next two projects addressed a key biomedical application: combination therapy. Combination therapies are a cornerstone of cancer treatment, offering improved efficacy at lower doses and helping to overcome drug resistance. However, experimentally testing the vast space of drug combinations is prohibitively expensive and time-consuming, making computational prediction indispensable. We conduct a large-scale evaluation of existing computational models for drug synergy prediction, and demonstrate that many approaches rely on statistical shortcuts in the data rather than capturing true biological mechanisms. This result underscores the need for more robust and trustworthy predictors. To support the development of such stronger models, we build a pipeline powered by large language models to extract and organize information about the mechanisms, the "why" behind effective drug combinations in cancer treatment. Together, these contributions provide new methods and guidelines for making biological predictions more transparent, reliable, and useful for guiding experiments and therapeutic development.en
dc.description.degreeDoctor of Philosophyen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:44705en
dc.identifier.urihttps://hdl.handle.net/10919/138096en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectnetwork biologyen
dc.subjectbiomedicineen
dc.subjectcombination therapyen
dc.subjectdrug repurposingen
dc.subjectgraph machine learningen
dc.subjectLLMen
dc.titleDesign and Evaluation of Network Algorithms and Deep Learning Models in Systems Biology and Biomedicineen
dc.typeDissertationen
thesis.degree.disciplineComputer Science & Applicationsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.nameDoctor of Philosophyen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Tasnina_N_D_2025.pdf
Size:
4.27 MB
Format:
Adobe Portable Document Format