Design and Evaluation of Network Algorithms and Deep Learning Models in Systems Biology and Biomedicine

Tasnina, Nure

Design and Evaluation of Network Algorithms and Deep Learning Models in Systems Biology and Biomedicine

dc.contributor.author	Tasnina, Nure	en
dc.contributor.committeechair	Murali, T. M.	en
dc.contributor.committeemember	Crovella, Mark	en
dc.contributor.committeemember	Zhang, Liqing	en
dc.contributor.committeemember	Eldardiry, Hoda Mohamed	en
dc.contributor.committeemember	Heath, Lenwood S.	en
dc.contributor.department	Computer Science and#38; Applications	en
dc.date.accessioned	2025-10-08T08:00:08Z	en
dc.date.available	2025-10-08T08:00:08Z	en
dc.date.issued	2025-10-07	en
dc.description.abstract	We present novel solutions to four interrelated problems in network biology and computational biomedicine. The first two address methodological challenges in network biology, while the latter two focus on applications of network-based computational modeling in therapeutics. (i) We introduce a provenance tracing framework for random walk-based network diffusion algorithms. By quantifying the contribution of individual paths to a node's diffusion score, the framework captures the influence of global network topology and identifies critical mediators of information flow. This approach enhances the interpretability of diffusion-based predictions, a key requirement for their reliable application in biomedical contexts. (ii) Recognizing that the utility of network diffusion and other network-based algorithms depends on the quality of the underlying networks, we present ICoN, an unsupervised coattention-based graph neural network model for integrating heterogeneous protein-protein interaction networks. ICoN learns joint embeddings across multiple networks and captures complementary biological evidence. ICoN surpassed individual networks across three downstream tasks: gene module detection, gene coannotation prediction, and protein function prediction. Compared to existing unsupervised network integration models, ICoN exhibits superior performance across the majority of downstream tasks and shows enhanced robustness against noise. This work introduces a promising approach for effectively integrating diverse protein-protein association networks, aiming to achieve a biologically meaningful representation of proteins. (iii) Shifting our focus towards therapeutic applications of computational models, we systematically evaluate deep learning models for drug synergy prediction, an important challenge in combination therapy design. Using the SynVerse framework, we assess 16 models across diverse feature types and deep learning based architectures and demonstrate that models often exploit dataset-specific shortcuts rather than biologically meaningful drug or cell line features. Robust evaluation, as enabled by SynVerse, may improve the likelihood that model predictions hold up when tested in experimental and clinical settings. (iv) Finally, SynVerse's findings motivate the need to build a database of synergistic drug combinations annotated with their underlying synergy mechanisms so that one may build mechanism-aware predictive models. Hence, in the final project, we construct a structured database of synergy mechanisms from biomedical literature using a large language modelbased pipeline. This approach integrates literature retrieval, information extraction, biomedical entity recognition, and knowledge graph construction to extract both free-form and structured representations of mechanisms. The resulting resource, encompassing around 3,000 drug combinations, lays the foundation for developing mechanism-aware predictive models for drug synergy.	en
dc.description.abstractgeneral	Cells function through a complex web of interactions among genes, proteins, and other molecules. A central goal of modern biology is to understand how these interactions shape health and disease. To study them, researchers often use a network representation, where molecules are depicted as nodes and their interactions as edges. These networks can then be analyzed using both traditional algorithms and machine learning models to reveal patterns of cellular behavior. In the first part of this thesis, we focus on protein-protein interaction (PPI) networks that represent how proteins connect and work together. We develop a method to explain the predictions of a widely used algorithm: network diffusion. Making these predictions interpretable is critical for building confidence in their use for biomedical research. Recognizing that the effectiveness of network-based methods depends on the quality of the input networks, we further design a deep learning model to integrate PPI networks generated from multiple experimental sources. The integrated network provides richer biological information and outperforms individual source networks in predicting protein function. Building on this foundation, our next two projects addressed a key biomedical application: combination therapy. Combination therapies are a cornerstone of cancer treatment, offering improved efficacy at lower doses and helping to overcome drug resistance. However, experimentally testing the vast space of drug combinations is prohibitively expensive and time-consuming, making computational prediction indispensable. We conduct a large-scale evaluation of existing computational models for drug synergy prediction, and demonstrate that many approaches rely on statistical shortcuts in the data rather than capturing true biological mechanisms. This result underscores the need for more robust and trustworthy predictors. To support the development of such stronger models, we build a pipeline powered by large language models to extract and organize information about the mechanisms, the "why" behind effective drug combinations in cancer treatment. Together, these contributions provide new methods and guidelines for making biological predictions more transparent, reliable, and useful for guiding experiments and therapeutic development.	en
dc.description.degree	Doctor of Philosophy	en
dc.format.medium	ETD	en
dc.identifier.other	vt_gsexam:44705	en
dc.identifier.uri	https://hdl.handle.net/10919/138096	en
dc.language.iso	en	en
dc.publisher	Virginia Tech	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	network biology	en
dc.subject	biomedicine	en
dc.subject	combination therapy	en
dc.subject	drug repurposing	en
dc.subject	graph machine learning	en
dc.subject	LLM	en
dc.title	Design and Evaluation of Network Algorithms and Deep Learning Models in Systems Biology and Biomedicine	en
dc.type	Dissertation	en
thesis.degree.discipline	Computer Science & Applications	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	doctoral	en
thesis.degree.name	Doctor of Philosophy	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Tasnina_N_D_2025.pdf
Size:: 4.27 MB
Format:: Adobe Portable Document Format

Download

Collections

Doctoral Dissertations