Computational Models for Resistome Risk Assessment and Environmental Surveillance
| dc.contributor.author | Rumi, Monjura Afrin | en |
| dc.contributor.committeechair | Zhang, Liqing | en |
| dc.contributor.committeemember | Lu, Chang Tien | en |
| dc.contributor.committeemember | Pruden-Bagchi, Amy Jill | en |
| dc.contributor.committeemember | Davis, Benjamin Cole | en |
| dc.contributor.committeemember | Bhattacharya, Debswapna | en |
| dc.contributor.department | Computer Science and#38; Applications | en |
| dc.date.accessioned | 2026-03-20T08:00:16Z | en |
| dc.date.available | 2026-03-20T08:00:16Z | en |
| dc.date.issued | 2026-03-19 | en |
| dc.description.abstract | Antibiotic resistance (AR) is a major global threat to human health and economic stability. Without effective intervention, AR is projected to cause substantial loss of life and severe global economic burden. Addressing this challenge requires coordinated national and international efforts that consider all pathways through which antibiotic resistance can emerge and spread. Many public health and regulatory organizations advocate for a One Health approach, which integrates human, animal, and environmental health perspectives. Core components of this approach include the monitoring, risk assessment, and mitigation of antibiotic resistance. This dissertation addresses all three of these components through the development and application of computational methods for analyzing antibiotic resistance genes (ARGs) in environmental metagenomic data. In Chapter 2, I presented a risk assessment framework designed to evaluate the potential health risk associated with ARGs at a given location. Using metagenomic sequencing data, this framework computes a resistome risk score that quantifies the level of ARG contamination. The method was shown to be robust across different genome assembly strategies and varying sequencing depths. In Chapter 3, I applied this framework to 1,326 metagenomic samples collected from 12 distinct environmental types. This large-scale analysis disentangles biological (e.g., ARG relative abundance), ecological (e.g., taxonomic diversity), and technical (e.g., sequencing coverage) factors that influence resistome risk scores. The results demonstrate that risk scores are significantly affected by taxonomic diversity and are strongly correlated with anthropogenic markers. In Chapter 4, I introduced ARGfore, a forecasting model designed to predict future ARG abundance based on longitudinal surveillance data. Forecasting ARG trends enables earlier detection of emerging resistance risks and supports proactive mitigation strategies. Finally, in Chapter 5, I described WWTPredictor, a regression-based model that predicts the abundance of metagenomic features—specifically ARGs and microbial taxa—released into the environment from WWTPs based on incoming wastewater data. This model provides a quantitative framework for anticipating environmental discharge risks and supports data-driven decision-making in public health and environmental management. | en |
| dc.description.abstractgeneral | Antibiotic resistance (AR) is the ability of bacteria to resist the effects of antibiotics, mak- ing common infections difficult or impossible to treat. Effectively mitigating the spread of AR requires systematic methods to track antibiotic resistance genes (ARGs)—the genetic blueprints that allow bacteria to survive treatment. In this thesis, I designed a computational tool that uses DNA sequencing data from en- vironmental samples to estimate the potential health risk from ARGs at any given location. I also conducted a large-scale analysis to examine how biological, environmental, and techni- cal factors influence the accuracy of these assessments. This study showed that poor-quality data and highly complex microbial communities can "mask" resistance, making it harder to estimate risk. My research indicates that wastewater is more likely to carry high-risk resistance genes that could easily spread further into the environment. Because wastewater collects bacteria from hospitals and households before releasing it into rivers and lakes, it serves as a critical pathway for the spread of resistance. Wastewater surveillance provides an early warning system for public health. Building on this, I devel- oped a deep learning–based model to forecast future trends in ARG levels from surveillance data. I also designed a second model to estimate the abundance of resistant bacteria released from wastewater treatment plants based on the wastewater entering the facility. Together, these tools enable earlier detection of emerging threats and support more informed decisions for public health and environmental protection. | en |
| dc.description.degree | Doctor of Philosophy | en |
| dc.format.medium | ETD | en |
| dc.identifier.other | vt_gsexam:45743 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/142393 | en |
| dc.language.iso | en | en |
| dc.publisher | Virginia Tech | en |
| dc.rights | In Copyright | en |
| dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
| dc.subject | Antibiotic Resistance Gene | en |
| dc.subject | Resistome Risk | en |
| dc.subject | Wastewater Surveillance | en |
| dc.subject | Time-series Forecasting | en |
| dc.subject | Time-Series Regression | en |
| dc.title | Computational Models for Resistome Risk Assessment and Environmental Surveillance | en |
| dc.type | Dissertation | en |
| thesis.degree.discipline | Computer Science & Applications | en |
| thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
| thesis.degree.level | doctoral | en |
| thesis.degree.name | Doctor of Philosophy | en |
Files
Original bundle
1 - 1 of 1