Training Physics-Guided Neural Networks with Multiple Constraints: An Application in Lake Ecology Modeling

dc.contributor.author: Pradhan, Aanish Kaustubh (en)
dc.contributor.committeechair: Karpatne, Anuj (en)
dc.contributor.committeemember: Carey, Cayelan C. (en)
dc.contributor.committeemember: Wang, Xuan (en)
dc.contributor.committeemember: Hanson, Paul C. (en)
dc.contributor.department: Computer Science & Applications (en)
dc.date.accessioned: 2025-05-24T08:01:09Z (en)
dc.date.available: 2025-05-24T08:01:09Z (en)
dc.date.issued: 2025-05-23 (en)
dc.description.abstract: Lakes and reservoirs are critical components of Earth's ecosystems but are increasingly threatened by climate change and human activity, underscoring the need for reliable tools for modeling and predicting lake ecology. While machine learning has shown potential in modeling such systems, sparse environmental data often limits the ability of machine learning models to produce physically consistent predictions or generalize to novel conditions. As a result, many existing approaches rely on computationally intensive physics-biogeochemical simulations to supplement training data. Physics-Guided Neural Networks (PGNN) offer a promising alternative by embedding scientific knowledge directly into the model through physical constraints applied during training. However, training these models at scale remains challenging due to the trade-offs between satisfying physical laws and fitting the data, often leading to optimization pathologies. This thesis explores the challenge of designing, training, and evaluating PGNNs with up to six constraints without relying on auxiliary simulation data. We assemble a suite of physics-based constraints grounded in limnological principles and evaluate their impact on neural network predictions by assessing within-distribution and zero-shot performance. To navigate the challenge of training with multiple constraints, we explore the use of multitask learning methods to counteract gradient pathologies that arise when training PGNNs. Our results suggest that multitask learning approaches can improve in-distribution performance in certain architectures, but they do not enhance zero-shot performance compared to unconstrained models. Our findings highlight the inherent complexity of scaling PGNNs and emphasize the need for principled training methodologies in data-scarce modeling contexts. (en)
dc.description.abstractgeneral: Lakes and reservoirs play a vital role in supporting biodiversity, providing freshwater, and regulating the environment. As these ecosystems face increasing stress from climate change and human activity, it is critical to develop reliable tools for modeling lake conditions such as oxygen levels, water temperature, and algae growth. While machine learning has been shown to be a promising approach, these methods rely on large amounts of data for training. However, data from environmental systems is often sparse, which causes models to struggle with generating physically consistent predictions or generalizing to unseen situations. While past approaches have leveraged physics-biogeochemical models to simulate the lake ecosystem and generate more data, this approach can be computationally expensive. This thesis explores the use of physics-guided neural networks (PGNN), a class of machine learning models that incorporate scientific knowledge to improve accuracy and realism and that excel in data-scarce situations. However, training these models can be challenging, as the model may struggle to balance obeying the physical knowledge against fitting the data. To navigate this, we leverage multitask learning methods to assist the models during training. Our results demonstrate that applying these methods may show promise in training PGNNs at scale, albeit only with certain model architectures. Furthermore, our results show that while PGNNs trained with multiple constraints may predict better on the data they are trained on, they fail to generalize to unseen data compared to models trained without physical knowledge. These findings highlight the complexity of combining physics and machine learning at scale to support lake and reservoir ecosystem modeling. (en)
dc.description.degree: Master of Science (en)
dc.format.medium: ETD (en)
dc.identifier.other: vt_gsexam:43605 (en)
dc.identifier.uri: https://hdl.handle.net/10919/134207 (en)
dc.language.iso: en (en)
dc.publisher: Virginia Tech (en)
dc.rights: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (en)
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/ (en)
dc.subject: Constrained optimization (en)
dc.subject: Ecosystem modeling (en)
dc.subject: Multitask Learning (en)
dc.subject: Physics-guided neural networks (en)
dc.title: Training Physics-Guided Neural Networks with Multiple Constraints: An Application in Lake Ecology Modeling (en)
dc.type: Thesis (en)
thesis.degree.discipline: Computer Science & Applications (en)
thesis.degree.grantor: Virginia Polytechnic Institute and State University (en)
thesis.degree.level: masters (en)
thesis.degree.name: Master of Science (en)

Files

Original bundle
Name: Pradhan_AK_T_2025.pdf
Size: 5.76 MB
Format: Adobe Portable Document Format
