Process-Guided Deep Learning Predictions of Lake Water Temperature
dc.contributor.author | Read, Jordan S. | en |
dc.contributor.author | Jia, Xiaowei | en |
dc.contributor.author | Willard, Jared | en |
dc.contributor.author | Appling, Alison P. | en |
dc.contributor.author | Zwart, Jacob A. | en |
dc.contributor.author | Oliver, Samantha K. | en |
dc.contributor.author | Karpatne, Anuj | en |
dc.contributor.author | Hansen, Gretchen J. A. | en |
dc.contributor.author | Hanson, Paul C. | en |
dc.contributor.author | Watkins, William | en |
dc.contributor.author | Steinbach, Michael | en |
dc.contributor.author | Kumar, Vipin | en |
dc.contributor.department | Computer Science | en |
dc.date.accessioned | 2020-06-12T17:46:06Z | en |
dc.date.available | 2020-06-12T17:46:06Z | en |
dc.date.issued | 2019-11-08 | en |
dc.description.abstract | The rapid growth of data in water resources has created new opportunities to accelerate knowledge discovery with the use of advanced deep learning tools. Hybrid models that integrate theory with state-of-the art empirical techniques have the potential to improve predictions while remaining true to physical laws. This paper evaluates the Process-Guided Deep Learning (PGDL) hybrid modeling framework with a use-case of predicting depth-specific lake water temperatures. The PGDL model has three primary components: a deep learning model with temporal awareness (long short-term memory recurrence), theory-based feedback (model penalties for violating conversation of energy), and model pretraining to initialize the network with synthetic data (water temperature predictions from a process-based model). In situ water temperatures were used to train the PGDL model, a deep learning (DL) model, and a process-based (PB) model. Model performance was evaluated in various conditions, including when training data were sparse and when predictions were made outside of the range in the training data set. The PGDL model performance (as measured by root-mean-square error (RMSE)) was superior to DL and PB for two detailed study lakes, but only when pretraining data included greater variability than the training period. The PGDL model also performed well when extended to 68 lakes, with a median RMSE of 1.65 degrees C during the test period (DL: 1.78 degrees C, PB: 2.03 degrees C; in a small number of lakes PB or DL models were more accurate). This case-study demonstrates that integrating scientific knowledge into deep learning tools shows promise for improving predictions of many important environmental variables. | en |
dc.description.admin | Public domain – authored by a U.S. government employee | en |
dc.description.notes | See supporting information for data access, extended methods details, and example code. See https://doi.org/10.5066/P9AQPIVD for this study's data release and https://doi.org/10.5281/zenodo.3497495 for the versioned code repository. This research was funded by the Department of the Interior Northeast and North Central Climate Adaptation Science Centers, a Midwest Glacial Lakes Fish Habitat Partnership grant through F&WS, NSF Expedition in Computing Grant 1029711 to the University of Minnesota, a postdoctoral fellowship awarded to J.A.Z. under NSF EAR-PF-1725386, and a seed grant from the Digital Technology Center at the University of Minnesota. Access to computing facilities was provided by the University of Minnesota Supercomputing Institute and USGS Advanced Research Computing, USGS Yeti Supercomputer (https://doi.org/10.5066/F7D798MJ). We thank North Temperate Lakes Long-Term Ecological Research (NSF DEB-1440297) and Global Lake Ecological Observatory Network (NSF 1702991) for modeling discussions and data sharing, and Arka Daw, Randy Hunt, Jeff Sadler, Emily Read, and Mike Fienen for the PGDL discussions and ideas. We thank Luke Winslow, Noah Lottig, Madeline Magee, and along with MN DNR and WI DNR for temperature and bathymetric data, with special thanks to Pete Jacobson, Katie Hein, and Madeline Humphrey for collating thousands of temperature records, and Dave Wolock, the editorial group at WRR, and three anonymous reviewers for input that was used to improve this paper. | en |
dc.description.sponsorship | Department of the Interior Northeast Climate Adaptation Science Center; Midwest Glacial Lakes Fish Habitat Partnership grant through FWS; NSF Expedition in Computing Grant [1029711]; NSFNational Science Foundation (NSF) [EAR-PF-1725386]; Digital Technology Center at the University of MinnesotaUniversity of Minnesota System; Department of the Interior North Central Climate Adaptation Science Center; North Temperate Lakes Long-Term Ecological Research [NSF DEB-1440297]; Global Lake Ecological Observatory Network [NSF 1702991] | en |
dc.format.mimetype | application/pdf | en |
dc.identifier.doi | https://doi.org/10.1029/2019WR024922 | en |
dc.identifier.eissn | 1944-7973 | en |
dc.identifier.issn | 0043-1397 | en |
dc.identifier.uri | http://hdl.handle.net/10919/98833 | en |
dc.language.iso | en | en |
dc.rights | Creative Commons CC0 1.0 Universal Public Domain Dedication | en |
dc.rights.uri | http://creativecommons.org/publicdomain/zero/1.0/ | en |
dc.subject | deep learning | en |
dc.subject | lake modelling | en |
dc.subject | temperature prediction | en |
dc.subject | process-guided deep learning | en |
dc.subject | theory-guided data science | en |
dc.subject | data science | en |
dc.title | Process-Guided Deep Learning Predictions of Lake Water Temperature | en |
dc.title.serial | Water Resources Research | en |
dc.type | Article - Refereed | en |
dc.type.dcmitype | Text | en |
dc.type.dcmitype | StillImage | en |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 2019WR024922.pdf
- Size:
- 1.75 MB
- Format:
- Adobe Portable Document Format
- Description: