Establishing performance criteria for evaluating watershed-scale sediment and nutrient models at fine temporal scales

dc.contributor.authorPandit, Aayushen
dc.contributor.authorHogan, Sarahen
dc.contributor.authorMahoney, David T.en
dc.contributor.authorFord, William I.en
dc.contributor.authorFox, James F.en
dc.contributor.authorWellen, Christopheren
dc.contributor.authorHusic, Adminen
dc.date.accessioned2025-02-19T17:56:04Zen
dc.date.available2025-02-19T17:56:04Zen
dc.date.issued2025-01-18en
dc.description.abstractWatershed water quality models are mathematical tools used to simulate processes related to water, sediment, and nutrients. These models provide a framework that can be used to inform decision-making and the allocation of resources for watershed management. Therefore, it is critical to answer the question “when is a model good enough?” Established performance evaluation criteria, or thresholds for what is considered a ‘good’ model, provide common benchmarks against which model performance can be compared. Since the publication of prior meta-analyses on this topic, developments in the last decade necessitate further investigation, such as the advancement in high performance computing, the proliferation of aquatic sensors, and the development of machine learning algorithms. We surveyed the literature for quantitative model performance measures, including the Nash-Sutcliffe efficiency (NSE), with a particular focus on process-based models operating at fine temporal scales as their performance evaluation criteria are presently underdeveloped. The synthesis dataset was used to assess the influence of temporal resolution (sub-daily, daily, and monthly), calibration duration (< 3 years, 3 to 8 years, and > 8 years), and constituent target units (concentration, load, and yield) on model performance. The synthesis dataset includes 229 model applications, from which we use bootstrapping and personal modeling experience to establish sub-daily and daily performance evaluation criteria for flow, sediment, total nutrient, and dissolved nutrient models. For daily model evaluation, the NSE for sediment, total nutrient, and dissolved nutrient models should exceed 0.45, 0.30, and 0.35, respectively, for ‘satisfactory’ performance. Model performance generally improved when transitioning from short (< 3 years) to medium (3 to 8 years) calibration durations, but no additional gain was observed with longer (> 8 years) calibration. Dissolved nutrient models calibrated to load (e.g., kg/s) out-performed those calibrated to concentration (e.g., mg/L), whereas selection of target units was not significant for sediment and total nutrient models. We recommend the use of concentration rather than load as a water quality modeling target, as load may be biased by strong flow model performance whereas concentration provides a flow-independent measure of performance. Although the performance criteria developed herein are based on process-based models, they may be useful in assessing machine learning model performance. We demonstrate one such assessment on a recent deep learning model of daily nitrate prediction across the United States. The guidance presented here is intended to be used alongside, rather than to replace, the experience and modeling judgement of engineers and scientist who work to maintain our collective water resources.en
dc.description.versionPublished versionen
dc.format.extent13 page(s)en
dc.format.mimetypeapplication/pdfen
dc.identifierARTN 123156 (Article number)en
dc.identifier.doihttps://doi.org/10.1016/j.watres.2025.123156en
dc.identifier.eissn1879-2448en
dc.identifier.issn0043-1354en
dc.identifier.orcidHusic, Admin [0000-0002-4225-2252]en
dc.identifier.otherS0043-1354(25)00070-3 (PII)en
dc.identifier.pmid39955106en
dc.identifier.urihttps://hdl.handle.net/10919/124648en
dc.identifier.volume274en
dc.language.isoenen
dc.publisherPergamon-Elsevieren
dc.relation.urihttps://www.ncbi.nlm.nih.gov/pubmed/39955106en
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectWater qualityen
dc.subjectWatershed modelen
dc.subjectPerformance measureen
dc.subjectEvaluation criteriaen
dc.subjectSedimenten
dc.subjectNutrientsen
dc.subject.meshGeologic Sedimentsen
dc.subject.meshModels, Theoreticalen
dc.subject.meshWater Qualityen
dc.subject.meshNutrientsen
dc.titleEstablishing performance criteria for evaluating watershed-scale sediment and nutrient models at fine temporal scalesen
dc.title.serialWater Researchen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
dc.type.otherArticleen
dc.type.otherJournalen
dcterms.dateAccepted2025-01-15en
pubs.organisational-groupVirginia Techen
pubs.organisational-groupVirginia Tech/Engineeringen
pubs.organisational-groupVirginia Tech/Engineering/Civil & Environmental Engineeringen
pubs.organisational-groupVirginia Tech/All T&R Facultyen
pubs.organisational-groupVirginia Tech/Engineering/COE T&R Facultyen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Pandit-et-al-2025-WR.pdf
Size:
2.23 MB
Format:
Adobe Portable Document Format
Description:
Published version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Plain Text
Description: