A machine learning framework to predict PPCP removal through various wastewater and water reuse treatment trains

dc.contributor.authorChoi, Joung Minen
dc.contributor.authorManthapuri, Vineethen
dc.contributor.authorKeenum, Ishien
dc.contributor.authorBrown, Connor L.en
dc.contributor.authorXia, Kangen
dc.contributor.authorChen, Chaoqien
dc.contributor.authorVikesland, Peter J.en
dc.contributor.authorBlair, Matthew F.en
dc.contributor.authorBott, Charlesen
dc.contributor.authorPruden, Amyen
dc.contributor.authorZhang, Liqingen
dc.date.accessioned2025-10-08T19:17:16Zen
dc.date.available2025-10-08T19:17:16Zen
dc.date.issued2025-01-30en
dc.description.abstractThe persistence of pharmaceuticals and personal care products (PPCPs) through wastewater treatment and resulting contamination of aquatic environments and drinking water is a pervasive concern, necessitating means of identifying effective treatment strategies for PPCP removal. In this study, we employed machine learning (ML) models to classify 149 PPCPs based on their chemical properties and predict their removal via wastewater and water reuse treatment trains. We evaluated two distinct clustering approaches: C1 (clustering based on the most efficient individual treatment process) and C2 (clustering based on the removal pattern of PPCPs across treatments). For this, we grouped PPCPs based on their relative abundances by comparing peak areas measured via non-target profiling using ultra-performance liquid chromatography-tandem mass spectrometry through two field-scale treatment trains. The resulting clusters were then classified using Abraham descriptors and log Kow as input to the three ML models: support vector machines (SVM), logistic regression, and random forest (RF). SVM achieved the highest accuracy, 79.1%, in predicting PPCP removal. Notably, a 58-75% overlap was observed between the ML clusters of PPCPs and the Abraham descriptor and log Kow clusters of PPCPs, indicating the potential of using Abraham descriptors and log Kow to predict the fate of PPCPs through various treatment trains. Given the myriad of PPCPs of concern, this approach can supplement information gathered from experimental testing to help optimize the design of wastewater and water reuse treatment trains for PPCP removal.en
dc.description.sponsorshipNational Science Foundation [2004751, 2125798]; U.S. National Science Foundation [75D30118C02904]; U.S. Centers for Disease Control and Prevention [R21AC10162-01]; US Bureau of Reclamation grant [R840619]; EPA; Hampton Roads Sanitation District; Pilot Program to Enhance NIH; COE at Virginia Techen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1039/d4ew00892hen
dc.identifier.eissn2053-1419en
dc.identifier.issn2053-1400en
dc.identifier.issue2en
dc.identifier.pmid39758590en
dc.identifier.urihttps://hdl.handle.net/10919/138103en
dc.identifier.volume11en
dc.language.isoenen
dc.publisherRoyal Society Chemistryen
dc.rightsCreative Commons Attribution 3.0 Unporteden
dc.rights.urihttps://creativecommons.org/licenses/by/3.0/en
dc.titleA machine learning framework to predict PPCP removal through various wastewater and water reuse treatment trainsen
dc.title.serialEnvironmental Science-Water Research & Technologyen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ChoiMachine.pdf
Size:
1.83 MB
Format:
Adobe Portable Document Format
Description:
Published version