...
首页> 外文期刊>Water Research >Using machine learning classification to detect simulated increases of de facto reuse and urban stormwater surges in surface water
【24h】

Using machine learning classification to detect simulated increases of de facto reuse and urban stormwater surges in surface water

机译:使用机器学习分类来检测地表水中事实上重用和城市雨水潮的模拟增加

获取原文
获取原文并翻译 | 示例
           

摘要

Water quality events such as increases in stormwater or wastewater effluent in drinking water sources pose hazards to drinking water consumers. Stormwater and wastewater effluent enter Lake Mead-an important drinking water source in the southwest USA-via the Las Vegas Wash. Previous studies have applied machine learning and online instruments to detect contamination in water distribution systems. However, alert systems at drinking water intakes would provide more time for corrective action. An array of online instruments measuring pH, conductivity, redox potential, turbidity, temperature, tryptophan-like fluorescence, UV absorbance (UVA(254)), TOC, and chlorophyll-a was fed raw water directly from Lake Mead. Wastewater effluent, dry weather Las Vegas Wash, and storm-impacted Las Vegas Wash samples were blended into the instrument inlets at known ratios to simulate three types of adverse water quality events. Data preprocessing was conducted to correct for diurnal patterns or instrument drift. Supervised machine learning was conducted using previously published models in R. Ninety-nine models were screened on the raw data. Eight high-performing models were evaluated in-depth and optimized. Weighted k-Nearest Neighbors, Single C5.0 Ruleset, Mixture Discriminant Analysis, and an ensemble of these three models had accuracy over 97% when assigning test set data among three classes (Normal, Event, or Maintenance). The ensemble detected all event types at the earliest timepoint and had one false positive that was not a lag error (i.e., consecutively following a true positive). Omitting Maintenance, the Adaboost model had over 99% test set accuracy and zero false positives that were not lag errors. Data preprocessing was beneficial, but the optimal methods were model-specific. All nine water quality variables were useful for most models, but UVA(254) and turbidity were most important.
机译:水质事件,如雨水或废水流出中的增加饮用水源在饮用水消费者造成危险。雨水和废水污水进入米德湖 - 美国西南部的重要饮用水来源 - 通过拉斯维加斯洗涤。以前的研究已经应用了机器学习和在线仪器,以检测水分配系统中的污染。但是,饮用水中的警报系统将为纠正措施提供更多时间。一系列在线仪器测量pH,电导率,氧化还原电位,浊度,温度,色氨酸样荧光,紫外光吸收(UVA(254)),TOC和叶绿素-A直接从乳房喂养水。废水污水,干燥天气Las Vegas洗涤,并将风暴冲击拉斯维加斯洗涤样品混合到已知比例的仪器入口中,以模拟三种类型的不利水质事件。进行数据预处理以校正昼夜图案或仪器漂移。通过先前发布的R.九十九种模型进行了监督机器学习在原始数据上筛选了九十九种模型。深入评估八种高性能模型并进行优化。加权K-Collect邻居,单C5.0规则集,混合判别分析,以及这三种模型的集合在分配三个类别(正常,事件或维护)之间的测试集数据时具有超过97%的准确性。该集合在最早的时间点处检测到所有事件类型,并且具有一个假阳性,这不是滞后错误(即,连续遵循真正的正面)。省略维护,Adaboost模型有超过99%的测试设置精度和零误报,而不是滞后错误。数据预处理是有益的,但最佳方法是特定于模型的。所有九个水质变量对于大多数型号都很有用,但UVA(254)和浊度最为重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号