...
首页> 外文期刊>Water Research >Developing a framework for classifying water lead levels at private drinking water systems: A Bayesian Belief Network approach
【24h】

Developing a framework for classifying water lead levels at private drinking water systems: A Bayesian Belief Network approach

机译:制定私人饮用水系统中的水引线水平框架:贝叶斯信仰网络方法

获取原文
获取原文并翻译 | 示例

摘要

The presence of lead in drinking water creates a public health crisis, as lead causes neurological damage at low levels of exposure. The objective of this research is to explore modeling approaches to predict the risk of lead at private drinking water systems. This research uses Bayesian Network approaches to explore interactions among household characteristics, geological parameters, observations of tap water, and laboratory tests of water quality parameters. A knowledge discovery framework is developed by integrating methods for data discretization, feature selection, and Bayes classifiers. Forward selection and backward selection are explored for feature selection. Discretization approaches, including domain-knowledge, statistical, and information-based approaches, are tested to discretize continuous features. Bayes classifiers that are tested include General Bayesian Network, Naive Bayes, and Tree-Augmented Naive Bayes, which are applied to identify Directed Acyclic Graphs (DAGs). Bayesian inference is used to fit conditional probability tables for each DAG. The Bayesian framework is applied to fit models for a dataset collected by the Virginia Household Water Quality Program (VAHWQP), which collected water samples and conducted household surveys at 2,146 households that use private water systems, including wells and springs, in Virginia during 2012 and 2013. Relationships among laboratory-tested water quality parameters, observations of tap water, and household characteristics, including plumbing type, source water, household location, and on-site water treatment are explored to develop features for predicting water lead levels. Results demonstrate that Naive Bayes classifiers perform best based on recall and precision, when compared with other classifiers. Copper is the most significant predictor of lead, and other important predictors include county, pH, and on-site water treatment. Feature selection methods have a marginal effect on performance, and discretization methods can greatly affect model performance when paired with classifiers. Owners of private wells remain disadvantaged and may be at an elevated level of risk, because utilities and governing agencies are not responsible for ensuring that lead levels meet the Lead and Copper Rule for private wells. Insight gained from models can be used to identify water quality parameters, plumbing characteristics, and household variables that increase the likelihood of high water lead levels to inform decisions about lead testing and treatment. (C) 2020 Elsevier Ltd. All rights reserved.
机译:由于铅导致低水平的暴露,铅在饮用水中的存在造成公共卫生危机。本研究的目的是探索建模方法,以预测私人饮用水系统铅的风险。该研究使用贝叶斯网络方法来探索家庭特征,地质参数,自来水观测的相互作用以及水质参数的实验室测试。通过集成数据离散化,特征选择和贝叶斯分类器的方法来开发知识发现框架。探索了Forward选择和倒退选择以进行功能选择。测试包括领域知识,统计和基于信息的方法的离散化方法以使连续功能分开。经过测试的贝叶斯分类器包括普通贝叶斯网络,幼稚贝叶斯和树增强的天真贝叶斯,其应用于识别定向的非循环图(DAG)。贝叶斯推断用于适合每个DAG的条件概率表。贝叶斯框架适用于弗吉尼亚州家庭水质计划(VAHWQP)收集的数据集的模型,该模型收集了水样,并在2012年期间在弗吉尼亚州使用私人水系统,包括私人水系统的2,146户家庭进行家庭调查2013年,实验室测试水质参数,自来水观察和家庭特征的关系,包括管道型,源水,家庭位置和现场水处理,以开发预测水铅水平的特点。结果表明,与其他分类器相比,幼稚贝叶斯分类器基于召回和精度最佳。铜是铅的最重要的预测因子,其他重要的预测因子包括县,pH和现场水处理。特征选择方法对性能有边缘效果,并且在与分类器配对时,可以极大地影响模型性能。私人井的所有者仍处于不利地位,可能处于风险水平提高,因为公用事业和管理机构不对确保私营井的铅和铜规则负责。模型中获得的洞察力可用于识别水质参数,管道特征和家庭变量,从而提高高水位的可能性,以提供关于铅测试和治疗的决策。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号