首页> 外文期刊>Environmental Monitoring and Assessment >How do data-mining models consider arsenic contamination in sediments and variables importance?
【24h】

How do data-mining models consider arsenic contamination in sediments and variables importance?

机译:数据挖掘模型如何考虑沉积物中的砷污染和变量重要性?

获取原文
获取原文并翻译 | 示例
       

摘要

Arsenic (As) is one of the most important dangerous elements as more than 100 million of people are exposed to risk, globally. The permissible threshold of As for drinking water is 10 mu g/L according to both the WHO's drinking water guidelines and the Iranian national standard. However, several studies have indicated that As concentrations exceed this threshold value in several regions of Iran. This research evaluates an As-susceptible region, the Tajan River watershed, using the following data-mining models: multivariate adaptive regression splines (MARS), functional data analysis (FDA), support vector machine (SVM), generalized linear model (GLM), multivariate discriminant analysis (MDA), and gradient boosting machine (GBM). This study considers 12 factors for elevated As concentrations: land use, drainage density, profile curvature, plan curvature, slope length, slope degree, topographic wetness index, erosion, village density, distance from villages, precipitation, and lithology. The susceptibility mapping was conducted using training (70%) and validation (30%). The results of As contamination in sediment showed that classifications into 4 levels of concentration are very similar for two models of GLM and FDA. The GBM calculated the areas of highest arsenic contamination risk by MARS and SVM with percentages of 30.0% and 28.7%, respectively. FDA, GLM, MARS, and MDA models calculated the areas of lowest risk to be 3.3%, 23.0%, 72.0%, 25.2%, and 26.1%, respectively. The results of ROC curve reveal that the MARS, SVM, and MDA had the highest accuracies with area under the curve ROC values of 84.6%, 78.9%, and 79.5%, respectively. Land use, lithology, erosion, and elevation were the most important predictors of contamination potential with a value of 0.6, 0.59, 0.57, and 0.56, respectively. These are the most important factors. Finally, these data-mining methods can be used as appropriate, inexpensive, and feasible options to identify As-susceptible areas and can guide managers to reduce contamination in sediment of the environment and the food chain.
机译:砷(As)是最重要的危险元素之一,全球有1亿多人面临这种​​风险。根据世界卫生组织的饮用水准则和伊朗国家标准,饮用水中砷的允许阈值为10μg / L。但是,一些研究表明,伊朗几个地区的砷浓度超过了该阈值。这项研究使用以下数据挖掘模型评估了一个易感区域Tajan河的分水岭:多元自适应回归样条(MARS),功能数据分析(FDA),支持向量机(SVM),广义线性模型(GLM) ,多元判别分析(MDA)和梯度增强机(GBM)。这项研究考虑了提高砷浓度的12个因素:土地利用,排水密度,剖面曲率,平面曲率,边坡长度,边坡度,地形湿度指数,侵蚀,村庄密度,与村庄的距离,降水和岩性。使用培训(70%)和验证(30%)进行了敏感性图。沉积物中砷污染的结果表明,对于两种模型的GLM和FDA,对4种浓度水平的分类非常相似。 GBM通过MARS和SVM计算出最高的砷污染风险区域,分别为30.0%和28.7%。 FDA,GLM,MARS和MDA模型计算出的最低风险区域分别为3.3%,23.0%,72.0%,25.2%和26.1%。 ROC曲线的结果表明,MARS,SVM和MDA的精度最高,曲线下ROC值分别为84.6%,78.9%和79.5%。土地利用,岩性,侵蚀和海拔高度是污染潜力的最重要预测指标,分别为0.6、0.59、0.57和0.56。这些是最重要的因素。最后,这些数据挖掘方法可以用作识别易感区域的适当,廉价且可行的选择,并可以指导管理人员减少环境和食物链沉积物中的污染。

著录项

  • 来源
    《Environmental Monitoring and Assessment》 |2019年第12期|777.1-777.19|共19页
  • 作者单位

    Tarbiat Modares Univ Fac Nat Resources Dept Watershed Management & Engn Tehran Iran;

    Tarbiat Modares Univ Fac Nat Resources Dept Watershed Management & Engn Tehran Iran|Lund Univ Ctr Middle Eastern Studies Lund Sweden|Lund Univ Dept Water Resources Engn Lund Sweden;

    Shiraz Univ Coll Agr Dept Nat Resources & Environm Engn Shiraz Iran;

    KTH Royal Inst Technol KTH Int Groundwater Arsen Res Grp Dept Sustainable Dev Environm Sci & Engn Teknikringen 10B SE-10044 Stockholm Sweden|Univ Southern Queensland Sch Civil Engn & Surveying West St Toowoomba Qld 4350 Australia|Univ Southern Queensland Int Ctr Appl Climate Sci West St Toowoomba Qld 4350 Australia;

    Texas State Univ Dept Geog San Marcos TX 78666 USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Arsenic; Data-mining; GIS-based mapping; LVQ; Human health; Iran;

    机译:砷;数据挖掘;基于GIS的制图;LVQ;人类健康;伊朗;
  • 入库时间 2022-08-18 05:04:30

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号