首页> 外文期刊>Water Environment Research >Comparison of the performance of decision tree (DT) algorithms and extreme learning machine (ELM) model in the prediction of water quality of the Upper Green River watershed
【24h】

Comparison of the performance of decision tree (DT) algorithms and extreme learning machine (ELM) model in the prediction of water quality of the Upper Green River watershed

机译:决策树(DT)算法和极端学习机(ELM)模型在上绿河流域水质预测中的性能比较

获取原文
获取原文并翻译 | 示例
           

摘要

Stream waters play a crucial role in catering to the world's needs with the required quality of water. Due to the discharges of wastewater from the various point and nonpoint sources, most of the watersheds are contaminated easily. The Upper Green River watershed in Kentucky, USA, is one such watershed that is contaminated over the years due to the runoff from rural areas and agricultural lands and combined sewer overflows (CSOs) from urban areas. Monitoring and characterizing the water quality status of streams in such watersheds has become of great importance, with multivariate statistical techniques such as regression, factor analysis, cluster analysis, and artificial intelligence methods such as artificial neural networks (ANNs). The water quality parameters, namely, fecal coliform (FC), turbidity, pH, and conductivity have been predicted quantitatively using ANNs to understand the water quality status of streams in the Upper Green River watershed elsewhere. In this study, a novel attempt has been made to predict the status of the quality of the Green River water with the predictive capabilities of a few decision tree (DT) algorithms such as classification and regression tree (CART) model, multivariate adaptive regression splines (MARS) model, random forest (RF) model, and extreme learning machine (ELM) model. The RF model's performance is better in predicting FC, turbidity, and pH than CART models in training and testing phases. Relatively, MARS and ELM models did better in testing though the performance is poorer in training. For example, we obtain the RMSE values of 2206, 2532, 1533, and 1969 using RF, CART, MARS, and ELM for FC in testing. A good correlation has been observed between conductivity and temperature, precipitation, and land-use factors for the MARS model. Overall, DT models are helpful in understanding, interpreting the outcomes, and visualizing the results compared with the other models. Practitioner points The prediction of stream water quality parameters using decision trees is explored. The climate and land use parameters are used as input parameters to the modeling. The DT models of CART, MARS, RF, and ANNs such as ELM are explored to predict stream water quality. The RF model shows stable results compared with CART, MARS, and ELM for the data explored. Apart from the R-2 value, RMSE and MAE indicate the effectiveness of DTs in prediction.
机译:流水域在满足世界的需求方面发挥着至关重要的作用。由于来自各种点和非点源的废水排放,大部分流域都容易被污染。美国肯塔基州的上绿河流域是多年来,由于农村地区和农业土地的径流量,从城市地区溢出(CSOS)溢出(CSOS)的径流,这是一个这样的流域。监测和表征这种流域的流的水质状况具有重要意义,具有多元统计技术,如回归,因子分析,集群分析和人工智能方法,如人工神经网络(ANNS)。使用ANN定量使用ANN来预测水质参数,即粪大肠菌菌(Fc),浊度,pH和电导率,以了解其他地方上绿河流域中的溪流的水质状态。在这项研究中,已经进行了一种新的尝试,以预测绿河水质量的状态与一些决定树(DT)算法的预测能力,例如分类和回归树(推车)模型,多变量自适应回归样条(火星)模型,随机林(RF)模型和极限机械(榆树)模型。 RF模型的性能更好地预测FC,浊度和pH,而不是训练和测试阶段的推车模型。相对而言,火星和榆树模型在测试方面做得更好,但培训表现较差。例如,我们使用RF,推车,火星和ELM获得2206,2532,1533和1969的RMS值,用于测试FC。在火星模型的电导率和温度,降水量和土地利用因子之间观察到良好的相关性。总体而言,与其他模型相比,DT模型有助于理解,解释结果,并将结果可视化。从业者指出使用决策树的流水质参数的预测。气候和土地使用参数用作建模的输入参数。探讨了推车,火星,RF和ANN的DT模型,以预测流水质量。 RF模型显示稳定的结果与推车,火星和ELM进行探索的数据相比。除了R-2值外,RMSE和MAE表示DTS在预测中的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号