首页> 外文期刊>Journal of hydro-environment research >Artificial Neural Network ensemble modeling with conjunctive data clustering for water quality prediction in rivers
【24h】

Artificial Neural Network ensemble modeling with conjunctive data clustering for water quality prediction in rivers

机译:联合数据聚类的人工神经网络集成模型在河流水质预测中的应用。

获取原文
获取原文并翻译 | 示例
           

摘要

The Artificial Neural Network (ANN) is a powerful data-driven model that can capture and represent both linear and non-linear relationships between input and output data. Hence, ANNs have been widely used for the prediction and forecasting of water quality variables, to treat the uncertainty of contaminant source, and nonlinearity of water quality data. However, the initial weight parameter problem and imbalanced training data set make it difficult to assess the optimality of the results obtained, and impede the performance of ANN modeling. This study attempted to employ the ensemble modeling technique to estimate the performance of the ANN without the influence of initial weight parameters on the model results, and to apply several clustering methods, to alleviate the imbalance of the training data set. An ANN ensemble model was developed, and applied to forecast the water quality variables, pH, DO, turbidity (Turb), TN, and TP, at Sangdong station, on the Nakdong River. The optimal ANN models for each water quality variable could be selected from the ensemble modeling. The optimal ANN models for pH, DO, TN, and TP, of which the training target data set was distributed evenly, showed good results, with R squared higher than 0.90. But the ANN model for Turb, of which the training data set was imbalanced, showed large RMSE (11.8 NTU), and low R squared (0.58). The training data set of Turb was partitioned into several classes, by conjunctive clustering methods according to the patterns of data set for each number of clusters. The ANN ensemble models for Turb with the clustered training data set (clustered ANN models) were then developed. All clustered ANN models for Turb showed better results, than the model without clustering. In particular, the three-clustered ANN model showed an increase of R squared from 0.58 to 0.88, and a decrease of total RMSE from 11.8 NTU to 6.3 NTU. (C) 2015 International Association for Hydro-environment Engineering and Research, Asia Pacific Division. Published by Elsevier B.V. All rights reserved.
机译:人工神经网络(ANN)是功能强大的数据驱动模型,可以捕获并表示输入和输出数据之间的线性和非线性关系。因此,人工神经网络已被广泛用于水质变量的预测和预测,以处理污染物来源的不确定性和水质数据的非线性。然而,初始权重参数问题和不平衡的训练数据集使得难以评估所获得结果的最优性,并阻碍了ANN建模的性能。这项研究试图采用集成建模技术来估计ANN的性能,而不会影响初始权重参数对模型结果的影响,并尝试采用多种聚类方法来减轻训练数据集的不平衡性。建立了ANN集成模型,并将其用于预测那东河桑东站的水质变量,pH,DO,浊度(Turb),TN和TP。每个水质变量的最佳人工神经网络模型可以从整体模型中选择。 pH,DO,TN和TP的最佳ANN模型(训练目标数据集均匀分布)显示出良好的结果,R平方高于0.90。但是训练数据集不平衡的Turb的ANN模型显示,RMSE大(11.8 NTU),R平方低(0.58)。根据每个群集数的数据集模式,通过联合聚类方法将Turb的训练数据集划分为几个类别。然后使用聚类的训练数据集(集群的ANN模型)开发了Turb的ANN集成模型。与未聚类的模型相比,Turb的所有聚类ANN模型都显示出更好的结果。特别是,三簇ANN模型显示R平方从0.58增加到0.88,总RMSE从11.8 NTU减少到6.3 NTU。 (C)2015年国际水环境工程与研究协会亚太分会。由Elsevier B.V.发布。保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号