首页> 外文会议>International Conference on Networked Computing and Advanced Information Management >Data Mining on Forecast Raw Water Quality from Online Monitoring Station Based on Decision-making Tree
【24h】

Data Mining on Forecast Raw Water Quality from Online Monitoring Station Based on Decision-making Tree

机译:基于决策树的在线监测站预测原始水质的数据挖掘

获取原文

摘要

The excessive propagation of algae caused by eutrophication of aquatic environment in the urban source water supply is the main issues of concern to drinking water purification industry in recent years. The prediction of algae in raw water can offer time guarantee for operation of contingency caused by excessive propagation of algae which can ensure the safety of water supply. In the study, we collected 115 daily measured data about indirect monitoring of raw water quality of algae and solar irradiance data from online monitoring and direct-line artificial monitoring of chlorophyll content of raw water. We select decision-making tree which is very visible and easy realized as data mining tools, and set up decision-making tree model which is used to predict the level of chlorophyll in raw water in next day. To enable online monitoring data and artificial monitoring data with the same dimension, combined with the algal growth dynamics, we transform several on-line monitoring data of dissolved oxygen and solar irradiance data in one day into one data per day, that is mean calculating the average standard deviation and average. The former 100 sets of data are used to train and set up decision-making tree model which is to predict the level of chlorophyll in next day. The rest 15 sets of data are used to test data. The results of simulation show that the prediction accuracy can reach 80%.
机译:城市资源供水中水生环境富营养化引起的藻类过度繁殖是近年来饮用水净化行业关注的主要问题。原水中藻类的预测可以为藻类过度繁殖引起的应急性运行提供时间保证,这可以保证供水的安全性。在研究中,我们收集了115个关于间接监测藻类和太阳辐照数据的间接监测的每日测量数据,从在线监测和直线人工监测原水叶绿素含量的直接线。我们选择决策树,这是非常可见的树,其作为数据挖掘工具非常明显,并且容易实现为数据挖掘工具,并建立决策制作树模型,用于预测第二天生水中叶绿素水平。为了使在线监测数据和具有相同维度的人工监测数据,与藻类生长动态相结合,我们将溶解氧和太阳辐照度数据的几个在线监测数据转换为每天一个数据,这是平均计算的平均标准偏差和平均值。前者100套数据用于培训和建立决策树模型,该树模型是预测第二天叶绿素的水平。其余15组数据用于测试数据。仿真结果表明,预测精度可达到80%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号