首页> 外文会议>Industrial conference on data mining >Induction of Model Trees for Predicting BOD in River Water: A Data Mining Perspective
【24h】

Induction of Model Trees for Predicting BOD in River Water: A Data Mining Perspective

机译:数据挖掘视角下的河水生化需氧量预测模型树的归纳

获取原文

摘要

Water is a primary natural resource and its quality is negatively affected by various anthropogenic activities. Deterioration of water bodies has triggered serious management efforts by many countries. BOD is an important water quality parameter as it measures the amount of biodegradable organic matter in water. Testing for BOD is a time-consuming task as it takes 5 days from data collection to analyzing with lengthy incubation of samples. Also, interpolations of BOD results and their implications are mired in uncertainties. So, there is a need for suitable secondary (indirect) method for predicting BOD. A model tree for predicting BOD in river water from a data mining perspective is proposed in this paper. The proposed model is also compared with two other tree based predictive methods namely decision stump and regression trees. The predictive accuracy of the models is evaluated using two metrics namely correlation coefficient and RMSE. Results show that the model tree has a correlation coefficient of 0.9397 which is higher than the other two methods. It also has the least RMSE of 0.5339 among these models.
机译:水是一种主要的自然资源,其水质受到各种人为活动的负面影响。水体的恶化引发了许多国家的认真管理努力。 BOD是重要的水质参数,因为它可以测量水中可生物降解的有机物的量。对BOD的测试是一项耗时的工作,因为从数据收集到长时间孵育样品需要5天。此外,BOD结果的插值及其含义也陷入不确定性中。因此,需要合适的辅助(间接)方法来预测BOD。提出了一种从数据挖掘的角度预测河水中生化需氧量的模型树。还将所提出的模型与其他两种基于树的预测方法(决策树和回归树)进行比较。使用两个指标(相关系数和RMSE)评估模型的预测准确性。结果表明,模型树的相关系数为0.9397,高于其他两种方法。在这些型号中,它的RMSE最小,为0.5339。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号