首页> 外文会议>International Conference of ICT for Adapting Agriculture to Climate Change >Knowledge Inference from a Small Water Quality Dataset with Multivariate Statistics and Data-Mining
【24h】

Knowledge Inference from a Small Water Quality Dataset with Multivariate Statistics and Data-Mining

机译:来自具有多元统计和数据采矿的小水质数据集的知识推断

获取原文

摘要

Multivariate analysis (MV) and data mining (DM) techniques were applied to a small water quality dataset obtained from the surface waters at three water quality monitoring stations in the Petaquilla River Basin, Panama, during the hydrological period of 2008 through 2011 for the assessment and understanding of the ongoing environmental stress within the river basin. From Factor Analysis (PCA/FA), results indicated that the factors which changed the quality of the water for the two seasons differed. During the dry (low flows) season, water quality showed to be strongly influenced by turbidity (NTU) and total suspended solids (TSS) concentrations. In contrast, during the wet (high flows) season the main changes on water quality sources were characterized by an inverse relation of NTU and TSS with the electrical conductivity (EC) and chlorides (CL), followed by significant sources of agricultural pollution. To complement the MV analysis, DM techniques like cluster analysis (CA) and classification (CLA) was applied to the data. Cluster analysis was used to separate the stations based on their levels of pollution and the classification of stations was implemented by C5.0 algorithm to classify stations of unknown origin into one of the several known groups of water quality constituents. The study demonstrated that the major water pollution threats to the Petaquilla River Basin are industrial and urban development in character and uses of agricultural and grazing land which are defined as non-point sources. The use of DM techniques was to complement the MV analysis. Taking into account the limited data, the usage of these methodologies is regarded useful in aiding water managers for implementing water monitoring campaigns and in setting priorities for improving and protecting water quality sources that are impaired due to land disturbances from anthropogenic activities.
机译:多变量分析(MV)和数据挖掘(DM)技术是通过2011施加到在三个水质监测站在Petaquilla流域,巴拿马,在2008年水文期间从表层水而获得的小水质数据集的评估和理解流域内正在发生的环境压力。因素分析(PCA / FA),结果表明,其改变了水的质量为两个季节的因素不同。在干(低流量)的季节,水质显示由浊度(NTU)的强烈影响和总悬浮固体(TSS)的浓度。与此相反,湿(高流量)期间季节对水质源的主要变化是由NTU和TSS与电导率(EC)和氯化物(CL)的反比关系,其特征在于,随后农业污染的显著源。为了补充MV分析,DM技术,例如聚类分析(CA)和分类(CLA)被施加到数据。用于分离基于其污染程度和地点分类由C5.0算法来实现对来历不明的分类站到水质成分的几个已知其中一个组的站聚类分析。这项研究表明,到Petaquilla流域重大水污染的威胁在性质被定义为非点源的农田和牧场的土地用途的工业和城市的发展。采用DM技术是为了配合MV分析。考虑到数据有限,这些方法的使用被认为是在帮助水资源管理者实施监测水中活动,并为改善和保护与人为活动导致受损土地扰动水质源确定优先次序非常有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号