...
首页> 外文期刊>Hydrological Processes >Similarity search and pattern discovery in hydrological time series data mining
【24h】

Similarity search and pattern discovery in hydrological time series data mining

机译:水文时间序列数据挖掘中的相似度搜索和模式发现

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The rapid development of data mining provides a new method for water resource management, hydrology and hydroinformatics research. In the paper, based on data mining theory and technology, we analyse hydrological daily discharge time series of the Shaligunlanke Station in the Tarim River Basin in China from the year 1961 to 2000. Firstly, according to the four monthly statistics, namely mean monthly discharge, monthly maximum discharge, monthly amplitude and monthly standard deviation. K-mean clustering was used to segment the annual process of the daily discharge. The clustering result showed that the annual process of the daily discharge can be divided into rive segments: snowmelt period I (April), snowmelt period II (May), rainfall period I (June-August), rainfall period II (September) and dry period (October-December and January-March). Secondly, dynamic time warping (DTW), which is a different distance metric method from the traditional Euclidian distance metric, was used to look for similarities in the discharge process. On the basis of the similarity matrix, the similar discharge processes can be mined in each period. Thirdly, agglomerative hierarchical clustering was used to cluster and discover the discharge patterns in terms of the autoregressive model. It was found that the discharge had a close relationship with the temperature and the precipitation, and the discharge processes were more similar under the same climatic condition. Our study shows that data mining is a feasible and efficient approach to discover the hidden information in the historical hydrological data and mining the implicative laws under the hydrological process.
机译:数据挖掘的快速发展为水资源管理,水文学和水信息学研究提供了一种新的方法。本文以数据挖掘理论和技术为基础,分析了塔里木河流域沙里贡兰克站1961 — 2000年的水文日排放时间序列。首先,根据四个月的统计数据,即月平均排放量,每月最大排放量,每月振幅和每月标准偏差。使用K均值聚类来划分每日排放的年度过程。聚类结果表明,日排放量的年过程可分为河段:融雪期I(4月),融雪期II(5月),降雨期I(6月至8月),降雨期II(9月)和干燥期间(10月-12月和1月-3月)。其次,动态时间规整(DTW)是一种与传统的欧几里德距离度量不同的距离度量方法,用于寻找放电过程中的相似之处。基于相似度矩阵,可以在每个周期中挖掘相似的放电过程。第三,利用聚类层次聚类对自回归模型进行聚类和发现放电模式。研究发现,在相同的气候条件下,放电与温度和降水有密切关系,并且放电过程更为相似。我们的研究表明,数据挖掘是一种在历史水文数据中发现隐藏信息并挖掘水文过程中隐含规律的可行而有效的方法。

著录项

  • 来源
    《Hydrological Processes》 |2010年第9期|p.1198-1210|共13页
  • 作者单位

    State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Hohai University, Nanjing 210098, P. R. China State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, P. R. China;

    State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Hohai University, Nanjing 210098, P. R. China;

    State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, P. R. China;

    State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, P. R. China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    data mining; hydrological time series; clustering; dynamic time warping; similarity search; pattern discovery;

    机译:数据挖掘;水文时间序列;集群动态时间扭曲;相似度搜索模式发现;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号