首页> 外文期刊>Journal of applied statistics >A comparative analysis of clustering algorithms to identify the homogeneous rainfall gauge stations of Bangladesh
【24h】

A comparative analysis of clustering algorithms to identify the homogeneous rainfall gauge stations of Bangladesh

机译:聚类算法识别孟加拉国均匀降雨量站的比较分析

获取原文
获取原文并翻译 | 示例
           

摘要

ABSTRACT Dealing with individual rainfall station is time consuming as well as prone to more variation. It seems reasonable and advantageous to deal with a group of homogeneous stations rather than an individual station. Such groups can be identified using clustering algorithms, techniques used in the multivariate data analysis. Particularly, in this study, covering both hard and soft clustering approaches, three clustering algorithms namely Agglomerative hierarchical, K-means clustering and Fuzzy C-means methods are chosen due to their popularity. These algorithms are applied over precipitation data recorded by the Bangladesh Meteorology Department, and a comparison among the algorithms is made. Annual and seasonal precipitations from 1977 to 2012 recorded in 30 stations are used in this study. Optimal numbers of clusters in the four precipitation series are determined using the Gap statistic for K-means clustering and using the extended Gap statistic for Fuzzy C-means clustering, and are found as 3, 1, 3 and 2 for annual, pre-monsoon, monsoon and post-monsoon, respectively. This study investigates the clustering methods in terms of the similarity, members and homogeneity, among the clusters formed. The clusters are also characterized to see how they are distributed. Moreover, in terms of cluster homogeneity, Fuzzy C-means algorithm outperforms the other clustering methods.
机译:摘要处理单个降雨站是耗时的,也容易出现更多的变化。似乎合理且有利地处理一组同质的站而不是单独的站。可以使用聚类算法,多变量数据分析中使用的技术来识别这些组。特别是,在本研究中,涵盖了硬质和软聚类方法,三种聚类算法即附加分层,K-Means聚类和模糊C-Means方法是由于流行性而选择的。这些算法应用于孟加拉国气象部门记录的降水数据,并进行算法之间的比较。在本研究中使用了1977年至2012年从1977年到2012年的年度和季节性降沉淀。使用k-means聚类的间隙统计和使用模糊C-means聚类的扩展间隙统计数据确定四个降水系列中的最佳数量,并以每年,季前翁的3,1,3和2发现为3,1,3和2 ,季风和季风分别。本研究在形成的簇中调查了在相似性,成员和均匀性方面的聚类方法。群集也表征,看看它们是如何分布的。此外,就集群同质性而言,模糊C-均值算法优于其他聚类方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号