A comparative analysis of clustering algorithms to identify the homogeneous rainfall gauge stations of Bangladesh

Mohammad Samsul Alam; Sangita Paul

首页> 外文期刊>Journal of applied statistics >A comparative analysis of clustering algorithms to identify the homogeneous rainfall gauge stations of Bangladesh

【24h】

A comparative analysis of clustering algorithms to identify the homogeneous rainfall gauge stations of Bangladesh

机译：聚类算法识别孟加拉国均匀降雨量站的比较分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

ABSTRACT Dealing with individual rainfall station is time consuming as well as prone to more variation. It seems reasonable and advantageous to deal with a group of homogeneous stations rather than an individual station. Such groups can be identified using clustering algorithms, techniques used in the multivariate data analysis. Particularly, in this study, covering both hard and soft clustering approaches, three clustering algorithms namely Agglomerative hierarchical, K-means clustering and Fuzzy C-means methods are chosen due to their popularity. These algorithms are applied over precipitation data recorded by the Bangladesh Meteorology Department, and a comparison among the algorithms is made. Annual and seasonal precipitations from 1977 to 2012 recorded in 30 stations are used in this study. Optimal numbers of clusters in the four precipitation series are determined using the Gap statistic for K-means clustering and using the extended Gap statistic for Fuzzy C-means clustering, and are found as 3, 1, 3 and 2 for annual, pre-monsoon, monsoon and post-monsoon, respectively. This study investigates the clustering methods in terms of the similarity, members and homogeneity, among the clusters formed. The clusters are also characterized to see how they are distributed. Moreover, in terms of cluster homogeneity, Fuzzy C-means algorithm outperforms the other clustering methods.

机译：摘要处理单个降雨站是耗时的，也容易出现更多的变化。似乎合理且有利地处理一组同质的站而不是单独的站。可以使用聚类算法，多变量数据分析中使用的技术来识别这些组。特别是，在本研究中，涵盖了硬质和软聚类方法，三种聚类算法即附加分层，K-Means聚类和模糊C-Means方法是由于流行性而选择的。这些算法应用于孟加拉国气象部门记录的降水数据，并进行算法之间的比较。在本研究中使用了1977年至2012年从1977年到2012年的年度和季节性降沉淀。使用k-means聚类的间隙统计和使用模糊C-means聚类的扩展间隙统计数据确定四个降水系列中的最佳数量，并以每年，季前翁的3,1,3和2发现为3,1,3和2 ，季风和季风分别。本研究在形成的簇中调查了在相似性，成员和均匀性方面的聚类方法。群集也表征，看看它们是如何分布的。此外，就集群同质性而言，模糊C-均值算法优于其他聚类方法。

著录项

来源
《Journal of applied statistics》 |2020年第8期|1460-1481|共22页
作者
Mohammad Samsul Alam; Sangita Paul;
展开▼
作者单位

University of Dhaka;

University of Dhaka;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Gap statistic; hierarchical clustering; K-means; Fuzzy C-means; L-moments; rainfall;

机译：间隙统计;分层聚类;K-means;模糊C-means;L-MOCENTS;降雨;

相似文献

外文文献
中文文献
专利

1. Identifying the dependency pattern of daily rainfall of Dhaka station in Bangladesh using Markov chain and logistic regression model [J] . Mina Mahbub Hossain, Sayedul Anam Agricultural Sciences . 2012,第3期

机译：利用马尔可夫链和Logistic回归模型识别孟加拉国达卡站日降水量的依赖模式。
2. Comparative analysis of two rainfall retrieval algorithms during extreme rainfall event: a case study on cloudburst, 2010 over Ladakh (Leh), Jammu and Kashmir [J] . Banerjee Arkadeb, Dimri A. P. Natural Hazards . 2019,第3期

机译：极端降雨期间两次降雨检索算法的比较分析 - 以拉达克（LEH），Jammu和Kashmir的Cloudburst案例研究
3. Identifying homogeneous rainfall catchments for non- stationary time series using tops is algorithm and bootstrap k-sample Anderson darling test [J] . Zun Liang Chuan, Noriszura Ismail, Wan Nur Syahidah Wan Yusoff, International Journal of Engineering & Technology . 2018,第4期

机译：使用tops算法和Bootstrap k样本Anderson darling检验识别非平稳时间序列的均质降雨集水区
4. Comparison between Satellite Rainfall Data and Rain Gauge Stations in Galal-Badra Watershed, Iraq [C] . Mohammed Shamkhi, Atyaf Jawad, Tabark Jameel International Conference on Developments in eSystems Engineering . 2019

机译：伊拉克Galal-Badra流域的卫星降雨数据与雨量计之间的比较
5. DSD variations on rain-rate estimate algorithms of X-band polarimetric radar and rainfall characterization in tropical environments using 2DVD, rain gauges and TRMM data. [D] . Baquero Fuentes, Margarita. 2006

机译：使用2DVD，雨量计和TRMM数据，X波段极化雷达的雨率估计算法的DSD变化和热带环境中的降雨特征。
6. A comparative analysis of biclustering algorithms for gene expression data [O] . Kemal Eren, *, Mehmet Deveci, -1

机译：基因表达数据的双聚类算法的比较分析
7. Identifying the dependency pattern of daily rainfall of Dhaka station in Bangladesh using Markov chain and logistic regression model [O] . Sayedul Anam, Mina Mahbub Hossain 2012

机译：用马尔可夫链和Logistic回归模型识别孟加拉国达卡站日降水量的依赖性。

A comparative analysis of clustering algorithms to identify the homogeneous rainfall gauge stations of Bangladesh

摘要

著录项

相似文献

相关主题

期刊订阅