首页> 外文会议>2010 International Conference on Networking and Information Technology >A fuzzy threshold based unsupervised clustering algorithm for natural data exploration
【24h】

A fuzzy threshold based unsupervised clustering algorithm for natural data exploration

机译:基于模糊阈值的无监督自然数据挖掘聚类算法

获取原文

摘要

Traditional clustering methods require the user to determine the number of clusters before we start any data exploration. In fuzzy clustering methods the performance efficiency of the algorithm depends mainly on the initial selection of number of clusters and cluster seeds. The real world data is almost never arranged in clear cut group and the initial selection of cluster count and centroids becomes a tedious task. In this paper we propose a new unsupervised clustering algorithm which works on the principles of fuzzy clustering. The new method we propose is using a modified form of popular fuzzy c-means algorithm for membership calculation. The algorithm begins with two initial cluster centers and forms many clusters based on a threshold value. It uses the fuzzy membership value of a cluster centre in another existing cluster to merge the clusters and finally converges to the optimum number of clusters. The algorithm is tested with the data for Gross National Happiness (GNH) program of Bhutan and found to be highly efficient in segmenting natural data sets.
机译:传统的群集方法要求用户在开始任何数据探索之前确定群集的数量。在模糊聚类方法中,算法的性能效率主要取决于聚类数和聚类种子的初始选择。现实世界中的数据几乎永远不会以明确的形式排列,而簇数和质心的初始选择变得繁琐。在本文中,我们提出了一种新的基于模糊聚类原理的无监督聚类算法。我们提出的新方法是使用一种改进形式的流行模糊c均值算法进行隶属度计算。该算法从两个初始聚类中心开始,并基于阈值形成许多聚类。它使用另一个现有集群中集群中心的模糊隶属度值来合并集群,最后收敛到最优数量的集群。该算法经过不丹国民幸福指数(GNH)程序数据的测试,发现在分割自然数据集方面非常高效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号