【24h】

DifFUZZY: a fuzzy clustering algorithm for complex datasets

机译:DifFUZZY:复杂数据集的模糊聚类算法

获取原文
获取原文并翻译 | 示例
       

摘要

Soft (fuzzy) clustering techniques are often used in the study of high-dimensional datasets, such as microarray and other high-throughput bioinformatics data. The most widely used method is the fuzzy C-means (FCM) algorithm , but it can present difficulties when dealing with some datasets. A fuzzy clustering algorithm, DifFUZZY, which utilises concepts from diffusion processes in graphs and is applicable to a larger class of clustering problems than other fuzzy clustering algorithms is developed. Examples of datasets (synthetic and real) for which this method outperforms other frequently used algorithms are presented, including two benchmark biological datasets, a genetic expression dataset and a dataset that contains taxonomic measurements. This method is better than traditional fuzzy clustering algorithms at handling datasets that are 'curved', elongated or those which contain clusters of different dispersion. The algorithm has been implemented in Matlab and C++ and is available at http://www.maths.ox.ac.uk/cmb/difFUZZY.
机译:软(模糊)聚类技术通常用于研究高维数据集,例如微阵列和其他高通量生物信息学数据。使用最广泛的方法是模糊C均值(FCM)算法,但在处理某些数据集时可能会遇到困难。开发了一种模糊聚类算法DifFUZZY,该算法利用了图形中扩散过程的概念,并且比其他模糊聚类算法更适用于更大类别的聚类问题。给出了此方法优于其他常用算法的数据集(合成和真实)示例,包括两个基准生物学数据集,一个基因表达数据集和一个包含分类学测量值的数据集。这种方法比传统的模糊聚类算法在处理“弯曲的”,拉长的或包含不同色散聚类的数据集方面更好。该算法已在Matlab和C ++中实现,可从http://www.maths.ox.ac.uk/cmb/difFUZZY获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号