...
首页> 外文期刊>Data & Knowledge Engineering >New fuzzy c-means clustering model based on the data weighted approach
【24h】

New fuzzy c-means clustering model based on the data weighted approach

机译:基于数据加权方法的新型模糊c均值聚类模型

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a new kind of data weighted fuzzy c-means clustering approach. Different from most existing fuzzy clustering approaches, the data weighted clustering approach considers the internal connectivity of all data points. An exponent impact factors vector and an influence exponent are introduced to the new model. Together they influence the clustering process. The data weighted clustering can simultaneously produce three categories of parameters: fuzzy membership degrees, exponent impact factors and the cluster prototypes. A new fuzzy algorithm, DWG-K, is developed by combining the data weighted approach and the G-K. Two groups of numerical experiments were executed. Group 1 demonstrates the clustering performance of the DWG-K. The counterpart is the G-K. The results show the DWG-K can obtain better clustering quality and meanwhile it holds the same level of computational efficiency as the G-K holds. Group 2 checks the ability of the DWG-K in mining the outliers. The counterpart is the well-known LOF. The results show the DWG-K has considerable advantage over the LOF in computational efficiency. And the outliers mined by the DWG-K are global. It was pointed out that the data weighted clustering approach has its unique advantages when mining the outliers of the large scale data sets, when clustering the data set for better clustering results, and especially when these two tasks are done simultaneously.
机译:本文提出了一种新的数据加权模糊c均值聚类方法。与大多数现有的模糊聚类方法不同,数据加权聚类方法考虑了所有数据点的内部连通性。将指数影响因子向量和影响指数引入新模型。它们一起影响聚类过程。数据加权聚类可以同时产生三类参数:模糊隶属度,指数影响因子和聚类原型。通过将数据加权方法与G-K相结合,开发了一种新的模糊算法DWG-K。进行了两组数值实验。第1组演示了DWG-K的群集性能。对应的是G-K。结果表明,DWG-K可以获得更好的聚类质量,同时具有与G-K相同的计算效率。第2组检查DWG-K挖掘异常值的能力。对应的是著名的LOF。结果表明,DWG-K在计算效率方面优于LOF。 DWG-K挖掘的异常值是全球性的。有人指出,数据加权聚类方法在挖掘大规模数据集的离群值,对数据集聚类以获得更好的聚类结果时,尤其是同时完成这两个任务时,具有其独特的优势。

著录项

  • 来源
    《Data & Knowledge Engineering 》 |2010年第9期| P.881-900| 共20页
  • 作者单位

    School of Mechanical and Dynamical Engineering of Shanghai Jiao Tong University, No.800 Dong Chuan Road, Minhang District, Shanghai 200240, PR China;

    rnSchool of Mechanical and Dynamical Engineering of Shanghai Jiao Tong University, No.800 Dong Chuan Road, Minhang District, Shanghai 200240, PR China;

    rnSchool of Mechanical and Dynamical Engineering of Shanghai Jiao Tong University, No.800 Dong Chuan Road, Minhang District, Shanghai 200240, PR China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    fuzzy clustering; data weighted approach; exponent impact factor; influence exponent; outliers mining;

    机译:模糊聚类数据加权法;指数影响因子;影响指数;异常值挖掘;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号