
Three-way clustering method for incomplete information system based on set-pair analysis


Abstract

Traditional clustering algorithms assign every sample, including uncertain ones, to exactly one cluster, which fails to reflect that a cluster may not have a clear boundary. When a dataset contains a large number of missing values, traditional clustering methods cannot achieve good results. Therefore, the idea of three-way decision is introduced into traditional K-means clustering and combined with the knowledge of set-pair information granules. This paper presents a three-way clustering method that can process missing values effectively. First, the granularity corresponding to missing values is recorded as the degree of difference. Next, the algorithm establishes the distance between the samples and the cluster centers according to set-pair theory. All samples are then assigned to clusters according to this distance, forming a three-way clustering result that consists of a positive region, a boundary region and a negative region, which improves the structure of the clustering results. Samples in the positive region certainly belong to the cluster; samples in the boundary region may belong to the cluster; samples in the negative region do not belong to the cluster; the three regions together represent the clustering result. Finally, the validity of the algorithm is verified on UCI datasets.
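
The assignment step described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not give the set-pair distance formula, so the sketch substitutes a plain partial distance over the observed attributes. The function names (`partial_distance`, `three_way_assign`), the threshold `eps`, and the rule that widens the threshold by the fraction of missing attributes `b` are all assumptions made for illustration.

```python
import math

def partial_distance(x, center):
    """Distance from sample x to a center, using only observed attributes.

    Missing attributes are encoded as None. Returns (distance, b), where b is
    the fraction of missing attributes, used here as a stand-in for the
    'degree of difference' (an assumption, not the paper's definition).
    """
    n = len(x)
    observed = [(xi, ci) for xi, ci in zip(x, center) if xi is not None]
    b = (n - len(observed)) / n
    if not observed:
        return math.inf, b  # nothing observed, so nothing to compare
    sq = sum((xi - ci) ** 2 for xi, ci in observed)
    # Rescale so samples with few observed attributes stay comparable.
    return math.sqrt(sq * n / len(observed)), b

def three_way_assign(samples, centers, eps):
    """Split sample indices into positive, boundary and negative regions per cluster."""
    k = len(centers)
    positive = [set() for _ in range(k)]
    boundary = [set() for _ in range(k)]
    for idx, x in enumerate(samples):
        results = [partial_distance(x, c) for c in centers]
        dists = [d for d, _ in results]
        b = results[0][1]  # depends only on x, so any center gives the same b
        d_min = min(dists)
        if math.isinf(d_min):
            close = list(range(k))  # fully missing sample: uncertain everywhere
        else:
            margin = eps * (1 + b)  # assumed rule: more missing, wider boundary
            close = [j for j, d in enumerate(dists) if d - d_min <= margin]
        if len(close) == 1:
            positive[close[0]].add(idx)  # certainly belongs to this cluster
        else:
            for j in close:
                boundary[j].add(idx)     # may belong to each nearby cluster
    everything = set(range(len(samples)))
    negative = [everything - positive[j] - boundary[j] for j in range(k)]
    return positive, boundary, negative

centers = [(0.0, 0.0), (10.0, 10.0)]
samples = [(0.5, 0.2), (9.5, None), (5.0, 5.1), (None, 5.0)]
positive, boundary, negative = three_way_assign(samples, centers, eps=1.0)
```

In this toy example the first two samples fall in the positive regions of the first and second cluster respectively, while the last two lie about equally far from both centers and therefore fall in the boundary region of both clusters. A full K-means-style method would additionally update the centers and iterate, which is omitted here.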

Bibliographic details

  • Source
    Granular Computing | 2021, Issue 2 | pp. 389-398 | 10 pages
  • Author affiliations

    College of Science, North China University of Technology, Tangshan 063210, Hebei, China; Key Laboratory of Data Science and Application in Hebei Province, Tangshan 063210, China;

    College of Science, North China University of Technology, Tangshan 063210, Hebei, China;

    College of Science, North China University of Technology, Tangshan 063210, Hebei, China;

    College of Science, North China University of Technology, Tangshan 063210, Hebei, China;

  • Original format: PDF
  • Language: English
  • Keywords

    Set-pair analysis; Incomplete information; Three-way decision; K-means; Three-way clustering;

