首页> 外文期刊>Knowledge-Based Systems >Fuzzy C-Means clustering of incomplete data based on probabilistic information granules of missing values
【24h】

Fuzzy C-Means clustering of incomplete data based on probabilistic information granules of missing values

机译:基于缺失值概率信息颗粒的不完全数据的模糊C-均值聚类

获取原文
获取原文并翻译 | 示例
       

摘要

Missing values are a common phenomenon when dealing with real-world data sets. Analysis of incomplete data sets has become an active area of research. In this paper, we focus on the problem of clustering incomplete data, which is intended to introduce some prior distribution information of the missing values into the algorithm of fuzzy clustering. First, non-parametric hypothesis testing is employed to describe the missing values adhering to a certain Gaussian distribution as probabilistic information granules based on the nearest neighbors of incomplete data. Second, we propose a novel clustering model, in which probabilistic information granules of missing values are incorporated into the Fuzzy C-Means clustering of incomplete data by involving the maximum likelihood criterion. Third, the clustering model is optimized by using a tri-level alternating optimization utilizing the method of Lagrange multipliers. The convergence and the time complexity of the clustering algorithm are also discussed. The experiments reported both on synthetic and real-world data sets demonstrate that the proposed approach can effectively realize clustering of incomplete data. (C) 2016 Elsevier B.V. All rights reserved.
机译:在处理实际数据集时,缺少值是一种常见现象。不完整数据集的分析已成为研究的活跃领域。在本文中,我们关注于对不完整数据进行聚类的问题,旨在将缺失值的一些先验分布信息引入模糊聚类算法中。首先,基于非完整数据的最近邻,采用非参数假设检验将遵循某种高斯分布的缺失值描述为概率信息颗粒。其次,我们提出了一种新颖的聚类模型,其中通过涉及最大似然准则,将缺失值的概率信息颗粒合并到不完整数据的模糊C均值聚类中。第三,通过使用拉格朗日乘数法的三级交替优化对聚类模型进行优化。还讨论了聚类算法的收敛性和时间复杂度。在合成和真实数据集上进行的实验报告表明,该方法可以有效地实现不完整数据的聚类。 (C)2016 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Knowledge-Based Systems》 |2016年第may1期|51-70|共20页
  • 作者单位

    Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China|Dalian Univ Technol, Dept Engn Mech, Dalian 116024, Peoples R China;

    Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China;

    Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China;

    Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6R 2V4, Canada|King Abdulaziz Univ, Fac Engn, Dept Elect & Comp Engn, Jeddah 21413, Saudi Arabia|Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland;

    Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Fuzzy clustering; Incomplete data; Missing value; Probabilistic information granules; Alternating optimization;

    机译:模糊聚类;数据不完整;缺失值;概率信息粒度;交替优化;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号