首页> 外文期刊>Neurocomputing >An integrated K-means - Laplacian cluster ensemble approach for document datasets
【24h】

An integrated K-means - Laplacian cluster ensemble approach for document datasets

机译:用于文档数据集的集成K均值-拉普拉斯聚类集成方法

获取原文
获取原文并翻译 | 示例

摘要

Cluster ensemble has become an important extension to traditional clustering algorithms, yet the cluster ensemble problem is very challenging due to the inherent difficulty in resolving the label correspondence problem. We adapted the integrated K-means - Laplacian clustering approach to solve the cluster ensemble problem by exploiting both the attribute information embedded in the cluster labels and the pairwise relations among the objects. The optimal solution of the proposed approach requires computing the pseudo inverse of the normalized Laplacian matrix and the eigenvalue decomposition of a large matrix, which can be computationally burdensome for large scale document datasets. We devised an effective algebraic transformation method for efficiently carrying out the aforementioned computations and proposed an integrated K-means - Laplacian cluster ensemble approach (IKLCEA). Experimental results with benchmark document datasets demonstrate that IKLCEA outperforms other cluster ensemble techniques on most cases. In addition, IKLCEA is computationally efficient and can be readily employed in large scale document applications. (C) 2016 Elsevier B.V. All rights reserved.
机译:聚类集成已经成为传统聚类算法的重要扩展,但是聚类集成问题由于解决标签对应问题的固有困难而非常具有挑战性。我们采用了集成的K均值-Laplacian聚类方法,通过利用嵌入在聚类标签中的属性信息以及对象之间的成对关系来解决聚类集成问题。所提出方法的最佳解决方案需要计算归一化的拉普拉斯矩阵的伪逆和大矩阵的特征值分解,这对于大规模文档数据集可能是繁重的计算工作。我们设计了一种有效进行上述计算的有效代数变换方法,并提出了一种集成的K均值-拉普拉斯聚类集成方法(IKLCEA)。基准文档数据集的实验结果表明,在大多数情况下,IKLCEA优于其他集群集成技术。此外,IKLCEA具有高效的计算能力,可轻松用于大规模文档应用程序。 (C)2016 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2016年第19期|495-507|共13页
  • 作者单位

    Yancheng Inst Technol, Sch Informat Engn, Yancheng, Peoples R China|Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA;

    Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA;

    Yancheng Inst Technol, Sch Informat Engn, Yancheng, Peoples R China;

    Yancheng Inst Technol, Sch Informat Engn, Yancheng, Peoples R China;

    Yancheng Inst Technol, Sch Informat Engn, Yancheng, Peoples R China;

    Yancheng Inst Technol, Sch Informat Engn, Yancheng, Peoples R China;

    Yancheng Inst Technol, Sch Informat Engn, Yancheng, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Cluster analysis; Cluster ensemble; K-means; Laplacian;

    机译:聚类分析;聚类集成;K-均值;拉普拉斯算子;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号