首页> 外文会议>ACM SIGKDD International Conference on Knowledge Discovery and Data Mining >Integration of Semantic-based Bipartite Graph Representation and Mutual Refinement Strategy for Biomedical Literature Clustering
【24h】

Integration of Semantic-based Bipartite Graph Representation and Mutual Refinement Strategy for Biomedical Literature Clustering

机译:基于语义的二分拉图表示与生物医学文献聚类的互述策略集成

获取原文

摘要

We introduce a novel document clustering approach that overcomes those problems by combining a semantic-based bipartite graph representation and a mutual refinement strategy. The primary contributions of this paper are the following. First, we introduce a new representation of documents using a bipartite graph between documents and co-occurrence concepts in the documents. Second, we show how to enhance clustering quality by applying the mutual refinement strategy to the initial clustering results. Third, through the experiments on MEDLINE documents, we show that our integrated method significantly enhances cluster quality and clustering reliability compared to existing clustering methods. Our approach improves on the average 29.5% cluster quality and 26.3% clustering reliability, in terms of misclassification index, over Bisecting K-means with the best parameters.
机译:我们介绍了一种新的文档聚类方法,通过组合基于语义的二分图表示和相互细化策略来克服这些问题。本文的主要贡献如下。首先,我们使用文档中的文档和共同发生概念之间的二分图引入了新的文档的新表现形式。其次,我们展示了如何通过将相互细化策略应用于初始聚类结果来提高聚类质量。第三,通过MEDLINE文件的实验,我们表明,与现有的聚类方法相比,我们的集成方法显着提高了群集质量和聚​​类可靠性。在错误分类指数方面,我们的方法提高了平均29.5%的群集质量和26.3%的聚类可靠性,以便与最佳参数的Bisecting K-militing。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号