首页> 外文会议>ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD'06); 20060820-23; Philadelphia,PA(US) >Integration of Semantic-based Bipartite Graph Representation and Mutual Refinement Strategy for Biomedical Literature Clustering
【24h】

Integration of Semantic-based Bipartite Graph Representation and Mutual Refinement Strategy for Biomedical Literature Clustering

机译:生物医学文献聚类中基于语义的二部图表示和互精策略的集成

获取原文
获取原文并翻译 | 示例

摘要

We introduce a novel document clustering approach that overcomes those problems by combining a semantic-based bipartite graph representation and a mutual refinement strategy. The primary contributions of this paper are the following. First, we introduce a new representation of documents using a bipartite graph between documents and co-occurrence concepts in the documents. Second, we show how to enhance clustering quality by applying the mutual refinement strategy to the initial clustering results. Third, through the experiments on MEDLINE documents, we show that our integrated method significantly enhances cluster quality and clustering reliability compared to existing clustering methods. Our approach improves on the average 29.5% cluster quality and 26.3% clustering reliability, in terms of misclassification index, over Bisecting K-means with the best parameters.
机译:我们介绍了一种新颖的文档聚类方法,该方法通过结合基于语义的二部图表示法和互精策略来克服这些问题。本文的主要贡献如下。首先,我们使用文档和文档中共现概念之间的二部图来介绍文档的新表示形式。其次,我们展示了如何通过将相互细化策略应用于初始聚类结果来提高聚类质量。第三,通过在MEDLINE文档上进行的实验,我们表明,与现有的聚类方法相比,我们的集成方法显着提高了聚类质量和聚类可靠性。在分类错误指数方面,我们的方法相对于具有最佳参数的二等分K均值,平均聚类质量提高了29.5%,聚类可靠性提高了26.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号