首页> 外文会议> >Biomedical Ontology MeSH Improves Document Clustering Qualify on MEDLINE Articles: A Comparison Study
【24h】

Biomedical Ontology MeSH Improves Document Clustering Qualify on MEDLINE Articles: A Comparison Study

机译:生物医学本体论MeSH改善MEDLINE文章的文档聚类资格:一项比较研究

获取原文

摘要

Document clustering has been used for better document retrieval, document browsing, and text mining. In this paper, we investigate if biomedical ontology MeSH improves the clustering quality for MEDLINE articles. For this investigation, we perform a comprehensive comparison study of various document clustering approaches such as hierarchical clustering methods (single-link, complete-link, and complete link), Bisecting K-means, K-means, and Suffix Tree Clustering (STC) in terms of efficiency, effectiveness, and scalability. According to our experiment results, biomedical ontology MeSH significantly enhances clustering quality on biomedical documents. In addition, our results show that decent document clustering approaches, such as Bisecting Kmeans, K-means and STC, gains some benefit from MeSH ontology while hierarchical algorithms showing the poorest clustering quality do not reap the benefit of MeSH ontology.
机译:文档聚类已用于更好的文档检索,文档浏览和文本挖掘。在本文中,我们研究了生物医学本体MeSH是否可以提高MEDLINE文章的聚类质量。对于此调查,我们对各种文档聚类方法进行了全面的比较研究,例如分层聚类方法(单链接,完整链接和完整链接),平分K均值,K均值和后缀树聚类(STC)在效率,有效性和可扩展性方面。根据我们的实验结果,生物医学本体MeSH显着提高了生物医学文献上的聚类质量。此外,我们的结果表明,象元划分Kmeans,K-means和STC之类的体面文档聚类方法从MeSH本体中获得了一些好处,而显示最差聚类质量的分层算法并没有从MeSH本体中受益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号