首页> 外文会议>International Conference on "Stability and Control Processes" in Memory of V.I. Zubov >Analysis of standard clustering algorithms for grouping MEDLINE abstracts into evidence-based medicine intervention categories
【24h】

Analysis of standard clustering algorithms for grouping MEDLINE abstracts into evidence-based medicine intervention categories

机译:将Medline摘要分析到基于证据的药物干预类别的标准聚类算法分析

获取原文

摘要

The paper describes a process of clustering of article abstracts, taken from the largest bibliographic life sciences and biomedical information MEDLINE database into categories that correspond to types of medical interventions - types of patient treatments. Experiments were carried out to evaluate the quality of clustering for the following algorithms: K-means; K-means++; Hierarchical clustering, SIB (Sequential information bottleneck) together with the LSA (Latent Semantic Analysis) methods and MI (Mutual Information) which allow selecting feature vectors. Best results of clustering were achieved by K-means++ together with LSA then 210-dimensional space was chosen: Purity = 0.5719, Entropy = 1.3841, Normalized Entropy = 0.6299.
机译:本文介绍了文章摘要的过程,从最大的书目生命科学和生物医学信息Medline数据库中获取成与医疗干预类型的类别 - 患者治疗类型。进行实验以评估以下算法的聚类质量:K-means; k-means ++;分层聚类,SIB(顺序信息瓶颈)以及LSA(潜在语义分析)方法和MI(互信息),允许选择特征向量。通过K-means ++实现聚类的最佳结果,然后选择LSA,然后选择210维空间:纯度= 0.5719,熵= 1.3841,归一化熵= 0.6299。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号