首页> 外文会议>2015 International Conference "Stability and Control Processes" in Memory of V.I. Zubov >Analysis of standard clustering algorithms for grouping MEDLINE abstracts into evidence-based medicine intervention categories
【24h】

Analysis of standard clustering algorithms for grouping MEDLINE abstracts into evidence-based medicine intervention categories

机译:分析将MEDLINE摘要分组为循证医学干预类别的标准聚类算法

获取原文
获取原文并翻译 | 示例

摘要

The paper describes a process of clustering of article abstracts, taken from the largest bibliographic life sciences and biomedical information MEDLINE database into categories that correspond to types of medical interventions - types of patient treatments. Experiments were carried out to evaluate the quality of clustering for the following algorithms: K-means; K-means++; Hierarchical clustering, SIB (Sequential information bottleneck) together with the LSA (Latent Semantic Analysis) methods and MI (Mutual Information) which allow selecting feature vectors. Best results of clustering were achieved by K-means++ together with LSA then 210-dimensional space was chosen: Purity = 0.5719, Entropy = 1.3841, Normalized Entropy = 0.6299.
机译:本文描述了将文章摘要(从最大的书目生命科学和生物医学信息MEDLINE数据库中抽取)进行聚类的过程,将其分类为与医疗干预类型(患者治疗类型)相对应的类别。针对以下算法,进行了评估聚类质量的实验:K-均值; K-均值++;分层聚类,SIB(顺序信息瓶颈)以及LSA(潜在语义分析)方法和MI(互信息)允许选择特征向量。聚类的最佳结果是通过K-means ++和LSA达成的,然后选择了210维空间:纯度= 0.5719,熵= 1.3841,归一化熵= 0.6299。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号