首页> 外文期刊>Artificial intelligence in medicine >Association measures for estimating semantic similarity and relatedness between biomedical concepts
【24h】

Association measures for estimating semantic similarity and relatedness between biomedical concepts

机译:估计生物医学概念之间语义相似性和相关性的关联度量

获取原文
获取原文并翻译 | 示例
           

摘要

Association measures quantify the observed likelihood a term pair co-occurs versus their predicted co-occurrence together if by chance. This is based both on the terms' individual occurrence frequencies, and their mutual co-occurrence frequencies. One application of association scores is estimating semantic relatedness, which is critical for many natural language processing applications, such as clustering of biomedical and clinical documents and the development of biomedical terminologies and ontololgies. In this paper we propose a method of generating association scores between biomedical concepts to estimate semantic relatedness. We use co-occurrence statistics between Unified Medical Language System (UMLS) concepts to account for lexical variation at the synonymous level, and introduce a process of concept expansion that exploits hierarchical information from the UMLS to account for lexical variation at the hyponymous level. State of the art results are achieved on several standard evaluation datasets, and an in depth analysis of hyper-parameters is presented.
机译:关联度量会量化一个术语对同时出现的观测可能性与其预测的共同出现的可能性(如果偶然)。这既基于术语的单独出现频率,又基于它们的相互同时出现频率。关联分数的一种应用是估计语义相关性,这对于许多自然语言处理应用至关重要,例如生物医学和临床文档的聚类以及生物医学术语和本体论的发展。在本文中,我们提出了一种在生物医学概念之间生成关联评分以估计语义相关性的方法。我们使用统一医学语言系统(UMLS)概念之间的共现统计来解释同义词级别的词汇变异,并引入概念扩展过程,该概念扩展过程利用UMLS中的层次结构信息来解释下位层次的词汇变异。在几个标准评估数据集上获得了最先进的结果,并提出了对超参数的深入分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号