Journal: Journal of biomedical informatics

Measures of semantic similarity and relatedness in the biomedical domain.

Abstract

Measures of semantic similarity between concepts are widely used in Natural Language Processing. In this article, we show how six existing domain-independent measures can be adapted to the biomedical domain. These measures were originally based on WordNet, an English lexical database of concepts and relations. In this research, we adapt these measures to the SNOMED-CT ontology of medical concepts. The measures include two path-based measures and three measures that augment path-based measures with information content statistics from corpora. We also derive a context vector measure based on medical corpora that can be used as a measure of semantic relatedness. These six measures are evaluated against a newly created test bed of 30 medical concept pairs scored by three physicians and nine medical coders. We find that the medical coders and physicians differ in their ratings, and that the context vector measure correlates most closely with the physicians, while the path-based measures and one of the information content measures correlate most closely with the medical coders. We conclude that there is a role both for more flexible measures of relatedness based on information derived from corpora and for measures that rely on existing ontological structures.
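
The three families of measures mentioned in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the tiny is-a hierarchy is a hypothetical stand-in for SNOMED-CT, the corpus counts are made up, and the function names are not from the paper. The abstract does not name the specific measures, so the information content example uses a Resnik-style score (the information content of the lowest common subsumer) as one plausible representative, and the context vector measure is reduced to a plain cosine between co-occurrence vectors.

```python
import math

# Hypothetical toy is-a hierarchy (child -> parent); NOT real SNOMED-CT content.
PARENT = {
    "disease": None,
    "heart disease": "disease",
    "lung disease": "disease",
    "myocardial infarction": "heart disease",
    "angina": "heart disease",
    "pneumonia": "lung disease",
}

# Made-up counts of direct mentions in an imaginary corpus (illustration only).
COUNTS = {
    "disease": 10,
    "heart disease": 20,
    "lung disease": 20,
    "myocardial infarction": 25,
    "angina": 15,
    "pneumonia": 10,
}


def ancestors(concept):
    """Return the chain from a concept up to the root, concept first."""
    chain = []
    while concept is not None:
        chain.append(concept)
        concept = PARENT[concept]
    return chain


def lowest_common_subsumer(a, b):
    """Most specific concept that subsumes (or equals) both a and b."""
    seen = set(ancestors(a))
    for node in ancestors(b):
        if node in seen:
            return node
    raise ValueError("concepts share no ancestor")


def path_similarity(a, b):
    """Path-based: inverse of the node count on the shortest is-a path."""
    lcs = lowest_common_subsumer(a, b)
    edges = ancestors(a).index(lcs) + ancestors(b).index(lcs)
    return 1.0 / (edges + 1)


def information_content(concept):
    """-log p(concept); a concept's frequency includes all its descendants."""
    total = sum(COUNTS.values())
    freq = sum(n for c, n in COUNTS.items() if concept in ancestors(c))
    return -math.log(freq / total)


def resnik_similarity(a, b):
    """Information-content-based: IC of the lowest common subsumer."""
    return information_content(lowest_common_subsumer(a, b))


def cosine(u, v):
    """Relatedness of two context (co-occurrence) vectors of equal length."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0
```

On this toy data, both `path_similarity` and `resnik_similarity` score the sibling pair ("myocardial infarction", "angina") higher than the cross-branch pair ("myocardial infarction", "pneumonia"), because the siblings share a more specific subsumer. The cosine measure needs no hierarchy at all, which is why a measure of that kind can capture relatedness between concepts that are far apart in the ontology.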
