首页> 外文会议>Seventh international workshop on health text mining and information analysis >Retrofitting Word Vectors of MeSH Terms to Improve Semantic Similarity Measures
【24h】

Retrofitting Word Vectors of MeSH Terms to Improve Semantic Similarity Measures

机译:改进MeSH词的单词向量以改善语义相似度

获取原文
获取原文并翻译 | 示例

摘要

Estimation of the semantic relatedness between biomedical concepts has utility for many informatics applications. Automated methods fall into two broad categories: methods based on distributional statistics drawn from text corpora, and methods based on the structure of existing knowledge resources. In the former case, taxonomic structure is disregarded. In the latter, semantically relevant empirical information is not considered. In this paper, we present a method that retrofits the context vector representation of MeSH terms by using additional linkage information from UMLS/MeSH hierarchy such that linked concepts have similar vector representations. We evaluated the method relative to previously published physician and coder's ratings on sets of MeSH terms. Our experimental results demonstrate that the retrofitted word vector measures obtain a higher correlation with physician judgments. The results also demonstrate a clear improvement on the correlation with experts' ratings from the retrofitted vector representation in comparison to the vector representation without retrofitting.
机译:生物医学概念之间语义相关性的估计对于许多信息学应用具有实用性。自动化方法分为两大类:基于从文本语料库得出的分布统计的方法和基于现有知识资源的结构的方法。在前一种情况下,分类结构被忽略。在后者中,不考虑语义相关的经验信息。在本文中,我们提出了一种通过使用来自UMLS / MeSH层次结构的附加链接信息来改进MeSH术语的上下文向量表示的方法,以使链接的概念具有相似的向量表示。我们根据MeSH术语集评估了相对于先前发表的医师和编码人员等级的方法。我们的实验结果表明,改进后的词向量测度与医生的判断具有较高的相关性。结果还表明,与不进行改进的矢量表示相比,经过改进的矢量表示与专家评级之间的相关性有了明显改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号