首页> 外文会议>International conference on advanced data and information engineering >Augmenting Concept Definition in Gloss Vector Semantic Relatedness Measure using Wikipedia Articles
【24h】

Augmenting Concept Definition in Gloss Vector Semantic Relatedness Measure using Wikipedia Articles

机译:使用维基百科文章的光泽传染媒介语义相关措施增强概念定义

获取原文

摘要

Semantic relatedness measures are widely used in text mining and information retrieval applications. Considering these automated measures, in this research paper we attempt to improve Gloss Vector relatedness measure for more accurate estimation of relatedness between two given concepts. Generally, this measure, by constructing concepts definitions (Glosses) from a thesaurus, tries to find the angle between the concepts' gloss vectors for the calculation of relatedness. Nonetheless, this definition construction task is challenging as thesauruses do not provide full coverage of expressive definitions for the particularly specialized concepts. By employing Wikipedia articles and other external resources, we aim at augmenting these concepts' definitions. Applying both definition types to the biomedical domain, using MEDLINE as corpus, UMLS as the default thesaurus, and a reference standard of 68 concept pairs manually rated for relatedness, we show exploiting available resources on the Web would have positive impact on final measurement of semantic relatedness.
机译:语义相关性措施广泛用于文本挖掘和信息检索应用。考虑到这些自动化措施,在本研究论文中,我们试图改善光泽矢量相关措施,以便更准确地估算两个给定的概念之间的相关性。通常,通过构建来自词库的概念定义(彩色彩色)来试图在概念的光泽向量之间找到用于计算相关性的概念之间的角度。尽管如此,这种定义施工任务挑战,因为叙述不提供特别专业概念的表达定义的全面覆盖。通过雇用维基百科文章和其他外部资源,我们旨在增强这些概念的定义。将定义类型应用于生物医学域,使用Medline作为语料库,UMLS作为默认词库,以及68个概念对的参考标准,有关相关性的68对,我们显示有利用网络上的可用资源将对语义的最终测量产生积极影响相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号