首页> 外文期刊>Artificial Intelligence Review: An International Science and Engineering Journal >Leveraging synonymy and polysemy to improve semantic similarity assessments based on intrinsic information content
【24h】

Leveraging synonymy and polysemy to improve semantic similarity assessments based on intrinsic information content

机译:利用同义词和多义,以提高基于内在信息内容的语义相似性评估

获取原文
获取原文并翻译 | 示例
           

摘要

Semantic similarity measures based on the estimation of the information content (IC) of concepts are currently regarded as the state of the art. Calculating the IC in an intrinsic (i.e., ontology-based) way is particularly convenient due to its accuracy and lack of dependency on annotated corpora. Intrinsic IC calculation models estimate concept probabilities from the taxonomic knowledge (i.e., number of hyponyms and/or hypernyms of the concepts) modelled in an ontology. In this paper, we aim to improve the intrinsic calculation of the IC by leveraging not only the hyponyms and hypernyms of concepts, but also the explicit evidences of synonymy and polysemy that ontologies such as WordNet also model. Specifically, we propose a more accurate intrinsic estimation of the concepts' probabilities in which the IC calculation relies. We evaluate the accuracy of our proposal through a set of comprehensive experiments in which our IC calculation model is tested on a variety of IC-based similarity measures and benchmarks. Experimental results show that our proposal obtains consistently good accuracies, which vary less across measures and benchmarks than the most prominent intrinsic IC calculation models available in the literature.
机译:基于概念的信息内容(IC)估计的语义相似度措施当前被认为是最先进的。计算IC在内在(即,基于本体)的方式,由于其准确性和对注释的语料库缺乏依赖性,特别方便。内在IC计算模型在本体中建模的分类学知识(即,概念的虚构名称和/或/或概念的超级次数)估计概念概率。在本文中,我们的目的是通过利用概念的下个词和概念的虚空和高次数来改善IC的内在计算,也是Synolym和Polymy的显式证据,例如Wordnet等模型。具体地,我们提出了一种更准确的IC计算概念的内在估计依据依赖性依赖性。我们通过一系列综合实验评估我们提案的准确性,其中我们的IC计算模型在各种基于IC的相似度措施和基准测试中进行了测试。实验结果表明,我们的提案始终如一的良好准确性,可跨文献中可用的最突出的内在IC计算模型而变得不那么不等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号