...
首页> 外文期刊>Engineering Applications of Artificial Intelligence >A graph-based semantic relatedness assessment method combining wikipedia features
【24h】

A graph-based semantic relatedness assessment method combining wikipedia features

机译:结合维基百科特征的基于图的语义相关性评估方法

获取原文
获取原文并翻译 | 示例
           

摘要

Semantic relatedness assessment between concepts is a critical issue in many domains such as artificial intelligence, information retrieval, psychology, biology, linguistics and cognitive science. Therefore, several methods assess relatedness by exploiting knowledge bases to express the semantics of concepts. However, there are some limitations such as high-dimensional space, high-computational complexity, fitting non-dynamic domains. Considering that Wikipedia, a domain-independent encyclopedic repository, which provides very large coverage, has been exploited by many methods as a huge semantic resource. In this paper, we propose a novel graph-based relatedness assessment method using Wikipedia features to avoid some of the limitations and drawbacks mentioned above. Firstly, for each term in a word pair, the top k most relevant Wikipedia concepts are returned by the Naive-ESA algorithm to reduce the dimensional space of Explicit Semantic Analysis (ESA) method. Secondly, for each different candidate concept in two relevant concept sets, we collect its categories set from the Wikipedia Category Graph (WCG). Based on the categories in WCG network, the relatedness between concepts at the correspondence position of the two sorted concept sets is computed as the association coefficient. Thirdly, based on this parameter, a novel relatedness assessment metric is presented. The evaluation is performed on some datasets well-recognized as benchmarks, using several widely used metrics and a new metric designed by ourselves. The result demonstrates that our method has a better correlation with the intuitions of human judgments than other related works.
机译:在人工智能,信息检索,心理学,生物学,语言学和认知科学等许多领域,概念之间的语义相关性评估是一个关键问题。因此,有几种方法通过利用知识库来表达概念的语义来评估相关性。但是,存在一些限制,例如高维空间,高计算复杂性,拟合非动态域。考虑到Wikipedia提供了非常大的覆盖范围,它是一种与域无关的百科全书库,已被许多方法用作巨大的语义资源。在本文中,我们提出了一种新颖的基于图的相关性评估方法,该方法利用了Wikipedia的功能来避免上述某些局限性和缺陷。首先,对于一个词对中的每个词,最朴素的ESA算法返回前k个最相关的Wikipedia概念,以减少显式语义分析(ESA)方法的维数空间。其次,对于两个相关概念集中的每个候选概念,我们从Wikipedia类别图(WCG)收集其类别集。基于WCG网络中的类别,将两个分类概念集的对应位置处的概念之间的相关性计算为关联系数。第三,基于该参数,提出了一种新颖的相关性评估指标。使用一些广泛使用的指标和我们自己设计的新指标,对一些公认的基准数据集进行评估。结果表明,与其他相关著作相比,我们的方法与人类判断的直觉具有更好的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号