首页> 外文期刊>Information Processing & Management >Feature-based approaches to semantic similarity assessment of concepts using Wikipedia
【24h】

Feature-based approaches to semantic similarity assessment of concepts using Wikipedia

机译:基于特征的维基百科概念语义相似度评估方法

获取原文
获取原文并翻译 | 示例
       

摘要

Semantic similarity assessment between concepts is an important task in many language related applications. In the past, several approaches to assess similarity by evaluating the knowledge modeled in an (or multiple) ontology (or ontologies) have been proposed. However, there are some limitations such as the facts of relying on predefined ontologies and fitting non-dynamic domains in the existing measures. Wikipedia provides a very large domain-independent encyclopedic repository and semantic network for computing semantic similarity of concepts with more coverage than usual ontologies. In this paper, we propose some novel feature based similarity assessment methods that are fully dependent on Wikipedia and can avoid most of the limitations and drawbacks introduced above. To implement similarity assessment based on feature by making use of Wikipedia, firstly a formal representation of Wikipedia concepts is presented. We then give a framework for feature based similarity based on the formal representation of Wikipedia concepts. Lastly, we investigate several feature based approaches to semantic similarity measures resulting from instantiations of the framework. The evaluation, based on several widely used benchmarks and a benchmark developed in ourselves, sustains the intuitions with respect to human judgements. Overall, several methods proposed in this paper have good human correlation and constitute some effective ways of determining similarity between Wikipedia concepts.
机译:在许多语言相关的应用程序中,概念之间的语义相似性评估是一项重要任务。在过去,已经提出了几种通过评估在(或多个)本体(或本体)中建模的知识来评估相似性的方法。但是,存在一些局限性,例如依赖于预定义的本体并在现有措施中拟合非动态域。 Wikipedia提供了一个非常大的独立于域的百科全书存储库和语义网络,用于计算概念的语义相似性,其覆盖范围比普通本体更多。在本文中,我们提出了一些新颖的基于特征的相似度评估方法,这些方法完全依赖于Wikipedia,并且可以避免上面介绍的大多数局限性和缺陷。为了利用维基百科实现基于特征的相似度评估,首先提出了维基百科概念的形式表示。然后,我们基于Wikipedia概念的形式表示,给出基于特征的相似性的框架。最后,我们研究了基于框架实例化的几种基于特征的语义相似性度量方法。该评估基于几个广泛使用的基准和我们自己制定的基准,维持了有关人类判断的直觉。总体而言,本文提出的几种方法具有良好的人类相关性,并构成确定维基百科概念之间相似性的一些有效方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号