首页> 外文会议>International conference on computational linguistics >An Enhanced Lesk Word Sense Disambiguation Algorithm through a Distributional Semantic Model
【24h】

An Enhanced Lesk Word Sense Disambiguation Algorithm through a Distributional Semantic Model

机译:基于分布语义模型的增强型韭菜词义消歧算法

获取原文

摘要

This paper describes a new Word Sense Disambiguation (WSD) algorithm which extends two well-known variations of the Lesk WSD method. Given a word and its context, Lesk algorithm exploits the idea of maximum number of shared words (maximum overlaps) between the context of a word and each definition of its senses (gloss) in order to select the proper meaning. The main contribution of our approach relies on the use of a word similarity function defined on a distributional semantic space to compute the gloss-context overlap. As sense inventory we adopt Babel-Net, a large multilingual semantic network built exploiting both WordNet and Wikipedia. Besides linguistic knowledge, BabelNet also represents encyclopedic concepts coming from Wikipedia. The evaluation performed on SemEval-2013 Multilingual Word Sense Disambiguation shows that our algorithm goes beyond the most frequent sense baseline and the simplified version of the Lesk algorithm. Moreover, when compared with the other participants in SemEval-2013 task, our approach is able to outperform the best system for English.
机译:本文介绍了一种新的词义消除歧义(WSD)算法,该算法扩展了Lesk WSD方法的两个众所周知的变体。给定一个单词及其上下文,Lesk算法利用一个单词的上下文及其含义(光泽)的每个定义之间共享单词的最大数量(最大重叠)的思想,以选择适当的含义。我们方法的主要贡献在于使用在分布语义空间上定义的单词相似度函数来计算光泽上下文重叠。作为感觉清单,我们采用Babel-Net,这是一个利用WordNet和Wikipedia构建的大型多语言语义网络。除了语言知识,BabelNet还代表来自维基百科的百科全书概念。对SemEval-2013多语言词义歧义消除的评估表明,我们的算法超出了最常见的感知基线和Lesk算法的简化版本。此外,与参加SemEval-2013任务的其他参与者相比,我们的方法能够胜过最佳英语系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号