首页> 外文会议>Advances in Natural Language Processing >Augmenting Word Space Models for Word Sense Discrimination Using an Automatic Thesaurus
【24h】

Augmenting Word Space Models for Word Sense Discrimination Using an Automatic Thesaurus

机译:使用自动词库的增强词空间模型以进行词义识别

获取原文
获取原文并翻译 | 示例

摘要

This paper presents an algorithm for Word Sense Discrimination that divides the global representation of a word into a number of classes by determining for any two occurrences whether they belong to the same sense or not. We rely on the notion that words that are used in similar contexts will have the same or a closely related meaning, thus, given a target word, we group its dependency co-occurrences in a Word Space Model. Each cluster represents a distinct meaning or sense of that word. We experiment with augmenting the bag of words of each cluster of co-occurrences, the dictionary of sense definition, and augmenting both. Then we count the number of intersections of each word of the bag of clustered senses and the bag of the dictionary of senses following the Lesk method. We find an increase in recall and a decrease in precision when augmenting. However, the best resulting F-measure is for the option of augmenting the both dictionary of senses and the bag of words from the clusters.
机译:本文提出了一种词义识别算法,该算法通过确定任意两次出现是否属于同一个词义,将词的全局表示形式划分为多个类别。我们依赖于这样的概念,即在相似上下文中使用的词将具有相同或紧密相关的含义,因此,给定目标词,我们将其依存共存分组在词空间模型中。每个簇代表该词的不同含义或意义。我们尝试扩大每个共现簇的词袋,意义定义词典并同时增强两者。然后,根据Lesk方法,计算聚类感官包和感官字典包中每个单词的交点数。我们发现增强时召回率增加而精度下降。但是,最佳结果F度量是用于增加感官字典和群集词组的选项。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号