首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Zero-shot Word Sense Disambiguation using Sense Definition Embeddings
【24h】

Zero-shot Word Sense Disambiguation using Sense Definition Embeddings

机译:使用Sense Definition eMbeddings Zero-Shot Word感觉歧义

获取原文

摘要

Word Sense Disambiguation (WSD) is a longstanding but open problem in Natural Language Processing (NLP). WSD corpora are typically small in size, owing to an expensive annotation process. Current supervised WSD methods treat senses as discrete labels and also resort to predicting the Most-Frequent-Sense (MFS) for words unseen during training. This leads to poor performance on rare and unseen senses. To overcome this challenge, we propose Extended WSD Incorporating Sense Embeddings (EWISE), a supervised model to perform WSD by predicting over a continuous sense embedding space as opposed to a discrete label space. This allows EWISE to generalize over both seen and unseen senses, thus achieving generalized zero-shot learning. To obtain target sense embeddings, EWISE utilizes sense definitions. EWISE learns a novel sentence encoder for sense definitions by using WordNet relations and also ConvE, a recently proposed knowledge graph embedding method. We also compare EWISE against other sentence encoders pretrained on large corpora to generate definition embeddings. EWISE achieves new state-of-the-art WSD performance.
机译:词感消解(WSD)是自然语言处理(NLP)中的长期但是开放的问题。由于昂贵的注释过程,WSD Corpora的规模通常很小。当前监督的WSD方法将感官视为离散标签,也可以诉诸于在培训期间预测无声的最常见的感觉(MFS)。这导致稀有和看不见的感官表现不佳。为了克服这一挑战,我们提出了扩展的WSD,该WSD包含了感知嵌入的(E),通过预测连续感知嵌入空间来执行WSD,而不是离散标签空间。这允许EVEWELE概括过两种看见和看不见的感官,从而实现广义零射击学习。要获得目标感知嵌入,EWSE使用感测定义。通过使用Wordnet关系并达到最近提出的知识图形嵌入方法,EWESE学习一个新颖的句子编码器。我们还与大型语料库上的其他句子编码器进行比较,以生成定义嵌入。 EWSEWS实现了新的最先进的WSD性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号