首页> 外文会议>Workshop on vector space Modeling for Natural Language Processing >Neural context embeddings for automatic discovery of word senses
【24h】

Neural context embeddings for automatic discovery of word senses

机译:神经背景嵌入自动发现词感觉

获取原文
获取外文期刊封面目录资料

摘要

Word sense induction (WSI) is the problem of automatically building an inventory of senses for a set of target words using only a text corpus. We introduce a new method for embedding word instances and their context, for use in WSI. The method, Instance-context embedding (ICE), leverages neural word embeddings, and the correlation statistics they capture, to compute high quality embeddings of word contexts. In WSI, these context embeddings are clustered to find the word senses present in the text. ICE is based on a novel method for combining word embeddings using continuous Skip-gram, based on both semantic and a temporal aspects of context words. ICE is evaluated both in a new system, and in an extension to a previous system for WSI. In both cases, we surpass previous state-of-the-art, on the WSI task of SemEval-2013, which highlights the generality of ICE. Our proposed system achieves a 33% relative improvement.
机译:单词感测归纳(WSI)是仅使用文本语料库自动构建一组目标单词的感官库存的问题。我们介绍了一种用于嵌入WSI的Word实例及其上下文的新方法,以便在WSI中使用。该方法,实例 - 上下文嵌入(ICE),利用神经单词嵌入,以及它们捕获的相关统计,以计算单词上下文的高质量嵌入。在WSI中,群集这些上下文嵌入式以查找文本中存在的词感觉。 ICE基于使用连续跳过的单词嵌入式组合的新方法,基于语义词语的语义和时间方面。 ICE在新系统中进行评估,并在一个用于WSI的先前系统的扩展中进行评估。在这两种情况下,我们在Semeval-2013的WSI任务上超越了以前的最先进,这突出了冰的一般性。我们所提出的系统相对改善达到33%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号