首页> 外文会议>International Conference on Analysis of Images, Social Networks, and Texts >Combining Neural Language Models for Word Sense Induction
【24h】

Combining Neural Language Models for Word Sense Induction

机译:结合神经语言模型进行词义感应

获取原文

摘要

Word sense induction (WSI) is the problem of grouping occurrences of an ambiguous word according to the expressed sense of this word. Recently a new approach to this task was proposed, which generates possible substitutes for the ambiguous word in a particular context using neural language models, and then clusters sparse bag-of-words vectors built from these substitutes. In this work, we apply this approach to the Russian language and improve it in two ways. First, we propose methods of combining left and right contexts, resulting in better-substitutes generated. Second, instead of fixed number of clusters for all ambiguous words we propose a technique for selecting individual number of clusters for each word. Our approach established new state-of-the-art level, improving current best results of WSI for the Russian language on two RUSSE 2018 datasets by a large margin.
机译:单词感觉归纳(WSI)是根据这个词的表达的意义上分组模糊词的发生的问题。最近提出了一种新方法,提出了使用神经语言模型在特定上下文中为模糊字的可能替代品,然后从这些替代品构建的稀疏文字矢量群。在这项工作中,我们将这种方法应用于俄语语言,并以两种方式改进。首先,我们提出了组合左和右背景的方法,从而产生更好的替代品。其次,而不是针对所有模糊单词的固定数量的群集,我们提出了一种为每个单词选择单个群集数的技术。我们的方法建立了新的最先进的水平,通过大幅保证金提高了俄语2018年数据集的WSI的最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号