首页> 外文会议>International conference on analysis of Images, social networks and texts >Combining Neural Language Models for Word Sense Induction
【24h】

Combining Neural Language Models for Word Sense Induction

机译:结合神经语言模型进行词义归纳

获取原文

摘要

Word sense induction (WSI) is the problem of grouping occurrences of an ambiguous word according to the expressed sense of this word. Recently a new approach to this task was proposed, which generates possible substitutes for the ambiguous word in a particular context using neural language models, and then clusters sparse bag-of-words vectors built from these substitutes. In this work, we apply this approach to the Russian language and improve it in two ways. First, we propose methods of combining left and right contexts, resulting in better substitutes generated. Second, instead of fixed number of clusters for all ambiguous words we propose a technique for selecting individual number of clusters for each word. Our approach established new state-of-the-art level, improving current best results of WSI for the Russian language on two RUSSE 2018 datasets by a large margin.
机译:词义归纳(WSI)是根据该词的表达意义对歧义词的出现进行分组的问题。最近,提出了一种用于此任务的新方法,该方法使用神经语言模型在特定上下文中为歧义词生成可能的替代词,然后对由这些替代词构建的稀疏词袋向量进行聚类。在这项工作中,我们将这种方法应用于俄语,并通过两种方式对其进行改进。首先,我们提出了组合左右上下文的方法,从而产生更好的替代品。第二,我们提出了一种为每个单词选择单独数目的簇的技术,而不是为所有歧义词固定数目的簇。我们的方法建立了新的最先进水平,大大提高了在两个RUSSE 2018数据集上针对俄语使用的WSI的当前最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号