Combining Neural Language Models for Word Sense Induction

Abstract

Word sense induction (WSI) is the problem of grouping occurrences of an ambiguous word according to the sense expressed in each occurrence. A recently proposed approach to this task uses neural language models to generate possible substitutes for the ambiguous word in a particular context, and then clusters sparse bag-of-words vectors built from these substitutes. In this work, we apply this approach to the Russian language and improve it in two ways. First, we propose methods of combining left and right contexts, which result in better generated substitutes. Second, instead of using a fixed number of clusters for all ambiguous words, we propose a technique for selecting an individual number of clusters for each word. Our approach establishes a new state of the art, improving the current best WSI results for the Russian language on two RUSSE 2018 datasets by a large margin.
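
The pipeline described in the abstract (substitutes, sparse bag-of-words vectors, clustering with a per-word number of clusters) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' method: the abstract does not say which distance, clustering algorithm, or selection criterion is used, so this sketch uses cosine distance, average-linkage agglomerative clustering, and the silhouette score. The substitute lists are invented English examples; in the described approach they would be generated by neural language models. All function names are hypothetical.

```python
from collections import Counter
from math import sqrt


def bow_vector(substitutes):
    """Sparse bag-of-words vector for one occurrence: substitute -> count."""
    return Counter(substitutes)


def cosine_distance(a, b):
    """Cosine distance between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return 1.0 - dot / norm if norm else 1.0


def agglomerative(dist, k):
    """Average-linkage agglomerative clustering, merging until k clusters remain."""
    clusters = [[i] for i in range(len(dist))]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = sum(dist[p][q] for p in clusters[i] for q in clusters[j])
                d /= len(clusters[i]) * len(clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] += clusters.pop(j)
    labels = [0] * len(dist)
    for label, members in enumerate(clusters):
        for m in members:
            labels[m] = label
    return labels


def silhouette(dist, labels):
    """Mean silhouette score; points in singleton clusters contribute 0."""
    n = len(labels)
    total = 0.0
    for i in range(n):
        same = [dist[i][j] for j in range(n) if j != i and labels[j] == labels[i]]
        if not same:
            continue
        a = sum(same) / len(same)
        b = min(
            sum(dist[i][j] for j in range(n) if labels[j] == c) / labels.count(c)
            for c in set(labels) if c != labels[i]
        )
        total += (b - a) / max(a, b) if max(a, b) > 0 else 0.0
    return total / n


def induce_senses(substitutes_per_occurrence, max_k=5):
    """Cluster occurrences of one word, choosing the cluster count for that word.

    Assumption: the cluster count is chosen by maximizing the silhouette score.
    """
    vectors = [bow_vector(s) for s in substitutes_per_occurrence]
    dist = [[cosine_distance(a, b) for b in vectors] for a in vectors]
    best_labels, best_score = [0] * len(vectors), float("-inf")
    for k in range(2, min(max_k, len(vectors) - 1) + 1):
        labels = agglomerative(dist, k)
        score = silhouette(dist, labels)
        if score > best_score:
            best_labels, best_score = labels, score
    return best_labels


# Invented substitutes for four occurrences of the ambiguous word "bank".
occurrences = [
    ["river", "shore", "coast"],
    ["shore", "coast", "beach"],
    ["lender", "firm", "company"],
    ["company", "lender", "institution"],
]
labels = induce_senses(occurrences)
```

In this toy input the first two occurrences share substitutes with each other and none with the last two, so the selection step settles on two clusters. A different word with more distinct substitute groups would receive a different cluster count, which is the point of choosing the number per word.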

Bibliographic details

Source
International Conference on Analysis of Images, Social Networks, and Texts | 2019 | 426p | 17 pages in total
Authors
Nikolay Arefyev; Boris Sheludko; Tatiana Aleksashina
Original format: PDF
Chinese Library Classification: TP393-53
Keywords
Word sense induction; Contextual substitutes; Neural language models

Similar literature


1. Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition [J]. Ryo MASUMURA, Taichi ASAMI, Takanobu OBA. IEICE Transactions on Information and Systems. 2019, No. 12
2. A Two-Level Recurrent Neural Network Language Model Based on the Continuous Bag-of-Words Model for Sentence Classification [J]. Lee Yo Han, Kim Dong W., Lim Myo Taeg. International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms. 2019, No. 1
3. Combining Language Modeling and LSA on Greek Song "Words" for Mood Classification [J]. Katia L. Kermanidis, Ioannis Karydis, Antonis Koursoumis. International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms. 2014, No. 2
4. Combining Neural Language Models for Word Sense Induction [C]. Nikolay Arefyev, Boris Sheludko, Tatiana Aleksashina. International Conference on Analysis of Images, Social Networks and Texts. 2019
5. Language Evolves, So Should WordNet: Automatically Extending WordNet with the Senses of Out of Vocabulary Lemmas [D]. Rusert, Jonathan. 2017
6. Learning to Read Words in a New Language Shapes the Neural Organization of the Prior Languages [O]. Leilei Mei, Gui Xue, Zhong-Lin Lu
7. Combining Neural Language Models for Word Sense Induction [O]. Nikolay Arefyev, Boris Sheludko, Tatiana Aleksashina. 2019
