A Deep Dive into Word Sense Disambiguation with LSTM

Abstract

LSTM-based language models have been shown effective in Word Sense Disambiguation (WSD). In particular, the technique proposed by Yuan et al. (2016) returned state-of-the-art performance in several benchmarks, but neither the training data nor the source code was released. This paper presents the results of a reproduction study and analysis of this technique using only openly available datasets (GigaWord, SemCor, OMSTI) and software (TensorFlow). Our study showed that similar results can be obtained with much less data than hinted at by Yuan et al. (2016). Detailed analyses shed light on the strengths and weaknesses of this method. First, adding more unannotated training data is useful, but is subject to diminishing returns. Second, the model can correctly identify both popular and unpopular meanings. Finally, the limited sense coverage in the annotated datasets is a major limitation. All code and trained models are made freely available.

Bibliographic record

Source: International Conference on Computational Linguistics | 2018 | lxxi 652 p. | 12 pages
Authors: Minh Le; Marten Postma; Jacopo Urbani; Piek Vossen
Original format: PDF
CLC classification: Programming, software engineering

Similar literature

1. A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation [J]. Saeed Ali, Nawab Rao Muhammad Adeel, Stevenson Mark. ACM Transactions on Asian Language Information Processing, 2019, No. 4.
2. Word sense disambiguation for Punjabi language using deep learning techniques [J]. Neural Computing & Applications, 2020, No. 8.
3. deepBioWSD: effective deep neural word sense disambiguation of biomedical text data [J]. Ahmad Pesaranghader, Stan Matwin, Marina Sokolova. Journal of the American Medical Informatics Association, 2019, No. 5.
4. A Deep Dive into Word Sense Disambiguation with LSTM [C]. Minh Le, Marten Postma, Jacopo Urbani. International Conference on Computational Linguistics, 2018.
5. Subjectivity word sense disambiguation: A method for sense-aware subjectivity analysis [D]. Akkaya, Cem. 2014.
6. deepBioWSD: effective deep neural word sense disambiguation of biomedical text data [O]. Ahmad Pesaranghader, Stan Matwin, Marina Sokolova. 2019.
7. An analysis of word sense disambiguation in Bangla and English using supervised learning and a deep neural network classifier [O]. Pasha Maroof Ur Rahman. 2015.
8. Word Domain Disambiguation via Word Sense Disambiguation [R]. Sanfilippo, A. 2006.
