Predicting and interpreting embeddings for out of vocabulary words in downstream tasks

1st EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2018


Abstract

We propose a novel way to handle out of vocabulary (OOV) words in downstream natural language processing (NLP) tasks. We implement a network that predicts useful embeddings for OOV words based on their morphology and on the context in which they appear. Our model also incorporates an attention mechanism indicating the focus allocated to the left context words, the right context words, or the word's characters, hence making the prediction more interpretable. The model is a "drop-in" module that is jointly trained with the downstream task's neural network, thus producing embeddings specialized for the task at hand. When the task is mostly syntactic, we observe that our model focuses most of its attention on surface-form characters. For more semantic tasks, on the other hand, the network allocates more attention to the surrounding words. In all our tests, the module helps the network achieve better performance compared to using simple random embeddings.
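The page carries the architecture in prose only. As a rough sketch of the mechanism the abstract outlines — an attention distribution over three sources (left context words, right context words, and the OOV word's characters) whose weighted sum becomes the predicted embedding — the following PyTorch module may help. All class and parameter names, the dimensions, and the choice of GRU encoders are illustrative assumptions, not the authors' implementation.

import torch
import torch.nn as nn

class OOVEmbeddingPredictor(nn.Module):
    # Hypothetical sketch: predict an embedding for an OOV word from its
    # characters and its surrounding context, exposing interpretable
    # attention weights over the three information sources.
    def __init__(self, char_vocab_size, word_emb_dim, char_emb_dim=32):
        super().__init__()
        self.char_emb = nn.Embedding(char_vocab_size, char_emb_dim)
        # One encoder per source; GRUs are an assumption for illustration.
        self.char_rnn = nn.GRU(char_emb_dim, word_emb_dim, batch_first=True)
        self.left_rnn = nn.GRU(word_emb_dim, word_emb_dim, batch_first=True)
        self.right_rnn = nn.GRU(word_emb_dim, word_emb_dim, batch_first=True)
        self.attn = nn.Linear(word_emb_dim, 1)  # scores each source

    def forward(self, left_ctx, right_ctx, char_ids):
        # left_ctx, right_ctx: (batch, seq, word_emb_dim) context embeddings
        # char_ids: (batch, n_chars) character indices of the OOV word
        _, h_left = self.left_rnn(left_ctx)
        _, h_right = self.right_rnn(right_ctx)
        _, h_char = self.char_rnn(self.char_emb(char_ids))
        # (batch, 3, dim): one summary vector per source
        sources = torch.stack([h_left[0], h_right[0], h_char[0]], dim=1)
        # (batch, 3): attention over left context / right context / characters
        weights = torch.softmax(self.attn(sources).squeeze(-1), dim=1)
        embedding = (weights.unsqueeze(-1) * sources).sum(dim=1)
        return embedding, weights

predictor = OOVEmbeddingPredictor(char_vocab_size=100, word_emb_dim=50)
left = torch.randn(2, 4, 50)           # 4 words of left context
right = torch.randn(2, 4, 50)          # 4 words of right context
chars = torch.randint(0, 100, (2, 8))  # 8 characters of the OOV word
embedding, weights = predictor(left, right, chars)

Because such a module returns its attention weights alongside the predicted embedding, one can inspect where the mass goes — toward the character source on syntactic tasks or toward the context sources on semantic ones, as the abstract reports — and, being an ordinary nn.Module, it can be trained jointly with the downstream network's loss, matching the "drop-in" design the abstract describes.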