【24h】

Pointing the Unknown Words

机译:指向未知词

获取原文
获取外文期刊封面目录资料

摘要

The problem of rare and unknown words is an important issue that can potentially effect the performance of many NLP systems, including traditional count-based and deep learning models. We propose a novel way to deal with the rare and unseen words for the neural network models using attention. Our model uses two softmax layers in order to predict the next word in conditional language models: one predicts the location of a word in the source sentence, and the other predicts a word in the shortlist vocabulary. At each timestep, the decision of which softmax layer to use is adaptively made by an MLP which is conditioned on the context. We motivate this work from a psychological evidence that humans naturally have a tendency to point towards objects in the context or the environment when the name of an object is not known. Using our proposed model, we observe improvements on two tasks, neural machine translation on the Europarl English to French parallel corpora and text summarization on the Gigaword dataset.
机译:稀有单词和未知单词的问题是一个重要问题,可能会影响许多NLP系统的性能,包括传统的基于计数的学习模式和深度学习模型。我们提出了一种使用注意力来处理神经网络模型中稀有和看不见的单词的新颖方法。我们的模型使用两个softmax层来预测条件语言模型中的下一个单词:一个预测单词在源句子中的位置,另一层预测候选单词在词汇表中的位置。在每个时间步长,由哪个MLP自适应地决定要使用哪个softmax层,具体取决于上下文。我们从心理学证据中激发这一工作的动机是,当不知道物体的名称时,人类自然会倾向于指向背景或环境中的物体。使用我们提出的模型,我们观察到两个任务的改进:Europarl英语到法语并行语料库的神经机器翻译和Gigaword数据集的文本摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号