Computer Speech and Language

Context-dependent word representation for neural machine translation


Abstract

We first observe a potential weakness of continuous vector representations of symbols in neural machine translation: the continuous vector representation, or word embedding vector, of a symbol encodes multiple dimensions of similarity, which is equivalent to encoding more than one meaning of the word. As a consequence, the encoder and decoder recurrent networks in neural machine translation must spend a substantial amount of their capacity disambiguating source and target words based on the context defined by the source sentence. Based on this observation, in this paper we propose to contextualize the word embedding vectors using a nonlinear bag-of-words representation of the source sentence. Additionally, we propose to represent special tokens (such as numbers, proper nouns and acronyms) with typed symbols to facilitate translating those words that are not well suited to be translated via continuous vectors. Experiments on En-Fr and En-De reveal that the proposed contextualization and symbolization approaches significantly improve the translation quality of neural machine translation systems.
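The contextualization idea described in the abstract lends itself to a short illustration. The sketch below (PyTorch, with a hypothetical `ContextualizedEmbedding` module) averages the source sentence's word embeddings into a bag-of-words vector, passes it through a small nonlinear network, and uses the result as a sentence-level gate over each word embedding. The two-layer MLP, the sigmoid gate, and the element-wise product are illustrative assumptions, not the paper's exact formulation.

```python
# A minimal sketch of contextualized word embeddings, assuming PyTorch.
# The gating network and the element-wise combination are illustrative
# assumptions; the paper's exact formulation may differ.
import torch
import torch.nn as nn


class ContextualizedEmbedding(nn.Module):
    def __init__(self, vocab_size: int, emb_dim: int):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, emb_dim)
        # Nonlinear transform of the sentence-level bag-of-words vector.
        self.context_mlp = nn.Sequential(
            nn.Linear(emb_dim, emb_dim),
            nn.Tanh(),
            nn.Linear(emb_dim, emb_dim),
            nn.Sigmoid(),  # gate in (0, 1) over embedding dimensions
        )

    def forward(self, src_ids: torch.Tensor) -> torch.Tensor:
        # src_ids: (batch, seq_len) token indices of the source sentence.
        emb = self.embedding(src_ids)        # (batch, seq_len, emb_dim)
        bow = emb.mean(dim=1, keepdim=True)  # bag-of-words summary of the sentence
        gate = self.context_mlp(bow)         # (batch, 1, emb_dim)
        # Every word embedding is modulated by the same sentence-level gate,
        # suppressing dimensions (word senses) irrelevant to this sentence.
        return emb * gate


# Example usage with toy sizes.
layer = ContextualizedEmbedding(vocab_size=1000, emb_dim=64)
tokens = torch.randint(0, 1000, (2, 7))  # two source sentences of 7 tokens
contextual = layer(tokens)                # (2, 7, 64)
```

In a full system, the gated embeddings would presumably replace the raw embeddings fed to the encoder (and analogously on the decoder side), so that less recurrent capacity is spent on sense disambiguation.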
