
The representational geometry of word meanings acquired by neural machine translation models


Abstract

This work is the first comprehensive analysis of the properties of word embeddings learned by neural machine translation (NMT) models trained on bilingual texts. We show that the word representations of NMT models outperform those learned from monolingual text by established algorithms such as Skipgram and CBOW on tasks that require knowledge of semantic similarity and/or lexical-syntactic role. These effects hold when translating from English to French and from English to German, and we argue that the desirable properties of NMT word embeddings should emerge largely independently of the source and target languages. Further, we apply a recently-proposed heuristic method for training NMT models with very large vocabularies, and show that this vocabulary expansion method results in minimal degradation of embedding quality. This allows us to make a large vocabulary of NMT embeddings available for future research and applications. Overall, our analyses indicate that NMT embeddings should be used in applications that require word concepts to be organised according to similarity and/or lexical function, while monolingual embeddings are better suited to modelling (nonspecific) inter-word relatedness.
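To make the kind of analysis described above concrete, the sketch below shows one hedged way to probe an embedding table exported from a trained NMT encoder (or, equally, from a Skipgram/CBOW model) by querying nearest neighbours under cosine similarity; the names `embedding_matrix` and `vocab` are hypothetical placeholders, not artifacts of the paper's code.

```python
# Minimal sketch (not the authors' implementation): inspect word embeddings
# by cosine nearest neighbours. Assumes `embedding_matrix` is a [vocab_size, dim]
# NumPy array exported from a trained model (e.g. the NMT encoder's source
# embedding table) and `vocab` maps words to row indices.
import numpy as np

def nearest_neighbours(embeddings, vocab, query, k=5):
    """Return the k words closest to `query` by cosine similarity."""
    idx = vocab[query]
    # L2-normalise rows so that a dot product equals cosine similarity.
    vecs = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = vecs @ vecs[idx]
    order = np.argsort(-sims)
    inv_vocab = {i: w for w, i in vocab.items()}
    return [(inv_vocab[i], float(sims[i])) for i in order if i != idx][:k]

# Hypothetical usage, once an embedding table and vocabulary are loaded:
# print(nearest_neighbours(embedding_matrix, vocab, "teacher"))
```

Comparing the neighbour lists produced by NMT-derived embeddings and monolingual embeddings on such queries is one simple way to observe the similarity-versus-relatedness contrast the abstract describes.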
