【24h】

Character-based Neural Machine Translation

机译:基于字符的神经机器翻译

获取原文

摘要

Neural Machine Translation (MT) has reached state-of-the-art results. However, one of the main challenges that neural MT still faces is dealing with very large vocabularies and morphologically rich languages. In this paper, we propose a neural MT system using character-based embeddings in combination with convolutional and highway layers to replace the standard lookup-based word representations. The resulting unlimited-vocabulary and affix-aware source word embeddings are tested in a state-of-the-art neural MT based on an attention-based bidirectional recurrent neural network. The proposed MT scheme provides improved results even when the source language is not morphologically rich. Improvements up to 3 BLEU points are obtained in the German-English WMT task.
机译:神经机器翻译(MT)已达到最新水平。然而,神经MT仍然面临的主要挑战之一是处理非常大的词汇和形态丰富的语言。在本文中,我们提出了一种基于神经网络的MT系统,该系统将基于字符的嵌入与卷积和高速公路层相结合,以取代基于标准查找的单词表示形式。基于基于注意力的双向递归神经网络,在最新的神经MT中测试生成的无限词汇量和词缀感知源词嵌入。即使源语言在形态上不丰富,所提出的MT方案也提供了改进的结果。德英WMT任务最多可提高3个BLEU点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号