首页> 外文期刊>IEICE transactions on information and systems >Syntax-Based Context Representation for Statistical Machine Translation
【24h】

Syntax-Based Context Representation for Statistical Machine Translation

机译:统计机器翻译的基于语法的上下文表示

获取原文
           

摘要

Learning semantic representation for translation context is beneficial to statistical machine translation (SMT). Previous efforts have focused on implicitly encoding syntactic and semantic knowledge in translation context by neural networks, which are weak in capturing explicit structural syntax information. In this paper, we propose a new neural network with a tree-based convolutional architecture to explicitly learn structural syntax information in translation context, thus improving translation prediction. Specifically, we first convert parallel sentences with source parse trees into syntax-based linear sequences based on a minimum syntax subtree algorithm, and then define a tree-based convolutional network over the linear sequences to learn syntax-based context representation and translation prediction jointly. To verify the effectiveness, the proposed model is integrated into phrase-based SMT. Experiments on large-scale Chinese-to-English and German-to-English translation tasks show that the proposed approach can achieve a substantial and significant improvement over several baseline systems.
机译:学习翻译上下文的语义表示有利于统计机器翻译(SMT)。先前的工作集中在通过神经网络在翻译上下文中隐式编码句法和语义知识,这在捕获显式结构语法信息方面较弱。在本文中,我们提出了一种新的基于树的卷积架构的神经网络,以在翻译上下文中显式学习结构语法信息,从而改善翻译预测。具体来说,我们首先将基于源语法分析树的并行句子转换为基于最小语法子树算法的基于语法的线性序列,然后在线性序列上定义基于树的卷积网络,以共同学习基于语法的上下文表示和翻译预测。为了验证有效性,将所提出的模型集成到基于短语的SMT中。在大规模的中文到英语和德语到英语的翻译任务上的实验表明,所提出的方法可以在几个基准系统上实现实质性的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号