Conference: IEEE International Conference on Research, Innovation and Vision for the Future

A Dependency-based Word Reordering Approach for Statistical Machine Translation


Abstract

Reordering is of crucial importance for machine translation, and solving the reordering problem can lead to remarkable improvements in translation performance. In this paper, we propose a novel approach to the word reordering problem in Statistical Machine Translation. We rely on dependency relations retrieved from a statistical parser, combined with hand-crafted linguistic rules, to create reordering transformations. These dependency-based transformations can handle word movement in both phrase-level and word-level reordering, which is difficult for parse-tree-based approaches. The transformations are applied as a preprocessing step to the English side in both training and decoding, to obtain an underlying word order closer to that of Vietnamese. The hand-crafted rules are extracted from the syntactic word-order differences between English and Vietnamese. The approach is simple and easy to implement with a small rule set, and it does not lead to rule explosion. We describe experiments with our model on the VCLEVC corpus [18] for English-to-Vietnamese translation, showing significant improvements of about 2-4% BLEU over the MOSES phrase-based baseline system [19].
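The preprocessing idea in the abstract can be illustrated with a minimal sketch. It assumes a sentence that has already been dependency-parsed and a single illustrative rule: an adjectival modifier (`amod`) that precedes its head noun is moved to directly after it, since Vietnamese adjectives generally follow the noun. The names `Token`, `MOVE_AFTER_HEAD`, and `reorder` are hypothetical, and the rule is not taken from the paper's actual rule set.

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    head: int    # index of the head token in the sentence, -1 for the root
    deprel: str  # dependency label assigned by the parser, e.g. "amod"


# Hypothetical rule set: labels whose dependent should follow its head.
MOVE_AFTER_HEAD = {"amod"}


def reorder(tokens):
    """Move matching pre-head dependents to directly after their head."""
    order = list(range(len(tokens)))
    for i, tok in enumerate(tokens):
        if tok.deprel in MOVE_AFTER_HEAD and i < tok.head:
            order.remove(i)
            order.insert(order.index(tok.head) + 1, i)
    return [tokens[i].text for i in order]


# Hand-written parse of "I bought a red book" (labels are assumed).
sentence = [
    Token("I", 1, "nsubj"),
    Token("bought", -1, "root"),
    Token("a", 4, "det"),
    Token("red", 4, "amod"),
    Token("book", 1, "obj"),
]
print(" ".join(reorder(sentence)))  # I bought a book red
```

In a setup like the one described, the same function would be applied to the English side of the training corpus and to each input sentence before decoding, so that the phrase-based system only ever sees the reordered English.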
