首页> 外文会议>Annual meeting of the Association for Computational Linguistics;ACL 2012 >Modified Distortion Matrices for Phrase-Based Statistical Machine Translation
【24h】

Modified Distortion Matrices for Phrase-Based Statistical Machine Translation

机译:基于短语的统计机器翻译的改进失真矩阵

获取原文

摘要

This paper presents a novel method to suggest long word reorderings to a phrase-based SMT decoder. We address language pairs where long reordering concentrates on few patterns, and use fuzzy chunk-based rules to predict likely reorderings for these phenomena. Then we use reordered n-gram LMs to rank the resulting permutations and select the n-best for translation. Finally we encode these reorderings by modifying selected entries of the distortion cost matrix, on a per-sentence basis. In this way, we expand the search space by a much finer degree than if we simply raised the distortion limit. The proposed techniques are tested on Arabic-English and German-English using well-known SMT benchmarks.
机译:本文提出了一种新颖的方法,可向基于短语的SMT解码器建议长单词重排。我们针对长时间重新排序集中于少量模式的语言对,并使用基于模糊块的规则来预测这些现象的可能重新排序。然后,我们使用重新排序的n-gram LM对排序结果进行排序,并选择n-best进行翻译。最终,我们通过修改基于句子的失真成本矩阵的选定条目来对这些重新排序进行编码。这样,与单纯提高失真极限相比,我们可以将搜索空间扩展得更好。使用众所周知的SMT基准在阿拉伯语-英语和德语-英语上对提出的技术进行了测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号