Modified Distortion Matrices for Phrase-Based Statistical Machine Translation

机译：基于短语的统计机器翻译的改进失真矩阵

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a novel method to suggest long word reorderings to a phrase-based SMT decoder. We address language pairs where long reordering concentrates on few patterns, and use fuzzy chunk-based rules to predict likely reorderings for these phenomena. Then we use reordered n-gram LMs to rank the resulting permutations and select the n-best for translation. Finally we encode these reorderings by modifying selected entries of the distortion cost matrix, on a per-sentence basis. In this way, we expand the search space by a much finer degree than if we simply raised the distortion limit. The proposed techniques are tested on Arabic-English and German-English using well-known SMT benchmarks.

机译：本文提出了一种新颖的方法，可向基于短语的SMT解码器建议长单词重排。我们针对长时间重新排序集中于少量模式的语言对，并使用基于模糊块的规则来预测这些现象的可能重新排序。然后，我们使用重新排序的n-gram LM对排序结果进行排序，并选择n-best进行翻译。最终，我们通过修改基于句子的失真成本矩阵的选定条目来对这些重新排序进行编码。这样，与单纯提高失真极限相比，我们可以将搜索空间扩展得更好。使用众所周知的SMT基准在阿拉伯语-英语和德语-英语上对提出的技术进行了测试。

著录项

来源
《Annual meeting of the Association for Computational Linguistics;ACL 2012》|2012年|p.478-487|共10页
会议地点
作者
Arianna Bisazza; Marcello Federico;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Integrating Rules and Dictionaries from Shallow-Transfer Machine Translation into Phrase-Based Statistical Machine Translation [J] . P#233, rez-Ortiz Juan Antonio, S#225, The Journal of Artificial Intelligence Research . 2016,第12期

机译：将规则和词典从浅传输机器翻译集成到基于短语的统计机器翻译
2. Integrating Rules and Dictionaries from Shallow-Transfer Machine Translation into Phrase-Based Statistical Machine Translation [J] . Sanchez-Cartagena Victor M., Antonio Perez-Ortiz Juan, Sanchez-Martinez Felipe The Journal of Artificial Intelligence Research . 2016,第Null期

机译：将规则和词典从浅传输机器翻译集成到基于短语的统计机器翻译
3. A unified framework and models for integrating translation memory into phrase-based statistical machine translation [J] . Yang Liu, Kun Wang, Chengqing Zong, Computer speech and language . 2019,第MARa期

机译：用于将翻译记忆库集成到基于短语的统计机器翻译中的统一框架和模型
4. Modified Distortion Matrices for Phrase-Based Statistical Machine Translation [C] . Arianna Bisazza, Marcello Federico Annual meeting of the Association for Computational Linguistics . 2012

机译：基于短语的统计机器翻译修改失真矩阵
5. Genuine Phrase-Based Statistical Machine Translation with Supervision [D] . Camacho, José Aires Gomes. 2016

机译：基于短语的统计机器翻译与监督
6. 3145 An Evaluation of Machine Learning and Traditional Statistical Methods for Discovery in Large-Scale Translational Data [O] . Megan C Hollister, Jeffrey D. Blume 2019

机译：3145对机器学习和传统统计方法的评估以发现大规模翻译数据
7. Integrating Rules and Dictionaries from Shallow-Transfer Machine Translation into Phrase-Based Statistical Machine Translation [O] . Sánchez-Cartagena, Víctor M., Pérez-Ortiz, Juan Antonio, Sánchez-Martínez, Felipe 2016

机译：将规则和字典从浅传输机器翻译集成到基于短语的统计机器翻译

Modified Distortion Matrices for Phrase-Based Statistical Machine Translation

摘要

著录项

相似文献

相关主题

期刊订阅