首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Machine Translation System Combination by Confusion Forest
【24h】

Machine Translation System Combination by Confusion Forest

机译:机器翻译系统混乱森林组合

获取原文

摘要

The state-of-the-art system combination method for machine translation (MT) is based on confusion networks constructed by aligning hypotheses with regard to word similarities. We introduce a novel system combination framework in which hypotheses are encoded as a confusion forest, a packed forest representing alternative trees. The forest is generated using syntactic consensus among parsed hypotheses: First, MT outputs are parsed. Second, a context free grammar is learned by extracting a set of rules that constitute the parse trees. Third, a packed forest is generated starting from the root symbol of the extracted grammar through non-terminal rewriting. The new hypothesis is produced by searching the best derivation in the forest. Experimental results on the WMT10 system combination shared task yield comparable performance to the conventional confusion network based method with smaller space.
机译:用于机器翻译(MT)的最先进的系统组合方法基于通过对齐关于字相似度的假设构成的混淆网络。我们介绍了一种新颖的系统组合框架,其中假设被编码为混乱的森林,这是一个代表替代树的包装森林。在解析假设中使用句法共识生成森林:首先,解析MT输出。其次,通过提取构成解析树的一组规则来学习一个上下文自由语法。第三,通过非终端重写从提取的语法的根符号开始生成包装的森林。通过在森林中寻找最佳推导来产生新的假设。 WMT10系统组合共享任务的实验结果将具有相当性能的较小空间的传统混乱网络方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号