首页> 外文会议>International conference on language resources and evaluation >Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?
【24h】

Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?

机译:可以使用小并行语料保图进行统计后编辑,保存弱MT发动机?

获取原文

摘要

Statistical post-editing has been shown in several studies to increase BLEU score for rule-based MT systems. However, previous studies have relied solely on BLEU and have not conducted further study to determine whether those gains indicated an increase in quality or in score alone. In this work we conduct a human evaluation of statistical post-edited output from a weak rule-based MT system, comparing the results with the output of the original rule-based system and a phrase-based statistical MT system trained on the same data. We show that for this weak rule-based system, despite significant BLEU score increases, human evaluators prefer the output of the original system. While this is not a generally conclusive condemnation of statistical post-editing, this result does cast doubt on the efficacy of statistical post-editing for weak MT systems and on the reliability of BLEU score for comparison between weak rule-based and hybrid systems built from them.
机译:统计后编辑已在几项研究中显示,以增加基于规则的MT系统的BLEU分数。然而,以前的研究完全依赖于Bleu,并没有进一步研究,以确定这些收益是否表明了质量或单独的分数增加。在这项工作中,我们对基于弱规则的MT系统进行了对统计后编辑输出的人类评估,将结果与基于原始规则的系统输出和基于短语的统计MT系统进行了比较,并在相同的数据上培训。我们表明,对于基于弱规则的系统,尽管具有重要的BLEU分数增加,人类评估人员更喜欢原始系统的输出。虽然这不是统计编辑统计审视的普遍认为,但这结果确实对弱MT系统的统计后编辑的疗效以及Bleu评分的可靠性来促进疑虑,以便在基于弱规则和混合系统之间进行比较他们。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号