首页> 外文会议>International conference on language resources and evaluation >Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?
【24h】

Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?

机译:小型并行语料库的统计后编辑能否节省弱的MT引擎?

获取原文

摘要

Statistical post-editing has been shown in several studies to increase BLEU score for rule-based MT systems. However, previous studies have relied solely on BLEU and have not conducted further study to determine whether those gains indicated an increase in quality or in score alone. In this work we conduct a human evaluation of statistical post-edited output from a weak rule-based MT system, comparing the results with the output of the original rule-based system and a phrase-based statistical MT system trained on the same data. We show that for this weak rule-based system, despite significant BLEU score increases, human evaluators prefer the output of the original system. While this is not a generally conclusive condemnation of statistical post-editing, this result does cast doubt on the efficacy of statistical post-editing for weak MT systems and on the reliability of BLEU score for comparison between weak rule-based and hybrid systems built from them.
机译:多项研究表明,统计后编辑可以提高基于规则的MT系统的BLEU分数。但是,以前的研究仅依靠BLEU,没有进行进一步的研究来确定这些增加是否表明质量或得分的提高。在这项工作中,我们对基于弱规则的MT系统的统计后编辑输出进行了人工评估,将结果与原始基于规则的系统和基于相同数据训练的基于短语的统计MT系统的输出进行了比较。我们表明,对于这个基于规则的薄弱系统,尽管BLEU得分显着提高,但人类评估者还是更喜欢原始系统的输出。尽管这并不是对统计后编辑的普遍结论性谴责,但这一结果确实使人怀疑统计后编辑对弱MT系统的功效以及BLEU分数在基于规则的弱系统和混合系统之间进行比较的可靠性。他们。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号