首页> 外文会议>International conference on computational linguistics >Decoder-based Discriminative Training of Phrase Segmentation for Statistical Machine Translation
【24h】

Decoder-based Discriminative Training of Phrase Segmentation for Statistical Machine Translation

机译:统计机器翻译的基于解码器的短语细分判别训练

获取原文

摘要

In this paper, we propose a new method of training phrase segmentation model for phrase-based statistical machine translation(SMT). We define a good segmentation as the segmentation producing a good translation. According to this definition, we propose a method that can discriminate between a good segmentation and a bad segmentation based on the translation quality. The proposed approach constructs the phrase labeled data by using the SMT decoder, so that the phrase segmentations supporting good translations can be acquired. Furthermore, our iterative training algorithm of the segmentation model can gradually improve the performance of the SMT decoder. Experimental results show that the proposed method is effective in improving the translation quality of the phrase-based SMT system.
机译:本文提出了一种基于短语的统计机器翻译(SMT)训练短语分割模型的新方法。我们将良好的细分定义为产生良好翻译的细分。根据此定义,我们提出了一种基于翻译质量可以区分好的分割和不好的分割的方法。所提出的方法通过使用SMT解码器构造短语标记的数据,从而可以获取支持良好翻译的短语分割。此外,我们的分割模型的迭代训练算法可以逐渐提高SMT解码器的性能。实验结果表明,该方法对提高基于短语的SMT系统的翻译质量是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号