Journal: IEICE Transactions on Information and Systems

Bayesian Word Alignment and Phrase Table Training for Statistical Machine Translation

Abstract

In most phrase-based statistical machine translation (SMT) systems, the translation model relies on word alignment, which serves as a constraint for the subsequent building of a phrase table. Word alignment is usually inferred by GIZA++, which implements all the IBM models and the HMM model in the framework of Expectation Maximization (EM). In this paper, we present a fully Bayesian inference for word alignment. Unlike the EM approach, Bayesian inference makes use of all possible parameter values rather than estimating a single parameter value, from which we expect a more robust inference. After inferring the word alignment, current SMT systems usually train the phrase table from the Viterbi word alignment, which is prone to learning incorrect phrases because of word alignment mistakes. To overcome this drawback, a new phrase extraction method is proposed based on multiple Gibbs samples from Bayesian inference for word alignment. Empirical results show promising improvements over baselines in both alignment quality and translation performance.
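
The idea of drawing multiple Gibbs samples of the alignment, rather than committing to one Viterbi alignment, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a simplified IBM Model 1 with a symmetric Dirichlet prior on the translation distributions, integrates the parameters out (collapsed Gibbs sampling), and averages the samples into link posteriors. The function name `gibbs_align`, its parameters (`theta`, `burn_in`, and so on), and the averaging step are all hypothetical choices made for illustration; the abstract does not specify how the samples are combined during phrase extraction.

```python
import random
from collections import defaultdict


def gibbs_align(bitext, iterations=50, burn_in=10, theta=0.01, seed=0):
    """Collapsed Gibbs sampler for a Bayesian IBM Model 1 (illustrative sketch).

    bitext: list of (source_tokens, target_tokens) pairs.
    Each target word links to exactly one source position; position 0 is NULL.
    Returns, per sentence pair, a dict {(source_pos, target_pos): posterior}.
    """
    if iterations <= burn_in:
        raise ValueError("iterations must exceed burn_in")
    rng = random.Random(seed)
    vocab_size = len({f for _, tgt in bitext for f in tgt})
    pair = defaultdict(int)   # n(e, f): how often f is currently linked to e
    total = defaultdict(int)  # n(e): how many links e currently has

    # Random initial alignment, with counts kept in sync.
    aligns = []
    for src, tgt in bitext:
        src_n = ["<NULL>"] + list(src)
        a = [rng.randrange(len(src_n)) for _ in tgt]
        for j, i in enumerate(a):
            pair[(src_n[i], tgt[j])] += 1
            total[src_n[i]] += 1
        aligns.append(a)

    link_counts = [defaultdict(int) for _ in bitext]
    for it in range(iterations):
        for s, (src, tgt) in enumerate(bitext):
            src_n = ["<NULL>"] + list(src)
            a = aligns[s]
            for j, f in enumerate(tgt):
                # Remove the current link from the counts.
                e_old = src_n[a[j]]
                pair[(e_old, f)] -= 1
                total[e_old] -= 1
                # Predictive probability of each candidate link given all
                # other links, with the parameters integrated out.
                weights = [
                    (pair[(e, f)] + theta) / (total[e] + vocab_size * theta)
                    for e in src_n
                ]
                a[j] = rng.choices(range(len(src_n)), weights=weights)[0]
                e_new = src_n[a[j]]
                pair[(e_new, f)] += 1
                total[e_new] += 1
                # After burn-in, every sweep contributes one sample.
                if it >= burn_in:
                    link_counts[s][(a[j], j)] += 1

    n_samples = iterations - burn_in
    return [
        {link: count / n_samples for link, count in counts.items()}
        for counts in link_counts
    ]
```

Calling `gibbs_align` on a small tokenized parallel corpus returns, for each sentence pair, the fraction of post-burn-in samples in which each link appeared. A phrase extractor could then weight or threshold links by these fractions instead of trusting a single alignment, which is one possible way to soften the effect of individual alignment mistakes.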
