首页> 外文会议>IEEE International Conference on Information Science and Technology >Syntax encapsulated phrase model for statistical machine translation
【24h】

Syntax encapsulated phrase model for statistical machine translation

机译:语法封装短语统计机器翻译模型

获取原文

摘要

In the past few years, much attention has been paid on extending phrase-based statistical machine translation with syntactic structures. In this paper we introduce a novel syntax encapsulated phrase(SEP) model, in which treebank tag sequences are employed to decorate the bilingual phrase pairs. We use tag sequences, instead of phrase pairs, to train the lexicalized reordering model. Since the number of treebank tags is much smaller than the number of words, the tag sequence based reordering model is smaller and more accurate than the phrase based reordering model. Experiments were carried out on four types of models: the phrase model, the hierarchical phrase model, the POS tag encapsulated phrase(PTEP) model and the syntactic tag encapsulated phrase(STEP) model. The STEP model obtained higher BLEU-4 score than other models on NIST 2005 MT task.
机译:在过去的几年里,在扩展基于短语的统计机器翻译与句法结构的情况下,已经支付了很多关注。 在本文中,我们介绍了一种新颖的语法封装短语(SEP)模型,其中使用树木班币序列来装饰双语短语对。 我们使用标签序列,而不是短语对,以培训lexicalized重新排序模型。 由于树木间标签的数量远小于单词的数量,因此基于标签序列的重新排序模型比基于短语的重新排序模型更小,更准确。 在四种类型的模型中进行了实验:短语模型,分层短语模型,POS标签封装短语(PTEP)模型和句法标签封装短语(步骤)模型。 步骤模型获得了比NIST 2005 MT任务的其他模型更高的BLEU-4分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号