首页> 外文会议>Iberian conference on pattern recognition and image analysis >Combining Phrase-Based and Template-Based Alignment Models in Statistical Translation
【24h】

Combining Phrase-Based and Template-Based Alignment Models in Statistical Translation

机译:在统计翻译中结合基于词组和基于模板的对齐模型

获取原文

摘要

In statistical machine translation, single-word based models have an important deficiency; they do not take contextual information into account for the translation decision. A possible solution called Phrase-Based, consists in translating a sequence of words instead of a single word. We show how this approach obtains interesting results in some corpora. One shortcoming of the phrase-based alignment models is that they do not have the generalization capability in word reordering. A possible solution could be the template-based approach, which uses sequences of classes of words instead of sequences of words. We present a template-based alignment model that uses a Part Of Speech tagger for word classes. We also propose an improved model that combines both models. The basic idea is that if a sequence of words has been seen in training, the phrase-based model can be used; otherwise, the template-based model can be used. We present the results from different tasks.
机译:在统计机器翻译中,基于单词的模型具有重要的缺陷;他们没有考虑到翻译决定的背景信息。一种叫词组的可能解决方案,包括转换一系列单词而不是单个单词。我们展示了这种方法如何在某些基层中获得有趣的结果。基于短语的对齐模型的一个缺点是它们在重新排序中没有泛化能力。可能的解决方案可能是基于模板的方法,它使用单词类别的序列而不是单词序列。我们提出了一种基于模板的对齐模型,它使用了一部分语音标记用于Word类。我们还提出了一种改进的模型,这些模型结合了两个模型。基本思想是,如果在训练中看到一系列单词,则可以使用基于短语的模型;否则,可以使用基于模板的模型。我们提出了不同任务的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号