首页> 中文期刊> 《计算机与现代化》 >融合时态特征的日英层次短语翻译模型

融合时态特征的日英层次短语翻译模型

         

摘要

In view of the problem that limited contextual information is used in the hierarchical phrase-based ( HPB) translation model and the quality of tense translation is not high, this paper proposes a method to integrate tense features into Japanese-Eng-lish HPB translation. Our method adopts the information of tense as constraints for tense classification model construction, and in-tegrates tense features into HPB translation model, the decoder can get the best-matching rules according to the results of potential tense classification of rules. Firstly, we extract training data from bilingual training corpus to train tense classification models by using maximum entropy. Secondly, we extract tense features from hierarchy phrase rules to classify each kind of rules which in-clude tense information, then we take the tense classification results as a kind of new translation features, and integrate the fea-tures into hierarchy phrase-based translation model. The experimental results show that our method can achieve good performance in Japanese-English HPB translation.%针对基于层次短语翻译模型的统计机器翻译使用上下文信息有限,时态翻译质量不高的问题,提出一种融合时态特征的日英统计机器翻译方法.该方法通过引入翻译规则的时态分类约束信息,解码器可以根据每条规则的潜在时态分类,为相应时态的句子匹配到最合适的规则进行翻译.首先从双语训练语料中抽取时态特征构建最大熵分类模型,然后再抽取包含各类时态信息的层次短语规则的时态特征,最后将规则的时态分类结果作为一类新特征,融入基于层次短语的翻译系统中.实验结果表明,与基线系统相比,该方法在多个测试集上提高了翻译质量,在一定程度上解决了日英层次短语模型的时态翻译问题.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号