首页> 外文会议> >Use of direct modeling in natural language generation for Chinese and English translation
【24h】

Use of direct modeling in natural language generation for Chinese and English translation

机译:在自然语言生成中使用直接建模进行中英文翻译

获取原文

摘要

This paper proposes a new direct-modeling-based approach to improve the maximum entropy based natural language generation (NLG) in the IBM MASTOR system, an interlingua-based speech translation system. Due to the intrinsic disparity between Chinese and English sentences, the previous method employed only linguistic constituents from output language sentences to train the NLG model. The new algorithm exploits a direct-modeling scheme to admit linguistic constituent information from both source and target languages into the training process seamlessly when incorporating a concept padding scheme. When concept sequences from the top level of semantic parse trees are considered, the concept error rate (CER) is significantly reduced to 14.3%, compared to 23.9% in the baseline NLG. Similarly, when concept sequences from all levels of semantic parse trees are tested, the direct-modeling scheme yields a CER of 10.8% compared to 17.8% in the baseline. A sensible improvement on the overall translation is made when the direct-modeling scheme improves the BLEU score from 0.252 to 0.294.
机译:本文提出了一种新的基于直接建模的方法,以改进基于语言的语音翻译系统IBM MASTOR系统中基于最大熵的自然语言生成(NLG)。由于中英文句子之间的内在差异,以前的方法仅使用输出语言句子中的语言成分来训练NLG模型。当结合概念填充方案时,新算法利用直接建模方案将来自源语言和目标语言的语言成分信息无缝地接受到训练过程中。当考虑来自语义分析树的顶级概念序列时,概念错误率(CER)显着降低至14.3%,而基线NLG中为23.9%。类似地,当测试来自语义分析树的所有级别的概念序列时,直接建模方案产生的CER为10.8%,而基线为17.8%。当直接建模方案将BLEU分数从0.252提高到0.294时,对整体翻译进行了有意义的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号