首页> 外文期刊>ACM transactions on Asian language information processing >Adaptation of Language Models for SMT Using Neural Networks with Topic Information
【24h】

Adaptation of Language Models for SMT Using Neural Networks with Topic Information

机译:使用带有主题信息的神经网络调整SMT语言模型

获取原文
获取原文并翻译 | 示例
       

摘要

Neural network language models (LMs) are shown to be effective in improving the performance of statistical machine translation (SMT) systems. However, state-of-the-art neural network LMs usually use words before the current position as context and neglect global topic information, which can help machine translation (MT) systems to select better translation candidates from a higher perspective. In this work, we propose improvement of the state-of-the-art feedforward neural language model with topic information. Two main issues need to be tackled when adding topics into neural network LMs for SMT: one is how to incorporate topics to the neural network; the other is how to get target-side topic distribution before translation. We incorporate topics by appending topic distribution to the input layer of a feedforward LM. We adopt a multinomial logistic-regression (MLR) model to predict the target-side topic distribution based on source side information. Moreover, we propose a feedforward neural network model to learn joint representations on the source side for topic prediction. LM experiments demonstrate that the perplexity on validation set can be greatly reduced by the topic-enhanced feedforward LM, and the prediction of target-side topics can be improved dramatically with the MLR model equipped with the joint source representations. A final MT experiment, conducted on a large-scale Chinese-English dataset, shows that our feedforward LM with predicted topics improves the translation performance against a strong baseline.
机译:神经网络语言模型(LM)被证明可有效地改善统计机器翻译(SMT)系统的性能。但是,最新的神经网络LM通常将当前位置之前的单词用作上下文,而忽略全局主题信息,这可以帮助机器翻译(MT)系统从更高的角度选择更好的翻译候选者。在这项工作中,我们建议使用主题信息来改进最新的前馈神经语言模型。将主题添加到SMT的神经网络LM中时,需要解决两个主要问题:一个是如何将主题合并到神经网络中。另一个是在翻译之前如何获得目标端主题分布。我们通过将主题分布附加到前馈LM的输入层来合并主题。我们采用多项式Lo​​gistic回归(MLR)模型来基于源方面的信息预测目标方面的主题分布。此外,我们提出了一个前馈神经网络模型,以在源端学习联合表示以进行主题预测。 LM实验表明,通过主题增强的前馈LM可以大大减少验证集上的困惑,并且通过配备联合源表示的MLR模型可以显着改善目标侧主题的预测。在大规模的汉英数据集上进行的最终MT实验表明,具有预期主题的前馈LM相对于强大的基准可以提高翻译性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号