Adaptation of Language Models for SMT Using Neural Networks with Topic Information

YINGGONG ZHAO; SHUJIAN HUANG; XIN-YU DAI; JIAJUN CHEN

首页> 外文期刊>ACM transactions on Asian language information processing >Adaptation of Language Models for SMT Using Neural Networks with Topic Information

【24h】

Adaptation of Language Models for SMT Using Neural Networks with Topic Information

机译：使用带有主题信息的神经网络调整SMT语言模型

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Neural network language models (LMs) are shown to be effective in improving the performance of statistical machine translation (SMT) systems. However, state-of-the-art neural network LMs usually use words before the current position as context and neglect global topic information, which can help machine translation (MT) systems to select better translation candidates from a higher perspective. In this work, we propose improvement of the state-of-the-art feedforward neural language model with topic information. Two main issues need to be tackled when adding topics into neural network LMs for SMT: one is how to incorporate topics to the neural network; the other is how to get target-side topic distribution before translation. We incorporate topics by appending topic distribution to the input layer of a feedforward LM. We adopt a multinomial logistic-regression (MLR) model to predict the target-side topic distribution based on source side information. Moreover, we propose a feedforward neural network model to learn joint representations on the source side for topic prediction. LM experiments demonstrate that the perplexity on validation set can be greatly reduced by the topic-enhanced feedforward LM, and the prediction of target-side topics can be improved dramatically with the MLR model equipped with the joint source representations. A final MT experiment, conducted on a large-scale Chinese-English dataset, shows that our feedforward LM with predicted topics improves the translation performance against a strong baseline.

机译：神经网络语言模型（LM）被证明可有效地改善统计机器翻译（SMT）系统的性能。但是，最新的神经网络LM通常将当前位置之前的单词用作上下文，而忽略全局主题信息，这可以帮助机器翻译（MT）系统从更高的角度选择更好的翻译候选者。在这项工作中，我们建议使用主题信息来改进最新的前馈神经语言模型。将主题添加到SMT的神经网络LM中时，需要解决两个主要问题：一个是如何将主题合并到神经网络中。另一个是在翻译之前如何获得目标端主题分布。我们通过将主题分布附加到前馈LM的输入层来合并主题。我们采用多项式Logistic回归（MLR）模型来基于源方面的信息预测目标方面的主题分布。此外，我们提出了一个前馈神经网络模型，以在源端学习联合表示以进行主题预测。 LM实验表明，通过主题增强的前馈LM可以大大减少验证集上的困惑，并且通过配备联合源表示的MLR模型可以显着改善目标侧主题的预测。在大规模的汉英数据集上进行的最终MT实验表明，具有预期主题的前馈LM相对于强大的基准可以提高翻译性能。

著录项

来源
《ACM transactions on Asian language information processing》 |2016年第3期|19.1-19.15|共15页
作者
YINGGONG ZHAO; SHUJIAN HUANG; XIN-YU DAI; JIAJUN CHEN;
展开▼
作者单位

State Key Laboratory for Novel Software Technology, Nanjing University, 163 Xianlin Avenue, Nanjing 210023, China;

State Key Laboratory for Novel Software Technology, Nanjing University, 163 Xianlin Avenue, Nanjing 210023, China;

State Key Laboratory for Novel Software Technology, Nanjing University, 163 Xianlin Avenue, Nanjing 210023, China;

State Key Laboratory for Novel Software Technology, Nanjing University, 163 Xianlin Avenue, Nanjing 210023, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Statistical machine translation; feedforward neural network language model; topic model; multinomial logistic regression; joint representation;

机译：统计机器翻译;前馈神经网络语言模型;主题模型;多项逻辑回归联合代表;

相似文献

外文文献
中文文献
专利

1. A Hybrid Language Model Based on a Recurrent Neural Network and Probabilistic Topic Modeling [J] . M. S. Kudinov, A. A. Romanenko Pattern recognition and image analysis: advances in mathematical theory and applications in the USSR . 2016,第3期

机译：基于递归神经网络和概率主题建模的混合语言模型
2. Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers [J] . Michael HENTSCHEL, Marc DELCROIX, Atsunori OGAWA, IEICE transactions on information and systems . 2019,第3期

机译：具有分解隐藏层的神经网络语言模型的基于特征的域自适应
3. An empirical study of statistical language models: n-gram language models vs. neural network language models [J] . Freha Mezzoudj, Abdelkader Benyettou International Journal of Innovative Computing and Applications . 2018,第4期

机译：统计语言模型的实证研究：n-gram语言模型与神经网络语言模型
4. Topic Detection for Language Model Adaptation of Highly-Inflected Languages by Using a Fuzzy Comparison Function [C] . Mirjam Sepesy Maucec, Zdravko Kacic European conference on speech communication and technology . 2001

机译：使用模糊比较功能主题检测对高变形语言的语言模型适应
5. Systematic Analysis of Deep Neural Networks: Retrieving Sensitive Samples via SMT Solving [D] . Docena, Amel Nestor B. 2020

机译：深度神经网络的系统分析：通过SMT求解检测敏感样本
6. Tracking Child Language Development With Neural Network Language Models [O] . Kenji Sagae 2021

机译：用神经网络语言模型跟踪儿童语言开发
7. Learning Topic Representation for SMT with Neural Networks∗ [O] . Lei Cui, Dongdong Zhang, Shujie Liu, 2015

机译：使用神经网络学习smT的主题表示*
8. Technical Topic 3.2.2.d Bayesian and Non-Parametric Statistics: Integration of Neural Networks with Bayesian Networks for Data Fusion and Predictive Modeling. [R] . Bell, S. 2016

机译：技术主题3.2.2.d贝叶斯和非参数统计：神经网络与贝叶斯网络的集成，用于数据融合和预测建模。

Adaptation of Language Models for SMT Using Neural Networks with Topic Information

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅