Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers

Michael HENTSCHEL; Marc DELCROIX; Atsunori OGAWA; Tomoharu IWATA; Tomohiro NAKATANI

首页> 外文期刊>IEICE transactions on information and systems >Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers

【24h】

Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers

机译：具有分解隐藏层的神经网络语言模型的基于特征的域自适应

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Language models are a key technology in various tasks, such as, speech recognition and machine translation. They are usually used on texts covering various domains and as a result domain adaptation has been a long ongoing challenge in language model research. With the rising popularity of neural network based language models, many methods have been proposed in recent years. These methods can be separated into two categories: model based and feature based adaptation methods. Feature based domain adaptation has compared to model based domain adaptation the advantage that it does not require domain labels in the corpus. Most existing feature based adaptation methods are based on bias adaptation. We propose a novel feature based domain adaptation technique using hidden layer factorisation. This method is fundamentally different from existing methods because we use the domain features to calculate a linear combination of linear layers. These linear layers can capture domain specific information and information common to different domains. In the experiments, we compare our proposed method with existing adaptation methods. The compared adaptation techniques are based on two different ideas, that is, bias based adaptation and gating of hidden units. All language models in our comparison use state-of-the-art long short-term memory based recurrent neural networks. We demonstrate the effectiveness of the proposed method with perplexity results for the well-known Penn Treebank and speech recognition results for a corpus of TED talks.

机译：语言模型是语音识别和机器翻译等各种任务中的关键技术。它们通常用于涵盖各个领域的文本上，因此，领域适应一直是语言模型研究中长期存在的挑战。随着基于神经网络的语言模型的日益普及，近年来已经提出了许多方法。这些方法可以分为两类：基于模型的适应方法和基于特征的适应方法。与基于模型的领域自适应相比，基于特征的领域自适应具有以下优势：不需要语料库中的域标签。大多数现有的基于特征的自适应方法都是基于偏差自适应的。我们提出了一种使用隐藏层分解的基于特征的领域自适应技术。此方法与现有方法根本不同，因为我们使用域特征来计算线性层的线性组合。这些线性层可以捕获特定于域的信息以及不同域共有的信息。在实验中，我们将我们提出的方法与现有的适应方法进行了比较。比较的自适应技术基于两种不同的思想，即基于偏差的自适应和隐藏单元的门控。我们比较中的所有语言模型都使用基于最新的长期短期记忆的递归神经网络。我们用著名的Penn Treebank的困惑结果和TED演讲的语音识别结果证明了该方法的有效性。

著录项

来源
《IEICE transactions on information and systems》 |2019年第3期|共11页
作者
Michael HENTSCHEL; Marc DELCROIX; Atsunori OGAWA; Tomoharu IWATA; Tomohiro NAKATANI;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词
language modelLSTMdomain adaptationunsupervisedlatent Dirichlet allocation;

机译：语言模型LSTM域自适应非监督潜在Dirichlet分配;

相似文献

外文文献
中文文献
专利

1. Factorized Hidden Layer Adaptation for Deep Neural Network Based Acoustic Modeling [J] . Lahiru Samarakoon, Khe Chai Sim Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第12期

机译：基于深度神经网络的声学建模的分解隐藏层自适应
2. A new deep neural network based on a stack of single-hidden-layer feedforward neural networks with randomly fixed hidden neurons [J] . Hu Junying, Zhang Jiangshe, Zhang Chunxia, Neurocomputing . 2016,第JANa1期

机译：一种新的深度神经网络，该网络基于具有随机固定的隐藏神经元的单层前馈神经网络的堆栈
3. Threefold vs. Fivefold Cross Validation in One-Hidden-Layer and Two-Hidden-Layer Predictive Neural Network Modeling of Machining Surface Roughness Data [J] . Chang-Xue Jack Feng, Zhi-Guang (Samuel) Yu, Unnati Kingi, Journal of Manufacturing Systems . 2005,第2期

机译：加工表面粗糙度数据的一隐藏层和两隐藏层预测神经网络建模中的三重与五重交叉验证
4. Factorised Hidden Layer Based Domain Adaptation for Recurrent Neural Network Language Models [C] . Michael Hentschel, Marc Delcroix, Atsunori Ogawa, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2018

机译：递归神经网络语言模型的基于分解隐藏层的域自适应
5. Physiologically-based vision modeling applications and gradient descent-based parameter adaptation of pulse coupled neural networks. [D] . Broussard, Randy Paul. 1997

机译：基于生理的视觉建模应用和基于梯度下降的脉冲耦合神经网络参数自适应。
6. Rapid Airplane Detection in Remote Sensing Images Based on Multilayer Feature Fusion in Fully Convolutional Neural Networks [O] . Yuelei Xu, Mingming Zhu, Peng Xin, 2018

机译：全卷积神经网络中基于多层特征融合的遥感图像快速飞机检测
7. Layered Neural Network training with Model Switching and Hidden Layer Feature Regularization [O] . Keisuke Kameyama, Kei Taga 2007

机译：带有模型切换和隐藏层特征正则化的分层神经网络训练
8. Exploiting Hidden Layer Responses of Deep Neural Networks for Language Recognition. [R] . Li, R., Mallidi, S. H., Burget, L., 2016

机译：利用深层神经网络隐藏层响应进行语言识别。

Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers

摘要

著录项

相似文献

相关主题

期刊订阅