首页> 外文期刊>IEICE transactions on information and systems >Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers
【24h】

Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers

机译:具有分解隐藏层的神经网络语言模型的基于特征的域自适应

获取原文
       

摘要

Language models are a key technology in various tasks, such as, speech recognition and machine translation. They are usually used on texts covering various domains and as a result domain adaptation has been a long ongoing challenge in language model research. With the rising popularity of neural network based language models, many methods have been proposed in recent years. These methods can be separated into two categories: model based and feature based adaptation methods. Feature based domain adaptation has compared to model based domain adaptation the advantage that it does not require domain labels in the corpus. Most existing feature based adaptation methods are based on bias adaptation. We propose a novel feature based domain adaptation technique using hidden layer factorisation. This method is fundamentally different from existing methods because we use the domain features to calculate a linear combination of linear layers. These linear layers can capture domain specific information and information common to different domains. In the experiments, we compare our proposed method with existing adaptation methods. The compared adaptation techniques are based on two different ideas, that is, bias based adaptation and gating of hidden units. All language models in our comparison use state-of-the-art long short-term memory based recurrent neural networks. We demonstrate the effectiveness of the proposed method with perplexity results for the well-known Penn Treebank and speech recognition results for a corpus of TED talks.
机译:语言模型是语音识别和机器翻译等各种任务中的关键技术。它们通常用于涵盖各个领域的文本上,因此,领域适应一直是语言模型研究中长期存在的挑战。随着基于神经网络的语言模型的日益普及,近年来已经提出了许多方法。这些方法可以分为两类:基于模型的适应方法和基于特征的适应方法。与基于模型的领域自适应相比,基于特征的领域自适应具有以下优势:不需要语料库中的域标签。大多数现有的基于特征的自适应方法都是基于偏差自适应的。我们提出了一种使用隐藏层分解的基于特征的领域自适应技术。此方法与现有方法根本不同,因为我们使用域特征来计算线性层的线性组合。这些线性层可以捕获特定于域的信息以及不同域共有的信息。在实验中,我们将我们提出的方法与现有的适应方法进行了比较。比较的自适应技术基于两种不同的思想,即基于偏差的自适应和隐藏单元的门控。我们比较中的所有语言模型都使用基于最新的长期短期记忆的递归神经网络。我们用著名的Penn Treebank的困惑结果和TED演讲的语音识别结果证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号