Conference of the International Speech Communication Association

Exploiting the Succeeding Words in Recurrent Neural Network Language Models


Abstract

In automatic speech recognition, conventional language models recognize the current word using only information from preceding words. Recently, Recurrent Neural Network Language Models (RNNLMs) have drawn increased research attention because of their ability to outperform conventional n-gram language models. The superiority of RNNLMs lies in their ability to capture long-distance word dependencies. In practice, RNNLMs are applied in an N-best rescoring framework, which offers new possibilities for information integration. In particular, it becomes interesting to extend the ability of RNNLMs to capture long-distance information by also allowing them to exploit information from succeeding words during the rescoring process. This paper proposes three approaches for exploiting succeeding-word information in RNNLMs. The first is a forward-backward model that combines RNNLMs exploiting preceding and succeeding words. The second is an extension of a Maximum Entropy RNNLM (RNNME) that incorporates succeeding-word information. The third is an approach that combines language models using two-pass alternating rescoring. Experimental results demonstrate the ability of succeeding-word information to improve RNNLM performance, both in terms of perplexity and Word Error Rate (WER). The best performance is achieved by a combined model that exploits the three words succeeding the current word.
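
The first approach, combining a forward language model (conditioned on preceding words) with a backward language model (conditioned on succeeding words), can be sketched as a log-linear score combination over an N-best list. The sketch below is illustrative only: the function rescore_nbest, the interpolation weight interp, the scaling factor lm_weight, and the toy stand-in scoring functions are assumptions for illustration, not the paper's implementation; a real system would query trained forward and backward RNNLMs.

```python
import math
from typing import Callable, List, Tuple

def rescore_nbest(
    hypotheses: List[Tuple[List[str], float]],   # (word sequence, acoustic log-score)
    fwd_logprob: Callable[[List[str]], float],   # log P(W) from a forward LM
    bwd_logprob: Callable[[List[str]], float],   # log P(W) from a backward LM
    lm_weight: float = 10.0,                     # assumed LM scaling factor
    interp: float = 0.5,                         # assumed forward/backward weight
) -> List[str]:
    """Return the hypothesis maximising acoustic + interpolated LM score."""
    def total_score(words: List[str], am_score: float) -> float:
        # Log-linear interpolation of the forward model (preceding-word
        # context) and the backward model (succeeding-word context).
        lm_score = interp * fwd_logprob(words) + (1.0 - interp) * bwd_logprob(words)
        return am_score + lm_weight * lm_score

    best_words, _ = max(hypotheses, key=lambda h: total_score(h[0], h[1]))
    return best_words

# Toy stand-in models so the sketch runs; real usage would call trained RNNLMs.
def toy_fwd(words: List[str]) -> float:
    return len(words) * -math.log(10.0)          # uniform over a 10-word vocabulary

def toy_bwd(words: List[str]) -> float:
    return len(words) * -math.log(10.0)          # same toy distribution, reversed context

nbest = [(["the", "cat", "sat"], -12.3),
         (["the", "cat", "sad"], -12.1)]
print(rescore_nbest(nbest, toy_fwd, toy_bwd))
```

Because both toy models assign identical scores here, the acoustic term decides the output; with real forward and backward RNNLMs, the backward term is what lets succeeding-word context overturn an acoustically favoured but linguistically implausible hypothesis.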