IEEE International Conference on Acoustics, Speech and Signal Processing

Exploiting different word clusterings for class-based RNN language modeling in speech recognition



Abstract

We propose to exploit the potential of multiple word clusterings in class-based recurrent neural network (RNN) language models for ensemble RNN language modeling. By varying the clustering criteria and the space of word embedding, different word clusterings are obtained to define different word/class factorizations. For each such word/class factorization, several base RNNLMs are learned, and the word prediction probabilities of the base RNNLMs are then combined to form an ensemble prediction. We use a greedy backward model selection procedure to select a subset of models and combine these models for word prediction. The proposed ensemble language modeling method has been evaluated on Penn Treebank test set as well as Wall Street Journal (WSJ) Eval 92 and 93 test sets, where it improved test set perplexity and word error rate over the state-of-the-art single RNNLMs as well as multiple RNNLMs produced by varying RNN learning conditions.


