EURASIP Journal on Advances in Signal Processing

A new bigram-PLSA language model for speech recognition

Abstract

A novel method for combining a bigram model and Probabilistic Latent Semantic Analysis (PLSA) is introduced for language modeling. The motivation behind this idea is the relaxation of the bag-of-words assumption that is fundamental to latent topic models, including the PLSA model. An EM-based parameter estimation technique for the proposed model is presented in this paper. Previous attempts to incorporate word order into the PLSA model are surveyed and compared with the proposed model, both in theory and through experimental evaluation. The perplexity measure is employed to compare the effectiveness of recently introduced models with that of the proposed model. Furthermore, experiments are designed and carried out on continuous speech recognition (CSR) tasks using word error rate (WER) as the evaluation criterion. The results demonstrate the superiority of the new bigram-PLSA model over Nie et al.'s bigram-PLSA model and the simple PLSA model. Experiments on the BLLIP WSJ corpus show about a 12% reduction in perplexity and a 2.8% WER improvement compared to Nie et al.'s bigram-PLSA model.
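The abstract does not spell out the model's exact parameterization. As a rough illustration only, the sketch below assumes the commonly cited bigram-PLSA decomposition P(w_i | w_{i-1}, d) = sum_z P(w_i | w_{i-1}, z) * P(z | d) and shows how sentence perplexity, the first evaluation criterion mentioned above, would be computed under such a mixture. The NumPy arrays, the helper functions bigram_plsa_prob and perplexity, and the toy vocabulary and topic sizes are illustrative assumptions, not the paper's implementation.

    # Minimal sketch of a bigram-PLSA mixture and perplexity evaluation (assumed form).
    import numpy as np

    rng = np.random.default_rng(0)
    V, Z = 5, 3  # toy vocabulary size and number of latent topics

    # P(w_i | w_{i-1}, z): one row-stochastic V x V matrix per topic
    p_w_given_prev_z = rng.random((Z, V, V))
    p_w_given_prev_z /= p_w_given_prev_z.sum(axis=2, keepdims=True)

    # P(z | d): topic mixture weights of the current document
    p_z_given_d = rng.random(Z)
    p_z_given_d /= p_z_given_d.sum()

    def bigram_plsa_prob(w, w_prev):
        """Mixture probability P(w | w_prev, d) = sum_z P(w | w_prev, z) P(z | d)."""
        return float(np.dot(p_z_given_d, p_w_given_prev_z[:, w_prev, w]))

    def perplexity(word_ids):
        """Perplexity of a word-id sequence under the bigram-PLSA mixture."""
        log_prob = sum(np.log(bigram_plsa_prob(w, w_prev))
                       for w_prev, w in zip(word_ids[:-1], word_ids[1:]))
        n = len(word_ids) - 1  # number of predicted words
        return float(np.exp(-log_prob / n))

    print(perplexity([0, 2, 1, 4, 3]))

The EM-based parameter estimation described in the abstract would fit p_w_given_prev_z and p_z_given_d from a corpus; that training procedure is not reproduced here.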