Group Sparse Hidden Markov Models for Speech Recognition

机译：语音识别的群体稀疏隐马尔可夫模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the group sparse hidden Markov models (GS-HMMs) where a sequence of acoustic features is driven by Markov chain and each feature vector is represented by two groups of basis vectors. The group of common bases represents the features across states within a HMM. The group of individual bases compensates the intra-state residual information. Importantly, the sparse prior for sensing weights is controlled by the Laplacian scale mixture (LSM) distribution which is obtained by multiplying Laplacian variable with an inverse Gamma variable. The scale mixture parameter in LSM makes the distribution even sparser. This parameter serves as an automatic relevance determination for selecting the relevant bases from two groups. The weights and two sets of bases in GS-HMMs are estimated via Bayesian learning. We apply this framework for acoustic modeling and show the robustness of GS-HMMs for speech recognition in presence of different noises types and SNRs.

机译：本文提出了一组稀疏隐马尔可夫模型（GS-HMM），其中一系列声学特征由马尔可夫链驱动，每个特征向量由两组基向量表示。通用基组代表HMM中跨状态的要素。单个碱基的组补偿状态内残差信息。重要的是，稀疏先验是通过拉普拉斯比例混合（LSM）分布来控制的，该分布是通过将拉普拉斯变量与反伽玛变量相乘而获得的。 LSM中的比例混合参数使分布更加稀疏。该参数用作自动相关性确定，用于从两组中选择相关的碱基。通过贝叶斯学习估计GS-HMM中的权重和两组碱基。我们将此框架应用于声学建模，并显示了在存在不同噪声类型和SNR的情况下GS-HMM用于语音识别的鲁棒性。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|2645-2648|共4页
会议地点
作者
Jen-Tzung Chien; Cheng-Chun Chiang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Bayesian learning; group sparsity; hidden Markov model; speech recognition;

机译：贝叶斯学习;群体稀疏性隐马尔可夫模型;语音识别;

相似文献

外文文献
中文文献
专利

1. Hybrid approach to speech recognition using hidden Markov models and Markov chains [J] . Dai J. IEE Proceedings. Part K . 1994,第5期

机译：使用隐马尔可夫模型和马尔可夫链的混合语音识别方法
2. Speech Silicon: An FPGA Architecture for Real-Time Hidden Markov-Model-Based Speech Recognition [J] . Jeffrey Schuster, Kshitij Gupta, Raymond Hoare, EURASIP journal on embedded systems . 2006,第1期

机译：语音芯片：基于实时隐马尔可夫模型的语音识别的FPGA架构
3. Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models [J] . H.J. Nock, S.J. Young Cognitive science . 2002,第3期

机译：使用松耦合隐马尔可夫模型在自动语音识别中建模异步
4. Exploiting sparsity in stranded hidden Markov models for automatic speech recognition [C] . Zhao Yong, Juang Biing-Hwang Asilomar Conference on Signals, Systems and Computers . 2012

机译：利用滞留隐马尔可夫模型中的稀疏性进行自动语音识别
5. Online Learning of Large Margin Hidden Markov Models for Automatic Speech Recognition. [D] . Cheng, Chih-Chieh. 2011

机译：在线学习大余量隐马尔可夫模型以进行自动语音识别。
6. Assessment of Dysarthria Using One-Word Speech Recognition with Hidden Markov Models [O] . Seung Hak Lee, Minje Kim, Han Gil Seo, 2019

机译：使用隐马尔可夫模型的单字语音识别评估构音障碍
7. Speech Silicon AM: An FPGA-Based Acoustic Modeling Pipeline for Hidden Markov Model based Speech Recognition [O] . Jeffrey W. Schuster, Kshitij Gupta, Raymond Hoare 2014

机译：语音芯片AM：基于FPGA的声学建模管道，用于基于隐马尔可夫模型的语音识别
8. Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding [R] . Hogden, J. 1996

机译：改进隐马尔可夫模型：语音识别和语音编码的语义约束，最大似然方法

Group Sparse Hidden Markov Models for Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅