首页> 外文会议>Annual conference of the International Speech Communication Association >Group Sparse Hidden Markov Models for Speech Recognition
【24h】

Group Sparse Hidden Markov Models for Speech Recognition

机译:语音识别的群体稀疏隐马尔可夫模型

获取原文

摘要

This paper presents the group sparse hidden Markov models (GS-HMMs) where a sequence of acoustic features is driven by Markov chain and each feature vector is represented by two groups of basis vectors. The group of common bases represents the features across states within a HMM. The group of individual bases compensates the intra-state residual information. Importantly, the sparse prior for sensing weights is controlled by the Laplacian scale mixture (LSM) distribution which is obtained by multiplying Laplacian variable with an inverse Gamma variable. The scale mixture parameter in LSM makes the distribution even sparser. This parameter serves as an automatic relevance determination for selecting the relevant bases from two groups. The weights and two sets of bases in GS-HMMs are estimated via Bayesian learning. We apply this framework for acoustic modeling and show the robustness of GS-HMMs for speech recognition in presence of different noises types and SNRs.
机译:本文提出了一组稀疏隐马尔可夫模型(GS-HMM),其中一系列声学特征由马尔可夫链驱动,每个特征向量由两组基向量表示。通用基组代表HMM中跨状态的要素。单个碱基的组补偿状态内残差信息。重要的是,稀疏先验是通过拉普拉斯比例混合(LSM)分布来控制的,该分布是通过将拉普拉斯变量与反伽玛变量相乘而获得的。 LSM中的比例混合参数使分布更加稀疏。该参数用作自动相关性确定,用于从两组中选择相关的碱基。通过贝叶斯学习估计GS-HMM中的权重和两组碱基。我们将此框架应用于声学建模,并显示了在存在不同噪声类型和SNR的情况下GS-HMM用于语音识别的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号