首页> 外文会议>2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays >Multi-style training of HMMS with stereo data for reverberation-robust speech recognition
【24h】

Multi-style training of HMMS with stereo data for reverberation-robust speech recognition

机译:带有立体声数据的HMMS的多样式训练,用于混响鲁棒语音识别

获取原文

摘要

A novel training algorithm using data pairs of clean and reverberant feature vectors for estimating robust Hidden Markov Models (HMMs), introduced in [1] for matched training, is employed in this paper for multi-style training. The multi-style HMMs are derived from well-trained clean-speech HMMs by aligning the clean data to the clean-speech HMM and using the resulting state-frame alignment to estimate the Gaussian mixture densities from the reverberant data of several different rooms. Thus, the temporal alignment is fixed for all reverberation conditions contained in the multi-style training set so that the model mismatch between the different rooms is reduced. Therefore, this training approach is particularly suitable for multi-style training. Multi-style HMMs trained by the proposed approach and adapted to the current room condition using maximum likelihood linear regression significantly outperform the corresponding adapted multi-style HMMs trained by the conventional Baum-Welch algorithm. In strongly reverberant rooms, the proposed adapted multi-style HMMs even outper-form Baum-Welch HMMs trained on matched data.
机译:本文采用了一种新颖的训练算法,该算法使用干净和混响特征向量的数据对来估计鲁棒的隐马尔可夫模型(HMM),该算法在[1]中引入用于匹配训练,该算法用于多样式训练。通过将干净的数据与干净的语音HMM对齐,并使用所得的状态框架对齐方式,从多个不同房间的混响数据中估计高斯混合密度,可以从训练有素的干净语音HMM派生出多种样式的HMM。因此,对于包含在多样式训练集中的所有混响条件,时间对齐是固定的,从而减少了不同房间之间的模型不匹配。因此,这种训练方法特别适合于多种风格的训练。通过提出的方法训练并使用最大似然线性回归适应当前房间条件的多样式HMM明显优于通过常规Baum-Welch算法训练的相应的经过调整的多样式HMM。在强烈混响的房间中,所提出的改编的多样式HMM甚至优于在匹配数据上训练过的Baum-Welch HMM。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号