Multi-style training of HMMS with stereo data for reverberation-robust speech recognition

机译：带有立体声数据的HMMS的多样式训练，用于混响鲁棒语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A novel training algorithm using data pairs of clean and reverberant feature vectors for estimating robust Hidden Markov Models (HMMs), introduced in [1] for matched training, is employed in this paper for multi-style training. The multi-style HMMs are derived from well-trained clean-speech HMMs by aligning the clean data to the clean-speech HMM and using the resulting state-frame alignment to estimate the Gaussian mixture densities from the reverberant data of several different rooms. Thus, the temporal alignment is fixed for all reverberation conditions contained in the multi-style training set so that the model mismatch between the different rooms is reduced. Therefore, this training approach is particularly suitable for multi-style training. Multi-style HMMs trained by the proposed approach and adapted to the current room condition using maximum likelihood linear regression significantly outperform the corresponding adapted multi-style HMMs trained by the conventional Baum-Welch algorithm. In strongly reverberant rooms, the proposed adapted multi-style HMMs even outper-form Baum-Welch HMMs trained on matched data.

机译：本文采用了一种新颖的训练算法，该算法使用干净和混响特征向量的数据对来估计鲁棒的隐马尔可夫模型（HMM），该算法在[1]中引入用于匹配训练，该算法用于多样式训练。通过将干净的数据与干净的语音HMM对齐，并使用所得的状态框架对齐方式，从多个不同房间的混响数据中估计高斯混合密度，可以从训练有素的干净语音HMM派生出多种样式的HMM。因此，对于包含在多样式训练集中的所有混响条件，时间对齐是固定的，从而减少了不同房间之间的模型不匹配。因此，这种训练方法特别适合于多种风格的训练。通过提出的方法训练并使用最大似然线性回归适应当前房间条件的多样式HMM明显优于通过常规Baum-Welch算法训练的相应的经过调整的多样式HMM。在强烈混响的房间中，所提出的改编的多样式HMM甚至优于在匹配数据上训练过的Baum-Welch HMM。

著录项

来源
《2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays》|2011年|p.196-200|共5页
会议地点
作者
Sehr Armin; Hofmann Christian; Maas Roland; Kellermann Walter;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电声技术和语音信号处理;
关键词
Multi-style HMMtraining; distant-talking speech recognition; reverberation; robust ASR; stereo data;

机译：多样式HMM训练;远程语音识别;混响;健壮的ASR;立体声数据;

相似文献

外文文献
中文文献
专利

1. Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems [J] . Garcia-Moral A. I., Solera-Urena R., Pelaez-Moreno C., Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第3期

机译：数据平衡可有效训练混合ANN / HMM自动语音识别系统
2. Discriminative training of HMMs for automatic speech recognition: A survey [J] . Hui Jiang Computer speech and language . 2010,第4期

机译：用于自动语音识别的HMM的歧视性培训：一项调查
3. Addable Stress Speech Recognition with Multiplexing HMM: Training and Non-training Decision [J] . Pakapong Amornkul, Kosin Chamnongthai, Punnarumol Temdee Wireless personal communications: An Internaional Journal . 2014,第3期

机译：HMM的加重语音识别：训练和非训练决策
4. Multi-style training of HMMS with stereo data for reverberation-robust speech recognition [C] . Sehr Armin, Hofmann Christian, Maas Roland, Joint Workshop on Hands-free Speech Communication and Microphone Arrays . 2011

机译：具有混响 - 强大的语音识别的立体声数据的多种式培训HMMS
5. Classification and recognition of speech under perceptual stress using neural networks and N-D HMMs. [D] . Womack, Brian David. 1996

机译：使用神经网络和N-D HMM在感知压力下对语音进行分类和识别。
6. Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data [O] . Ayesha Pervaiz, Fawad Hussain, Huma Israr, 2020

机译：通过训练数据的噪声增强将噪声鲁棒性纳入语音命令识别中
7. Mismatched Training Data Enhancement for Automatic Recognition of Children’s Speech using DNN-HMM [O] . Qian Mengjie, McLoughlin Ian Vince, Guo Wu, 2016

机译：不正确的训练数据增强功能，无法使用DNN-HMM自动识别儿童的语音

Multi-style training of HMMS with stereo data for reverberation-robust speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅