IEEE International Conference on Acoustics, Speech and Signal Processing

Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios



Abstract

This paper presents a new approach for rapid adaptation in the presence of highly diverse scenarios that takes advantage of information describing the input signals. We introduce a new method for joint factorisation of the background and the speaker in an eigenspace MLLR framework: Joint Factor Eigenspace MLLR (JFEMLLR). We further propose to use contextual information describing the speaker and background, such as tags or more complex metadata, to provide an immediate estimation of the best MLLR transformation for the utterance. This provides instant adaptation, since it does not require any transcription from a previous decoding stage. Evaluation in a highly diverse Automatic Speech Recognition (ASR) task, a modified version of WSJCAM0, yields an improvement of 26.9% over the baseline, which is an extra 1.2% reduction over two-pass MLLR adaptation.
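The core idea can be illustrated with a minimal sketch. The paper itself does not publish code; the function name `jfemllr_transform`, the tag-to-weight lookup tables, and all dimensions below are hypothetical. The sketch assumes pre-trained eigenspace bases for the speaker and background factors, and shows how contextual tags could select basis weights directly, composing an affine MLLR transform without a first-pass decode:

```python
import numpy as np

# Hypothetical dimensions: d-dimensional features; an affine MLLR transform is d x (d+1).
d = 3
rng = np.random.default_rng(0)

# Assumed pre-trained eigenspace bases: K_s speaker and K_b background basis transforms.
K_s, K_b = 2, 2
speaker_basis = rng.standard_normal((K_s, d, d + 1))
background_basis = rng.standard_normal((K_b, d, d + 1))
mean_transform = np.hstack([np.eye(d), np.zeros((d, 1))])  # identity rotation, zero bias

# Hypothetical lookup: contextual metadata tags -> basis weights.
# Because the weights come from tags rather than a decoded transcript,
# the transform is available before any recognition pass.
tag_to_speaker_weights = {"male_adult": np.array([0.8, 0.1])}
tag_to_background_weights = {"street_noise": np.array([0.3, 0.6])}

def jfemllr_transform(speaker_tag, background_tag):
    """Compose a joint speaker/background eigenspace MLLR transform from tags."""
    w_s = tag_to_speaker_weights[speaker_tag]
    w_b = tag_to_background_weights[background_tag]
    # Transform = mean transform + weighted sums over the two factor bases.
    return (mean_transform
            + np.tensordot(w_s, speaker_basis, axes=1)
            + np.tensordot(w_b, background_basis, axes=1))

W = jfemllr_transform("male_adult", "street_noise")
x = rng.standard_normal(d)
adapted = W @ np.append(x, 1.0)  # apply the affine transform to a Gaussian mean
```

In a two-pass MLLR system the weights would instead be estimated from a first-pass transcript; replacing that estimation with a metadata lookup is what makes the adaptation instant, as the abstract describes.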
