In this paper we present a bimodal speech recognition system in which the audio and visual modalities are modeled and integrated using coupled hidden Markov models (CHMMs). CHMMs are probabilistic inference graphs that have hidden Markov models as sub-graphs. Chains in the corresponding inference graph are coupled through matrices of conditional probabilities modeling temporal influences between their hidden state variables. The coupling probabilities are both cross-chain and cross-time; the latter is essential for allowing temporal influences between chains, which is important in modeling bimodal speech. Our bimodal speech recognition system employs a two-chain CHMM, with one chain associated with the acoustic observations and the other with the visual features. A deterministic approximation for maximum a posteriori (MAP) estimation is used to enable fast classification and parameter estimation. We evaluated the system on a speaker-independent connected-digit task. Compared with an acoustic-only ASR system trained using only the audio channel of the same database, the bimodal system consistently demonstrates improved noise robustness at all SNRs. We further compare the CHMM system reported in this paper with our earlier bimodal speech recognition system in which the two modalities are fused by concatenating the audio and visual features. The recognition results clearly show the advantages of the CHMM framework in the context of bimodal speech recognition.
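To illustrate the cross-chain, cross-time coupling described above, the sketch below builds a toy two-chain CHMM transition for the audio chain. The state sizes, matrix names (`A_aa`, `A_va`), and the factorization of the coupled transition as a normalized product of within-chain and cross-chain factors are all illustrative assumptions, not the paper's exact parameterization.

```python
import numpy as np

rng = np.random.default_rng(0)

def norm_rows(m):
    """Normalize each row so it is a valid conditional distribution."""
    return m / m.sum(axis=1, keepdims=True)

# Hypothetical two-chain CHMM: an audio chain (a) and a visual chain (v),
# each with n hidden states. Coupling matrices hold the conditional
# probabilities linking hidden states across time (and across chains).
n = 3
A_aa = norm_rows(rng.random((n, n)))  # audio_{t-1} -> audio_t (within-chain)
A_va = norm_rows(rng.random((n, n)))  # visual_{t-1} -> audio_t (cross-chain)

def audio_transition(s_a_prev, s_v_prev):
    """P(audio state at t | audio and visual states at t-1), taken here as
    the normalized product of the two factors (an assumed factorization)."""
    p = A_aa[s_a_prev] * A_va[s_v_prev]
    return p / p.sum()

dist = audio_transition(0, 1)
print(dist.sum())  # a valid distribution over the n audio states
```

Because the audio chain's next state conditions on the visual chain's previous state (and vice versa in the symmetric case), the model captures the temporal lead or lag between lip motion and acoustics that feature concatenation cannot.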