HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis

Ji Hun PARK; Jae Sam YOON; Hong Kook KIM

Abstract

In this paper, we propose a new mask estimation method for the computational auditory scene analysis (CASA) of speech using two microphones. The proposed method is based on a hidden Markov model (HMM) so that it can exploit the observation that mask information should be correlated over contiguous analysis frames. In other words, an HMM is used to estimate the mask information, which is represented by the interaural time difference (ITD) and the interaural level difference (ILD) of the two channel signals, and the estimated mask information is then used to separate the desired speech from noisy speech. To show the effectiveness of the proposed mask estimation, we compare its speech recognition performance with that of a Gaussian kernel-based estimation method. As a result, the proposed HMM-based mask estimation method provided an average word error rate reduction of 61.4% when compared with the Gaussian kernel-based mask estimation method.
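The abstract does not give the model's details, so the following is only a minimal Python sketch of the general idea, not the authors' implementation. It assumes a hypothetical two-state HMM per frequency channel (noise-dominant vs. target-dominant), diagonal Gaussian emissions over (ITD, ILD) pairs, and made-up parameter values and units (ITD in ms, ILD in dB). Viterbi decoding then yields a binary mask that varies smoothly across contiguous frames.

```python
import math


def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def viterbi_mask(frames, means, variances, log_trans, log_prior):
    """Most likely state sequence for one frequency channel.

    frames: list of (itd, ild) tuples, one per analysis frame.
    State 0 = noise-dominant, state 1 = target-dominant (hypothetical labels).
    """
    n_states = len(means)

    def log_emit(state, obs):
        # Diagonal covariance: sum the per-dimension log densities.
        return sum(
            log_gauss(o, m, v)
            for o, m, v in zip(obs, means[state], variances[state])
        )

    score = [log_prior[s] + log_emit(s, frames[0]) for s in range(n_states)]
    backptr = []
    for obs in frames[1:]:
        new_score, ptr = [], []
        for s in range(n_states):
            # Best predecessor state for landing in state s at this frame.
            best = max(range(n_states), key=lambda p: score[p] + log_trans[p][s])
            ptr.append(best)
            new_score.append(score[best] + log_trans[best][s] + log_emit(s, obs))
        score = new_score
        backptr.append(ptr)

    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(backptr):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


# Illustrative parameters only (assumed, not taken from the paper).
means = [(0.5, 6.0), (0.0, 0.0)]        # (ITD, ILD) mean per state
variances = [(0.04, 4.0), (0.04, 4.0)]  # per-dimension variances
log_trans = [[math.log(0.9), math.log(0.1)],
             [math.log(0.1), math.log(0.9)]]
log_prior = [math.log(0.5), math.log(0.5)]

frames = [(0.0, 0.1), (0.05, -0.2), (0.3, 3.5),
          (0.02, 0.3), (0.5, 6.1), (0.45, 5.8)]
path = viterbi_mask(frames, means, variances, log_trans, log_prior)
print(path)
```

In this toy input the third frame, taken alone, is closer to the noise model, but the transition penalty keeps it in the target-dominant state. That is the kind of frame-to-frame correlation the abstract refers to.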

Bibliographic details

Source
IEICE Transactions on Information and Systems, 2008, No. 9, pp. 2360-2364 (5 pages)
Authors
Ji Hun PARK; Jae Sam YOON; Hong Kook KIM
Affiliation
Department of Information and Communications, Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong, Buk-gu, Gwangju 500-712, Korea
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Chinese Library Classification: Radio electronics; telecommunications technology
Keywords
computational auditory scene analysis; mask estimation; hidden Markov model; speech recognition
