Mask estimation for missing data speech recognition based on statistics of binaural interaction

Harding S.; Barker J.; Brown G.J.

首页> 外文期刊>IEEE transactions on audio, speech and language processing >Mask estimation for missing data speech recognition based on statistics of binaural interaction

【24h】

Mask estimation for missing data speech recognition based on statistics of binaural interaction

机译：基于双耳互动统计的漏失数据语音识别模板估计

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes a perceptually motivated computational auditory scene analysis (CASA) system that combines sound separation according to spatial location with the "missing data" approach for robust speech recognition in noise. Missing data time-frequency masks are created using probability distributions based on estimates of interaural time and level differences (ITD and ILD) for mixed utterances in reverberated conditions; these masks indicate which regions of the spectrum constitute reliable evidence of the target speech signal. A number of experiments compare the relative efficacy of the binaural cues when used individually and in combination. We also investigate the ability of the system to generalize to acoustic conditions not encountered during training. Performance on a continuous digit recognition task using this method is found to be good, even in a particularly challenging environment with three concurrent male talkers.

机译：本文介绍了一种感知动机的计算听觉场景分析（CASA）系统，该系统将根据空间位置的声音分离与“缺失数据”方法相结合，以在噪声中进行可靠的语音识别。基于混响条件下混合话语的听觉时间和水平差（ITD和ILD）的估计，使用概率分布来创建丢失的数据时频掩码；这些掩码指示频谱的哪些区域构成目标语音信号的可靠证据。大量实验比较了单独使用或组合使用时双耳提示的相对功效。我们还研究了该系统推广到训练期间未遇到的声学条件的能力。发现即使使用三个并发男性讲话者的特别具有挑战性的环境，使用此方法在连续数字识别任务上的表现也很好。

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2006年第1期|p.58-67|共10页
作者
Harding S.; Barker J.; Brown G.J.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
speech intelligibility; speech recognition; statistical distributions; binaural interaction statistics; computational auditory scene analysis system; interaural level differences; interaural time differences; mask estimation; missing data speech recognition; prob;

机译：语音清晰度;语音识别;统计分布;双耳互动统计;计算听觉场景分析系统;听觉水平差异;听觉时间差异;掩膜估计;数据丢失语音识别;概率;

相似文献

外文文献
中文文献
专利

1. Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment [J] . Sami Keronen, Heikki Kallasjoki, Ulpu Remes, Computer speech and language . 2013,第3期

机译：多源混响环境中用于丢失数据语音识别的模板估计和归类方法
2. A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition [J] . Wooil Kim, Hansen J.H.L. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第5期

机译：一种基于后验的代表性均值估计的新的掩模估计方法用于特征缺失语音识别
3. A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation [J] . Palomaki KJ, Brown GJ, Wang DL Speech Communication . 2004,第4期

机译：双耳处理器，用于在存在噪声和小房间混响的情况下丢失数据语音识别
4. Mask Estimation Based on Sound Localisation for Missing Data Speech Recognition [C] . Harding, S., Barker, . 2005

机译：基于声音定位的掩码估计用于数据丢失语音识别
5. Robust Recognition of Binaural Speech Signals Using Techniques Based on Human Auditory Processing [D] . Menon, Anjali I. 2019

机译：基于人类听觉处理技术的双耳语音信号的稳健识别
6. The Binaural Masking-Level Difference of Mandarin Tone Detection and the Binaural Intelligibility-Level Difference of Mandarin Tone Recognition in the Presence of Speech-Spectrum Noise [O] . Cheng-Yu Ho, Pei-Chun Li, Yuan-Chuan Chiang, -1

机译：语音频谱噪声下普通话检测的双耳掩蔽水平差异和普通话识别的双耳可懂度水平差异
7. Mask estimation for missing data speech recognition based on statistics of binaural interaction [O] . Sue Harding, Jon Barker, Guy J. Brown 2006

机译：基于双耳交互统计的缺失数据语音识别模板估计

Mask estimation for missing data speech recognition based on statistics of binaural interaction

摘要

著录项

相似文献

相关主题

期刊订阅