An effective cluster-based model for robust speech detection and speech recognition in noisy environments

Gorriz JM; Ramirez J; Segura JC; Puntonet CG

Abstract

This paper presents an accurate speech detection algorithm for improving the performance of speech recognition systems working in noisy environments. The proposed method is based on a hard-decision clustering approach in which a set of prototypes is used to characterize the noisy channel. Speech is detected by a decision rule formulated in terms of an averaged distance between the observation vector and a cluster-based noise model. The algorithm benefits from contextual information: it considers not only a single speech frame but also a neighborhood of data in order to smooth the decision function and improve speech detection robustness. The proposed scheme has a reduced computational cost, making it suitable for real-time applications such as automated speech recognition systems. An exhaustive analysis is conducted on the AURORA 2 and AURORA 3 databases to assess the performance of the algorithm and to compare it with existing standard voice activity detection (VAD) methods. The results show significant improvements in detection accuracy and speech recognition rate over standard VADs such as ITU-T G.729, ETSI GSM AMR, and ETSI AFE for distributed speech recognition, and over a representative set of recently reported VAD algorithms. © 2006 Acoustical Society of America.
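The abstract does not give the exact formulation, so the following is only a minimal sketch of the general idea it describes, under several assumptions that are not from the paper: k-means is used as the hard-decision clustering step, Euclidean distance is the metric, the "averaged distance" is taken as the mean distance from a frame to all noise prototypes, and context is handled by averaging that distance over a symmetric window of neighboring frames. All function names, parameters, and the threshold are hypothetical.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def fit_noise_prototypes(noise_frames, k, iters=20, seed=0):
    """Hard-decision clustering (k-means, an assumed choice) of noise-only frames.

    Returns k prototype vectors that stand in for the cluster-based noise model.
    """
    rng = random.Random(seed)
    prototypes = [list(f) for f in rng.sample(noise_frames, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        # Hard assignment: each frame goes to its single nearest prototype.
        for f in noise_frames:
            j = min(range(k), key=lambda i: sq_dist(f, prototypes[i]))
            clusters[j].append(f)
        # Update each prototype to the mean of its members (skip empty clusters).
        for j, members in enumerate(clusters):
            if members:
                dim = len(members[0])
                prototypes[j] = [
                    sum(m[d] for m in members) / len(members) for d in range(dim)
                ]
    return prototypes


def frame_distance(frame, prototypes):
    """Mean Euclidean distance from one frame to all noise prototypes (assumed rule)."""
    return sum(math.sqrt(sq_dist(frame, p)) for p in prototypes) / len(prototypes)


def detect_speech(frames, prototypes, threshold, context=2):
    """Return one boolean per frame: True where speech is declared.

    The per-frame distance is averaged over up to `context` frames on each
    side, which smooths the decision function before thresholding.
    """
    dists = [frame_distance(f, prototypes) for f in frames]
    decisions = []
    for t in range(len(frames)):
        lo = max(0, t - context)
        hi = min(len(frames), t + context + 1)
        smoothed = sum(dists[lo:hi]) / (hi - lo)
        decisions.append(smoothed > threshold)
    return decisions
```

In use, `fit_noise_prototypes` would be run on frames known to contain only noise (for example, the first frames of a recording), and `detect_speech` would then label the remaining frames. The threshold and window size would need tuning per feature type and noise level; the values a real system uses are not stated in the abstract.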

Bibliographic details

Source: The Journal of the Acoustical Society of America, 2006, No. 1, 12 pages
Authors: Gorriz JM; Ramirez J; Segura JC; Puntonet CG
Original format: PDF
Language: English
Chinese Library Classification: Acoustics
Keywords: VOICE ACTIVITY DETECTION; GAUSSIAN MODEL; REDUCTION; END; VAD
