首页> 外文期刊>The Journal of the Acoustical Society of America >An effective cluster-based model for robust speech detection and speech recognition in noisy environments
【24h】

An effective cluster-based model for robust speech detection and speech recognition in noisy environments

机译:在嘈杂环境中用于鲁棒语音检测和语音识别的有效基于群集的模型

获取原文
获取原文并翻译 | 示例
           

摘要

This paper shows an accurate speech detection algorithm for improving the performance of speech recognition systems working in noisy environments. The proposed method is based on a hard decision clustering approach where a set of prototypes is used to characterize the noisy channel. Detecting the presence of speech is enabled by a decision rule formulated in terms of an averaged distance between the observation vector and a cluster-based noise model. The algorithm benefits from using contextual information, a strategy that considers not only a single speech frame but also a neighborhood of data in order to smooth the decision function and improve speech detection robustness. The proposed scheme exhibits reduced computational cost making it adequate for real time applications, i.e., automated speech recognition systems. An exhaustive analysis is conducted on the AURORA 2 and AURORA 3 databases in order to assess the performance of the algorithm and to compare it to existing standard voice activity detection (VAD) methods. The results show significant improvements in detection accuracy and speech recognition rate over standard VADs such as ITU-T G.729, ETSI GSM AMR, and ETSI AFE for distributed speech recognition and a representative set of recently reported VAD algorithms. (c) 2006 Acoustical Society of America.
机译:本文展示了一种精确的语音检测算法,可以提高在嘈杂环境中工作的语音识别系统的性能。所提出的方法基于硬决策聚类方法,其中使用一组原型来表征噪声通道。通过根据观察矢量和基于群集的噪声模型之间的平均距离制定的决策规则,可以检测语音的存在。该算法得益于上下文信息的使用,上下文信息是一种不仅考虑单个语音帧,而且考虑数据邻域的策略,以使决策函数更加平滑并提高语音检测的鲁棒性。所提出的方案表现出降低的计算成本,使其适合于实时应用,即自动语音识别系统。为了评估算法的性能并将其与现有的标准语音活动检测(VAD)方法进行比较,对AURORA 2和AURORA 3数据库进行了详尽的分析。结果表明,与用于分布式语音识别的标准VAD(例如ITU-T G.729,ETSI GSM AMR和ETSI AFE)以及一组代表性的最近报告的VAD算法相比,检测准确性和语音识别率有了显着提高。 (c)2006年美国声学学会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号