...
首页> 外文期刊>The Journal of the Acoustical Society of America >Statistical voice activity detection based on integrated bispectrum likelihood ratio tests for robust speech recognition
【24h】

Statistical voice activity detection based on integrated bispectrum likelihood ratio tests for robust speech recognition

机译:基于集成双谱似然比测试的统计语音活动检测,可增强语音识别能力

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Currently, there are technology barriers inhibiting speech processing systems that work in extremely noisy conditions from meeting the demands of modem applications. These systems often require a noise reduction system working in combination with a precise voice activity detector (VAD). This paper shows statistical likelihood ratio tests formulated in terms of the integrated bispectrum of the noisy signal. The integrated bispectrum is defined as a cross spectrum between the signal and its square, and therefore a function of a single frequency variable. It inherits the ability of higher order statistics to detect signals in noise with many other additional advantages: (i) Its computation as a cross spectrum leads to significant computational savings, and (ii) the variance of the estimator is of the same order as that of the power spectrum estimator. The proposed approach incorporates contextual information to the decision rule, a strategy that has reported significant benefits for robust speech recognition applications. The proposed VAD is compared to the G.729, adaptive multirate, and advanced front-end standards as well as recently reported algorithms showing a sustained advantage in speechonspeech detection accuracy and speech recognition performance. (C) 2007 Acoustical Society of America.
机译:当前,存在技术上的障碍,阻碍了在非常嘈杂的条件下工作的语音处理系统无法满足现代应用的需求。这些系统通常需要与精确的语音活动检测器(VAD)结合使用的降噪系统。本文显示了根据噪声信号的整体双谱制定的统计似然比检验。积分双谱被定义为信号与其平方之间的互谱,因此是单个频率变量的函数。它继承了高阶统计量检测噪声中信号的能力,还有其他许多其他优点:(i)将其作为互谱进行计算可节省大量计算量;(ii)估计量的方差与功率谱估计器所提出的方法将上下文信息合并到决策规则中,该策略已报告了强大的语音识别应用程序的显着优势。拟议的VAD与G.729,自适应多速率和高级前端标准以及最近报告的算法进行了比较,这些算法在语音/非语音检测准确性和语音识别性能方面显示出持续的优势。 (C)2007美国声学学会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号