首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Optimal Near-End Speech Intelligibility Improvement Incorporating Additive Noise and Late Reverberation Under an Approximation of the Short-Time SII
【24h】

Optimal Near-End Speech Intelligibility Improvement Incorporating Additive Noise and Late Reverberation Under an Approximation of the Short-Time SII

机译:在短时SII的近似作用下,结合加性噪声和后期混响,优化近端语音清晰度

获取原文
获取原文并翻译 | 示例

摘要

The presence of environmental additive noise in the vicinity of the user typically degrades the speech intelligibility of speech processing applications. This intelligibility loss can be compensated by properly preprocessing the speech signal prior to play-out, often referred to as near-end speech enhancement. Although the majority of such algorithms focus primarily on the presence of additive noise, reverberation can also severely degrade intelligibility. In this paper we investigate how late reverberation and additive noise can be jointly taken into account in the near-end speech enhancement process. For this effort we use a recently presented approximation of the speech intelligibility index under a power constraint, which we optimize for speech degraded by both additive noise and late reverberation. The algorithm results in time–frequency dependent amplification factors that depend on both the additive noise power spectral density as well as the late reverberation energy. These amplification factors redistribute speech energy across frequency and perform a dynamic range compression. Experimental results using both instrumental intelligibility measures as well as intelligibility listening tests show that the proposed approach improves speech intelligibility over state-of-the-art reference methods when speech signals are degraded simultaneously by additive noise and reverberation. Speech intelligibility improvements in the order of 20% are observed.
机译:用户附近环境附加噪声的存在通常会降低语音处理应用程序的语音清晰度。可以通过在播出之前对语音信号进行适当的预处理(通常称为近端语音增强)来补偿这种清晰度损失。尽管大多数此类算法主要着眼于加性噪声的存在,但是混响也会严重降低清晰度。在本文中,我们研究了在近端语音增强过程中如何将后期混响和附加噪声共同考虑在内。为此,我们使用了功率限制条件下语音清晰度指数的最新近似值,我们针对因加性噪声和后期混响而退化的语音进行了优化。该算法产生依赖于时频的放大因子,该放大因子既依赖于加性噪声功率谱密度,也依赖于后期混响能量。这些放大因子会在整个频率范围内重新分配语音能量,并执行动态范围压缩。使用仪器清晰度措施和清晰度听觉测试的实验结果表明,当语音信号由于加性噪声和混响而同时劣化时,所提出的方法比最新的参考方法提高了语音清晰度。语音清晰度提高了大约20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号