首页> 外文期刊>Signal processing >Multichannel speech reinforcement based on binaural unmasking
【24h】

Multichannel speech reinforcement based on binaural unmasking

机译:基于双耳解掩的多通道语音增强

获取原文
获取原文并翻译 | 示例

摘要

Speech reinforcement or near-end listening enhancement is a technique that modifies the far-end signal to mitigate the effect of the near-end noise, usually based on the power spectra of the far-end signal and the near-end noise. Psychoacoustic experiments have shown that the location of a noise source with respect to that of a signal source affects the amount of masking. Since conventional speech reinforce- ment methods obtain spectral gain based only on the power spectra, this psychoacoustic phenomenon called binaural unmasking has not been considered in those approaches. In this paper, we propose a novel speech reinforcement algorithm that modifies the far-end speech signal based on both the power spectrum and the direction-of-arrival (DoA) of the noise. Specifically, we have computed the equivalent frontal noise level from the observed noise level and the estimated DoA, and used it to compute spectral gains as in conventional partial loudness restoration-based speech reinforcement. Experimental results showed that the proposed method outperformed the conventional methods based on partial loudness restoration and speech intelligibility index (SII) optimization in terms of the overall perceived quality through subjective listening tests.
机译:语音增强或近端聆听增强是一种通常根据远端信号和近端噪声的功率谱来修改远端信号以减轻近端噪声影响的技术。心理声学实验表明,噪声源相对于信号源的位置会影响掩膜的数量。由于常规的语音增强方法仅基于功率谱获得频谱增益,因此在这些方法中未考虑这种称为双耳解掩的心理声学现象。在本文中,我们提出了一种新颖的语音增强算法,该算法基于功率谱和噪声的到达方向(DoA)来修改远端语音信号。具体来说,我们已经从观察到的噪声水平和估计的DoA计算了等效的正面噪声水平,并像传统的基于部分响度恢复的语音增强一样,将其用于计算频谱增益。实验结果表明,通过主观听觉测试,该方法优于基于部分响度恢复和语音清晰度指数(SII)优化的常规方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号