Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Spectral Dynamics Recovery for Enhanced Speech Intelligibility in Noise

Abstract

Speech intelligibility in noisy environments decreases with an increase in the noise power. We hypothesize that the differences of subsequent short-term spectra of speech, which we collectively refer to as the speech spectral dynamics, can be used to characterize speech intelligibility. We propose a distortion measure to characterize the deviation of the dynamics of the noisy modified speech from the dynamics of natural speech. Optimizing this distortion measure, we derive a parametric relationship between the signal band-power before and after modification. The parametric nature of the solution ensures adaptation to the noise level, the speech statistics and a penalty on the power gain. A multi-band speech modification system based on the single-band optimal solution is designed under a total signal power constraint and evaluated in selected noise conditions. The results indicate that the proposed approach compares favorably to a reference method based on optimizing a measure of the speech intelligibility index. Very low computational complexity and high intelligibility gain make this an attractive approach for speech modification in a wide range of application scenarios.
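To make the notion of spectral dynamics concrete, here is a minimal sketch, not the paper's method. It assumes that the dynamics can be represented as frame-to-frame differences of log band-power, that speech and noise powers add within a band, and that a mean squared difference is an acceptable stand-in for the distortion measure, which the abstract does not specify. All function names, the uniform gain, and the toy values are hypothetical.

```python
import math


def to_db(power: float) -> float:
    """Convert a linear power value to decibels."""
    return 10.0 * math.log10(power)


def spectral_dynamics(band_powers: list[list[float]]) -> list[list[float]]:
    """Frame-to-frame differences of log band-power.

    band_powers[t][k] is the power of frame t in band k.
    ASSUMPTION: the log-power difference is an illustrative choice only.
    """
    return [
        [to_db(cur[k]) - to_db(prev[k]) for k in range(len(cur))]
        for prev, cur in zip(band_powers, band_powers[1:])
    ]


def add_noise(band_powers: list[list[float]], noise_power: float) -> list[list[float]]:
    """ASSUMPTION: speech and noise are uncorrelated, so band powers add."""
    return [[p + noise_power for p in frame] for frame in band_powers]


def apply_gain(band_powers: list[list[float]], gain: float) -> list[list[float]]:
    """Hypothetical modification: one fixed power gain for every frame and band."""
    return [[gain * p for p in frame] for frame in band_powers]


def dynamics_distortion(reference: list[list[float]], observed: list[list[float]]) -> float:
    """Mean squared difference between two sets of dynamics (illustrative stand-in)."""
    ref = spectral_dynamics(reference)
    obs = spectral_dynamics(observed)
    sq = [(r - o) ** 2 for rf, of in zip(ref, obs) for r, o in zip(rf, of)]
    return sum(sq) / len(sq)


# Toy single-band signal: a quiet frame, a loud frame, a quiet frame.
clean = [[1.0], [100.0], [1.0]]
noise_power = 10.0

noisy = add_noise(clean, noise_power)
boosted_noisy = add_noise(apply_gain(clean, 10.0), noise_power)

d_unmodified = dynamics_distortion(clean, noisy)
d_boosted = dynamics_distortion(clean, boosted_noisy)
```

In this toy case the additive noise compresses the 20 dB swing of the clean signal to about 10 dB, and boosting the speech before the noise is added recovers part of that swing, so `d_boosted` is smaller than `d_unmodified`. The approach described in the abstract differs in that it derives a parametric mapping between band-power before and after modification and applies it across multiple bands under a total signal power constraint, rather than using a uniform gain.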