Journal: The Journal of the Acoustical Society of America

Effects of manipulating the signal-to-noise envelope power ratio on speech intelligibility

           

Abstract

Jørgensen and Dau [(2011). J. Acoust. Soc. Am. 130, 1475-1487] suggested a metric for speech intelligibility prediction based on the signal-to-noise envelope power ratio (SNRenv), calculated at the output of a modulation-frequency selective process. In the framework of the speech-based envelope power spectrum model (sEPSM), the SNRenv was demonstrated to account for speech intelligibility data in various conditions with linearly and nonlinearly processed noisy speech, as well as for conditions with stationary and fluctuating interferers. Here, the relation between the SNRenv and speech intelligibility was investigated further by systematically varying the modulation power of either the speech or the noise before mixing the two components, while keeping the overall power ratio of the two components constant. A good correspondence between the data and the corresponding sEPSM predictions was obtained when the noise was manipulated and mixed with the unprocessed speech, consistent with the hypothesis that SNRenv is indicative of speech intelligibility. However, discrepancies between data and predictions occurred for conditions where the speech was manipulated and the noise left untouched. In these conditions, distortions introduced by the applied modulation processing were detrimental to speech intelligibility but were not reflected in the SNRenv metric, thus representing a limitation of the modeling framework. © 2015 Acoustical Society of America.
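To make the SNRenv idea in the abstract concrete, here is a minimal single-band sketch using NumPy only. It is not the sEPSM itself: the published model uses an auditory filterbank, a bank of modulation filters, and an ideal-observer back end, all of which are omitted here. The function names (`hilbert_envelope`, `envelope_power`, `snr_env`, `modulated_target`), the rectangular 2-8 Hz modulation band, the `floor` value, and the use of amplitude-modulated Gaussian noise as a stand-in for speech are assumptions made for illustration only.

```python
import numpy as np

def hilbert_envelope(x):
    """Magnitude of the analytic signal, computed with an FFT (NumPy only)."""
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(spectrum * weights))

def envelope_power(x, fs, band):
    """AC envelope power inside a modulation band, normalized by the DC power.

    `band` is (low_hz, high_hz) and is assumed to exclude 0 Hz and Nyquist.
    """
    env = hilbert_envelope(x)
    n = len(env)
    spec = np.fft.rfft(env) / n
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    in_band = (freqs >= band[0]) & (freqs <= band[1])
    ac_power = 2.0 * np.sum(np.abs(spec[in_band]) ** 2)  # one-sided spectrum
    dc_power = np.abs(spec[0]) ** 2
    return ac_power / dc_power

def snr_env(mixture, noise, fs, band, floor=1e-3):
    """Envelope power ratio: (P_env(mixture) - P_env(noise)) / P_env(noise).

    `floor` is a hypothetical lower limit that keeps the ratio positive.
    """
    p_mix = envelope_power(mixture, fs, band)
    p_noise = max(envelope_power(noise, fs, band), floor)
    p_target = max(p_mix - p_noise, floor)
    return p_target / p_noise

# Toy demonstration: vary the modulation depth of a speech stand-in while
# holding the overall (long-term) power ratio of target and noise at 0 dB.
fs = 8000
t = np.arange(2 * fs) / fs
rng = np.random.default_rng(0)
carrier = rng.standard_normal(t.size)
noise = rng.standard_normal(t.size)
noise /= np.sqrt(np.mean(noise ** 2))

def modulated_target(depth):
    """4 Hz amplitude-modulated noise, rescaled to unit RMS."""
    x = (1.0 + depth * np.sin(2 * np.pi * 4.0 * t)) * carrier
    return x / np.sqrt(np.mean(x ** 2))

for depth in (0.2, 0.8):
    mixture = modulated_target(depth) + noise
    ratio_db = 10 * np.log10(snr_env(mixture, noise, fs, (2.0, 8.0)))
    print(f"modulation depth {depth}: SNRenv = {ratio_db:.1f} dB")
```

In this toy setup the envelope power ratio changes with modulation depth even though the conventional power ratio is fixed, which is the kind of manipulation the study describes. The sketch says nothing about the processing distortions that the abstract identifies as a limitation of the metric.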
