Conference: AES international conference

IMPROVED PREDICTION OF MULTICHANNEL AUDIO QUALITY BY THE USE OF ENVELOPE ITD OF HIGH FREQUENCY SOUNDS


Abstract

Both spatial and timbral factors are important in assessing multichannel audio coding systems. The Choi et al. model [1], which extends ITU-R Rec. BS.1387-1 [2] to multichannel audio coding systems using three spatial features, ITDDist (Interaural Time Difference Distortion), ILDDist (Interaural Level Difference Distortion), and IACCDist (InterAural Cross Correlation Distortion), is one such example. In that implementation, ITDDist was computed only for low-frequency sounds (below 1500 Hz), and ILD distortions were computed only for high-frequency components. This choice is reasonable under classical duplex theory [3]. However, in the high-frequency range, interaural differences in temporal envelopes are also important for spatial perception, especially sound localization. This paper introduces a new model that computes the ITD distortions of temporal envelopes in high-frequency components, in order to investigate quantitatively the role of such ITDs in predicting perceived spatial quality. The computed envelope ITD distortions were highly correlated with perceived sound quality. Moreover, when the proposed envelope ITD distortion was added to the prediction model in [1] as one of the multiple features used to predict overall sound quality, the prediction performance improved relative to the original model in [1].
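To make the idea of an envelope ITD concrete, the following is a minimal sketch and not the authors' model. It assumes a single high-frequency band above 1500 Hz, a Hilbert-style envelope obtained from an FFT-based analytic signal, and a circular cross-correlation searched over lags of up to 1 ms. The function names (`hf_envelope`, `envelope_itd`, `envelope_itd_distortion`) and the absolute-difference distortion measure are hypothetical choices made only for illustration; the abstract does not specify the filterbank, the envelope extraction, or the distortion formula.

```python
import numpy as np

def hf_envelope(x, fs, f_lo=1500.0):
    """Temporal envelope of the part of x above f_lo (Hz).

    Assumption: one broad high-frequency band; a real model would
    likely use an auditory filterbank instead.
    """
    spectrum = np.fft.fft(x)
    freqs = np.fft.fftfreq(len(x), d=1.0 / fs)
    # Keep only positive frequencies at or above f_lo and double them.
    # This high-passes the signal and builds its analytic representation.
    analytic_spec = np.where(freqs >= f_lo, 2.0 * spectrum, 0.0)
    return np.abs(np.fft.ifft(analytic_spec))

def envelope_itd(left, right, fs, max_itd=1e-3, f_lo=1500.0):
    """Envelope ITD in seconds; positive means the right ear lags the left."""
    env_l = hf_envelope(left, fs, f_lo)
    env_r = hf_envelope(right, fs, f_lo)
    env_l = env_l - env_l.mean()
    env_r = env_r - env_r.mean()
    max_lag = int(round(max_itd * fs))
    lags = np.arange(-max_lag, max_lag + 1)
    # Circular cross-correlation of the two envelopes at each candidate lag.
    xcorr = [np.dot(env_l, np.roll(env_r, -lag)) for lag in lags]
    return lags[int(np.argmax(xcorr))] / fs

def envelope_itd_distortion(ref_l, ref_r, test_l, test_r, fs):
    """Hypothetical distortion: |ITD of test signal - ITD of reference|."""
    return abs(envelope_itd(test_l, test_r, fs) - envelope_itd(ref_l, ref_r, fs))

# Usage: a 4 kHz tone burst whose right-ear copy is delayed by 12 samples
# in the reference and by 20 samples in the (hypothetical) coded version.
fs = 48000
t = np.arange(4800) / fs
burst = np.exp(-((t - 0.05) / 0.005) ** 2) * np.sin(2 * np.pi * 4000 * t)
ref_itd = envelope_itd(burst, np.roll(burst, 12), fs)
dist = envelope_itd_distortion(burst, np.roll(burst, 12),
                               burst, np.roll(burst, 20), fs)
```

The carrier here lies well above 1500 Hz, where fine-structure ITD cues are weak under duplex theory, so the delay is recovered from the envelope alone. In a full quality model such a per-frame value would presumably be computed per band and averaged over time before being combined with the other features.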
