Conference: International Conference on Speech and Computer

Acoustic Cues for the Perceptual Assessment of Surround Sound

Abstract

Speech and audio codecs are used in a wide range of multimedia applications, and the first streaming and cloud-based services now offer multichannel sound. Besides perceptual quality, coding-related research focuses on low bitrate and minimal latency. The IETF-standardized Opus codec provides high perceptual quality, low latency, and the ability to code multiple channels at various audio bandwidths up to fullband (20 kHz). In a previous perceptual study on Opus-processed 5.1 surround sound, uncompressed and degraded stimuli were rated on a five-point degradation category scale (DMOS) for six channels at total bitrates between 96 and 192 kbit/s. That study revealed that perceived quality depends on the music characteristics. In the current study, we analyze spectral and music-feature differences between the five music stimuli at three coding bitrates and the uncompressed sound to identify objective causes of the perceptual differences. The results show that samples with annoying audible degradations exhibit larger spectral differences in the LFE channel as well as highly uncorrelated LSPs.
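The abstract does not say which spectral-difference measure was used. As a minimal illustration of comparing an uncompressed channel with its coded counterpart, the sketch below computes a log-spectral distance on a single frame. The metric choice, the function names (`magnitude_spectrum`, `log_spectral_distance`) and the synthetic test signal are assumptions for illustration only, not the authors' method.

```python
import cmath
import math
import random


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short frames)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def log_spectral_distance(reference, degraded, eps=1e-12):
    """RMS difference between the log-magnitude spectra, in dB.

    Assumed metric for illustration; eps guards against log(0).
    """
    ref = magnitude_spectrum(reference)
    deg = magnitude_spectrum(degraded)
    diffs = [20.0 * math.log10((r + eps) / (d + eps))
             for r, d in zip(ref, deg)]
    return math.sqrt(sum(v * v for v in diffs) / len(diffs))


# Hypothetical stand-in for one frame of a single channel (e.g. LFE):
# seeded noise, so the example is reproducible without audio files.
rng = random.Random(0)
reference = [rng.uniform(-1.0, 1.0) for _ in range(64)]

# Toy "degradation": uniform attenuation by a factor of two. A real codec
# changes the spectrum in a frequency-dependent way; this is a sanity check.
degraded = [0.5 * x for x in reference]

lsd_same = log_spectral_distance(reference, reference)
lsd_half = log_spectral_distance(reference, degraded)
print(f"identical: {lsd_same:.2f} dB, attenuated: {lsd_half:.2f} dB")
```

Halving the amplitude shifts every bin by 20·log10(2) ≈ 6.02 dB, so the distance for the attenuated frame should sit at that value, which gives a quick check that the measure behaves as intended before applying it to decoded audio.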