Journal: EURASIP Journal on Audio, Speech, and Music Processing

Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform

Abstract

Binaural cue coding (BCC) is an efficient technique for spatial audio rendering that uses side information such as the interchannel level difference (ICLD), interchannel time difference (ICTD), and interchannel correlation (ICC). Among these cues, the ICTD plays an important role in the auditory spatial image. However, inaccurate estimation of the ICTD may degrade audio quality. In this paper, we develop a novel ICTD estimation algorithm based on the nonuniform discrete Fourier transform (NDFT) and integrate it with the BCC approach to improve the decoded auditory image. Furthermore, a new subjective assessment method is proposed for evaluating the auditory image widths of decoded signals. The test results demonstrate that the NDFT-based scheme achieves a much wider and more externalized auditory image than the existing BCC scheme based on the discrete Fourier transform (DFT). The present technique, regardless of the image width, does not deteriorate the sound quality at the decoder compared to the traditional scheme without ICTD estimation.
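
The abstract does not spell out the estimation algorithm, so the following is only a minimal sketch of the general idea, not the authors' method. It evaluates the spectrum of each channel at arbitrarily placed (nonuniform) frequencies inside one subband, forms the cross-spectrum, and picks the lag that best aligns its phase. The function names `ndft` and `estimate_ictd`, the geometric frequency grid, the quarter-sample search step, and the synthetic test signal are all assumptions made for illustration.

```python
import cmath
import math


def ndft(x, freqs_hz, fs):
    """Evaluate the spectrum of x at arbitrary (nonuniform) frequencies in Hz."""
    return [
        sum(xn * cmath.exp(-2j * math.pi * f * n / fs) for n, xn in enumerate(x))
        for f in freqs_hz
    ]


def estimate_ictd(x1, x2, freqs_hz, fs, max_lag, step=0.25):
    """Return the lag in samples by which x2 trails x1 within the given band.

    Hypothetical helper: a grid search over candidate lags that maximizes the
    real part of the phase-compensated cross-spectrum.
    """
    X1 = ndft(x1, freqs_hz, fs)
    X2 = ndft(x2, freqs_hz, fs)
    # If x2[n] = x1[n - d], each cross-spectrum term has phase +2*pi*f*d/fs.
    cross = [a * b.conjugate() for a, b in zip(X1, X2)]

    best_lag, best_score = 0.0, -math.inf
    steps = int(round(max_lag / step))
    for i in range(-steps, steps + 1):
        lag = i * step
        # Undo the phase a delay of `lag` samples would cause, then sum.
        score = sum(
            (c * cmath.exp(-2j * math.pi * f * lag / fs)).real
            for c, f in zip(cross, freqs_hz)
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# Synthetic example (assumed values): a short 1 kHz burst and a delayed copy.
fs = 16000
n_samples = 256
delay = 5  # samples
x1 = [
    math.exp(-0.5 * ((n - 100) / 6.0) ** 2)
    * math.cos(2 * math.pi * 1000 * (n - 100) / fs)
    for n in range(n_samples)
]
x2 = [0.0] * delay + x1[:-delay]

# Sixteen geometrically spaced analysis frequencies from 500 Hz to 1500 Hz.
band = [500.0 * 3.0 ** (k / 15) for k in range(16)]

estimated = estimate_ictd(x1, x2, band, fs, max_lag=10)
print(estimated)
```

In this sketch a positive result means the second channel lags the first. The appeal of nonuniform sampling here is that the analysis frequencies can be placed freely inside a subband instead of being tied to a uniform DFT grid; how the paper chooses them is not stated in the abstract.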
