首页> 外文会议>International Conference on Information and Communication Technology Convergence >Subband-based upmixing of stereo to 5.1-channel audio signals using deep neural networks
【24h】

Subband-based upmixing of stereo to 5.1-channel audio signals using deep neural networks

机译:使用深度神经网络的基于子带的立体声到5.1声道音频信号的上混

获取原文

摘要

In this paper, we propose a subband-based stereo to 5.1-channels upmixing method using deep neural networks (DNNs) in MPEG-H 3D audio framework. In the training stage, DNN models of rear and center channels are respectively trained by using log-spectral magnitudes of quadrature mirror filter (QMF) sub-bands. In the upmixing stage, stereo input signals are converted into rear and center channels by feed-forward decoding with the trained DNN models. The performance of the proposed method is evaluated using both objective and subjective measures and it is compared with those of conventional methods. Consequently, the proposed method outperforms the conventional methods.
机译:在本文中,我们提出了一种在MPEG-H 3D音频框架中使用深度神经网络(DNN)的基于子带的立体声到5.1声道上混的方法。在训练阶段,通过使用正交镜像滤波器(QMF)子带的对数频谱幅度分别训练后声道和中央声道的DNN模型。在上混音阶段,使用经过训练的DNN模型通过前馈解码将立体声输入信号转换为后置声道和中央声道。使用客观和主观措施对所提出方法的性能进行了评估,并将其与常规方法进行了比较。因此,所提出的方法优于常规方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号