Subband-based upmixing of stereo to 5.1-channel audio signals using deep neural networks



Abstract

In this paper, we propose a subband-based method for upmixing stereo to 5.1-channel audio using deep neural networks (DNNs) within the MPEG-H 3D Audio framework. In the training stage, separate DNN models for the rear and center channels are trained on the log-spectral magnitudes of quadrature mirror filter (QMF) subbands. In the upmixing stage, the stereo input signals are converted into rear and center channels by feed-forward decoding with the trained DNN models. The performance of the proposed method is evaluated using both objective and subjective measures and compared with that of conventional methods. The evaluation shows that the proposed method outperforms the conventional methods.
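
The upmixing stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the network size, activation functions, number of QMF subbands, or how phase is handled, so every such choice below (8 subbands, one hidden layer of 16 tanh units, random untrained weights, the helper names) is an assumption made purely for illustration. The sketch only shows the general shape of the idea: take log-spectral magnitudes of the stereo subband samples, run them through a feed-forward network, and read out predicted log magnitudes for one target channel.

```python
import math
import random


def log_spectral_magnitude(subband_samples, floor=1e-10):
    """Return the log magnitude of each complex subband sample.

    The floor avoids log(0) for silent subbands (an assumed safeguard).
    """
    return [math.log(max(abs(s), floor)) for s in subband_samples]


def dense(x, weights, biases, activation=None):
    """One fully connected layer: activation(W x + b)."""
    out = []
    for row, b in zip(weights, biases):
        z = sum(w * xi for w, xi in zip(row, x)) + b
        out.append(activation(z) if activation else z)
    return out


def init_layer(n_in, n_out, rng):
    """Random placeholder weights; a real model would be trained."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def upmix_frame(left, right, layers):
    """Feed-forward pass from one stereo subband frame to the
    predicted log magnitudes of a single target channel."""
    x = log_spectral_magnitude(left) + log_spectral_magnitude(right)
    for i, (weights, biases) in enumerate(layers):
        is_last = i == len(layers) - 1
        # Hidden layers use tanh; the output layer is linear (assumed).
        x = dense(x, weights, biases, None if is_last else math.tanh)
    return x


rng = random.Random(0)
NUM_BANDS = 8  # hypothetical subband count, chosen only to keep the demo small

# Hypothetical "center channel" model: 2 * NUM_BANDS inputs -> 16 -> NUM_BANDS.
center_layers = [
    init_layer(2 * NUM_BANDS, 16, rng),
    init_layer(16, NUM_BANDS, rng),
]

# Synthetic complex subband samples standing in for QMF analysis output.
left = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(NUM_BANDS)]
right = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(NUM_BANDS)]

center_log_mag = upmix_frame(left, right, center_layers)
center_mag = [math.exp(v) for v in center_log_mag]  # back to linear magnitude
```

Under the same assumptions, a rear-channel model would be a second set of layers applied to the same stereo input, matching the abstract's statement that rear and center models are trained separately.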


Bibliographic details

Source
International Conference on Information and Communication Technology Convergence | 2016 | pp. 377-380 | 4 pages
Authors
Su Yeon Park; Chan Jun Chun; Hong Kook Kim
Original format: PDF
Keywords
Training; Transform coding; Three-dimensional displays; Channel estimation; Neural networks; Decoding; Low-pass filters


Similar documents


1. Recognition of words from brain-generated signals of speech-impaired people: Application of autoencoders as a neural Turing machine controller in deep neural networks [J]. Boloukian Behzad, Safi-Esfahani Faramarz. Neural Networks: The Official Journal of the International Neural Network Society. 2020
2. Automatic Classification of Motor Impairment Neural Disorders from EEG Signals Using Deep Convolutional Neural Networks [J]. Vrbancic Grega, Podgorelec Vili. Elektronika ir Elektrotechnika. 2018, No. 4
3. Classification of Audio Radar signals Using Radial Basis Function Neural Networks [J]. Trent McConaghy, Henry Leung, Eloi Bosse. IEEE Transactions on Instrumentation and Measurement. 2003, No. 6
4. Subband-based upmixing of stereo to 5.1-channel audio signals using deep neural networks [C]. Su Yeon Park, Chan Jun Chun, Hong Kook Kim. International Conference on Information and Communication Technology Convergence. 2016
5. Going Deeper with Recurrent Convolutional Neural Networks for Classifying P300 BCI Signals [D]. Maddula, Ramesh Krishna. 2017
6. A Deep Learning Model for Fault Diagnosis with a Deep Neural Network and Feature Fusion on Multi-Channel Sensory Signals [O]. Qing Ye, Shaohu Liu, Changhua Liu. 2020
7. Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal [O]. García, Mario Alejandro, Destéfanis, Eduardo A. 2017
