首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Bandwidth Extension of Musical Audio Signals With No Side Information Using Dilated Convolutional Neural Networks
【24h】

Bandwidth Extension of Musical Audio Signals With No Side Information Using Dilated Convolutional Neural Networks

机译:带有扩展卷积神经网络的无边信息的音乐音频信号的带宽扩展

获取原文

摘要

Bandwidth extension has a long history in audio processing. While speech processing tools do not rely on side information, production-ready bandwidth extension tools of general audio signals rely on side information that has to be transmitted alongside the bitstream of the low frequency part, mostly because polyphonic music has a more complex and less predictable spectral structure than speech.This paper studies the benefit of considering a dilated fully convolutional neural network to perform the bandwidth extension of musical audio signals with no side information on the magnitude spectra. Experimental evaluation using two public datasets, medley-solos-db and gtzan, respectively of monophonic and polyphonic music demonstrate that the proposed architecture achieves state of the art performance.
机译:带宽扩展在音频处理中具有悠久的历史。虽然语音处理工具不依赖于辅助信息,但通用音频信号的生产就绪带宽扩展工具依赖于必须与低频部分的比特流一起发送的辅助信息,这主要是因为和弦音乐的复杂性和可预测性较差本文研究了考虑使用膨胀的全卷积神经网络来执行音乐音频信号的带宽扩展,而幅度谱上没有辅助信息的好处。使用分别用于单音和和音音乐的两个公共数据集medley-solos-db和gtzan进行的实验评估表明,所提出的体系结构达到了最先进的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号