Audio style transfer using shallow convolutional networks and random filters

Jiyou Chen; Gaobo Yang; Huihuang Zhao; Manimaran Ramasamy

首页> 外文期刊>Multimedia Tools and Applications >Audio style transfer using shallow convolutional networks and random filters

【24h】

Audio style transfer using shallow convolutional networks and random filters

机译：使用浅卷积网络和随机滤波器进行音频风格转移

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Recently, with the advent of Convolutional Neural Network (CNN) era, Neural style transfer on images has become a very active research topic and the style of an image can be transferred to another image through a CNN so that the image retains both its own content and another style of image. In this work, we propose an algorithm for audio style transfer that uses the force of CNN to generate a new audio from a style audio. We use Continuous Wavelet Transfer(CWT) to convert the audio into a spectrogram and then use the spectrogram as the representation of the audio image through image style transfer method to obtain a new image, and finally, generate an audio using iterative phase reconstruction with Griffin-Lim. We succeed in transferring audio such as light music but had difficulty in transferring audio that has lyrics and high-level metrics such as emotion or tone. We propose several measures to improve the quality of audio and a lot of experimental results shows that our method is better than other methods in terms of sound quality.

机译：最近，随着卷积神经网络（CNN）时代的出现，图像上的神经样式转移已成为一个非常活跃的研究主题，图像的样式可以通过CNN传送到另一图像，使得图像保留其自身内容和另一种形象。在这项工作中，我们提出了一种用于音频样式传输的算法，它使用CNN的力来生成来自样式音频的新音频。我们使用连续小波传输（CWT）将音频转换为频谱图，然后通过图像样式传输方法使用频谱图作为音频图像的表示，以获取新图像，最后，使用Griffin使用迭代相重建生成音频 - 我。我们成功地转移了轻松音乐等音频，但难以传输具有歌词和高级度量的音频，例如情感或音调。我们提出了几项措施来提高音频质量和许多实验结果表明，我们的方法比声音质量方面的方法更好。

著录项

来源
《Multimedia Tools and Applications》 |2020年第22期|15043-15057|共15页
作者
Jiyou Chen; Gaobo Yang; Huihuang Zhao; Manimaran Ramasamy;
展开▼
作者单位

College of Information Science and Engineering Hunan University Changsha 410082 China Hunan Provincial Key Laboratory of Intelligent Information Processing and Application Hengyang Normal University Hengyang 421002 China;

College of Information Science and Engineering Hunan University Changsha 410082 China;

Hunan Provincial Key Laboratory of Intelligent Information Processing and Application Hengyang Normal University Hengyang 421002 China;

Hunan Provincial Key Laboratory of Intelligent Information Processing and Application Hengyang Normal University Hengyang 421002 China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Audio style transfer; Continuous wavelet transfer; Deep neural network; Spectrogram;

机译：音频样式转移;连续小波转移;深神经网络;谱图;

相似文献

外文文献
中文文献
专利

1. TimeScaleNet: A Multiresolution Approach for Raw Audio Recognition Using Learnable Biquadratic IIR Filters and Residual Networks of Depthwise-Separable One-Dimensional Atrous Convolutions [J] . Bavu Eric, Ramamonjy Aro, Pujol Hadrien, Selected Topics in Signal Processing, IEEE Journal of . 2019,第2期

机译：TimeScaleNet：使用可学习的二阶IIR滤波器和深度可分离的一维Atrous卷积的余数网络的原始音频识别的多分辨率方法
2. Image style transfer using convolutional neural networks based on transfer learning [J] . Varun Gupta, Rajat Sadana, Swastikaa Moudgil International journal of computational systems engineering . 2019,第1期

机译：基于转移学习的卷积神经网络图像风格转移
3. Image style transfer using convolutional neural networks based on transfer learning [J] . Varun Gupta, Rajat Sadana, Swastikaa Moudgil International journal of computational systems engineering . 2019,第1期

机译：基于转移学习的卷积神经网络图像风格转移
4. Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer [C] . Xin Wang, Geoffrey Oxholm, Da Zhang, IEEE Conference on Computer Vision and Pattern Recognition . 2017

机译：多峰传输：快速艺术风格转换的分层深度卷积神经网络
5. An Empirical and Theoretical Investigation of Random Reinforced Forests and Shallow Convolutional Neural Networks [D] . Ganta, Nikhil. 2021

机译：随机钢筋林和浅卷积神经网络的实证与理论研究
6. Automatic skin lesion segmentation by coupling deep fully convolutional networks and shallow network with textons [O] . Lei Zhang, Guang Yang, Xujiong Ye 2019

机译：通过将深层全卷积网络和浅层网络与texton耦合来自动进行皮肤病变分割
7. Image style transfer using convolutional neural networks based on transfer learning [O] . Swastikaa Moudgil, Varun Gupta, Rajat Sadana 2019

机译：基于转移学习的卷积神经网络图像风格转移

Audio style transfer using shallow convolutional networks and random filters

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅