Audio Signal Mapping into Spectrogram-Based Images for Deep Learning Applications

机译：音频信号映射到基于谱的基于频谱学习应用程序的图像

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Various features generated from raw audio signals can be used as an input of a deep learning model. They include hand-crafted features such as mel-frequency cepstral coefficients, two-dimensional time-frequency representations and raw audio data. In most cases, the time-frequency representations are related to so-called spectrogram-based images. Having an image at the deep learning input enables to apply performance improvement accumulated in video and image processing. However, spectrogram-based images have some specific properties that should be taken into account when a deep learning model is designed. This paper deals with mapping of audio signals into the most common spectrogram-based images. Some unique properties of these images as well as the way how they are generated are analyzed here for a particular case of fridge sounds.

机译：从原始音频信号产生的各种特征可以用作深度学习模型的输入。它们包括诸如熔融频率谱系齐数，二维时频表示和原始音频数据的手工制作的特征。在大多数情况下，时频表示与所谓的基于频谱图的图像有关。在深度学习输入处具有图像，可以应用累积在视频和图像处理中的性能改进。然而，基于频谱图的图像具有一些特定的属性，当设计深度学习模型时应考虑。本文涉及音频信号映射到最常见的基于频谱图的图像。这些图像的一些独特属性以及如何在此处分析它们的方式，以针对冰箱声音的特定情况进行分析。

著录项

来源
《International Symposium INFOTEH-JAHORINA》|2021年|1-6|共6页
会议地点
作者
Dejan Ćirić; Zoran Perić; Jelena Nikolić; Nikola Vučić;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Deep learning; Time-frequency analysis; Cepstral analysis; Image processing; Transforms; Signal mapping;

机译：深度学习;时间频率分析;临时分析;图像处理;转换;信号映射;

相似文献

外文文献
中文文献
专利

1. Image Formation, Deep Learning, and Physical Implication of Multiple Time-Series One-Dimensional Signals: Method and Application [J] . Liu Guangyu, Zhu Ling, Yu Weijie, IEEE transactions on industrial informatics . 2021,第7期

机译：多时间级一维信号的图像形成，深度学习和物理含义：方法和应用
2. Development of Optimal Feature Selection and Deep Learning Toward Hungry Stomach Detection Using Audio Signals [J] . Maria A., Jeyaseelan A. Sengol Journal of control, automation and electrical systems . 2021,第4期

机译：使用音频信号开发最佳特征选择和深度学习饥饿胃检测
3. An Associative Memorization Architecture of Extracted Musical Features from Audio Signals by Deep Learning Architecture [J] . Tadaaki Niwa, Keitaro Naruse, Ryosuke Ooe, Procedia Computer Science . 2014,第1期

机译：深度学习架构从音频信号中提取音乐特征的关联记忆架构
4. A Deep Learning Approach for Low-Latency Packet Loss Concealment of Audio Signals in Networked Music Performance Applications [C] . Prateek Verma, Alessandro Ilic Mezzay, Chris Chafe, Conference of Open Innovations Association . 2020

机译：网络音乐表演应用中用于音频信号的低延迟丢包隐藏的深度学习方法
5. Machine Learning Methods for Quantification of Depression Severity and Prediction of Recovery Trajectory Using Longitudinal Video and Audio Data, with Applications to Deep Brain Stimulation Treatment Optimization [D] . Harati, Sahar. 2019

机译：机器学习方法，用于量化抑郁症严重性和恢复轨迹预测使用纵向视频和音频数据，应用于深脑刺激处理优化
6. Towards End-to-End Acoustic Localization Using Deep Learning: From Audio Signals to Source Position Coordinates [O] . Juan Manuel Vera-Diaz, Daniel Pizarro, Javier Macias-Guarasa 2018

机译：使用深度学习实现端到端声学定位：从音频信号到源位置坐标
7. An Associative Memorization Architecture of Extracted Musical Features from Audio Signals by Deep Learning Architecture [O] . Niwa Tadaaki, Naruse Keitaro, Ooe Ryosuke, 2014

机译：深度学习架构从音频信号中提取音乐特征的关联记忆架构

Audio Signal Mapping into Spectrogram-Based Images for Deep Learning Applications

摘要

著录项

相似文献

相关主题

期刊订阅