Multi-Channel Audio Source Separation Using Azimuth-Frequency Analysis and Convolutional Neural Network

机译：使用方位频分析和卷积神经网络进行多通道音频源分离

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Since MPEG-H supports not only channel-based but also object-based audio content, there is a need for a sound source separation technique that converts channel-based to object-based audio. Among the various sound source separation techniques, azimuth-frequency (AF) based sound source separation has been proposed for converting channel-based audio to object-based audio. Unfortunately, it is difficult to set the optimal azimuth and width using this technique. In this paper, we propose a method to determine the optimal azimuth and width based on a convolutional neural network (CNN) classifier. First, depending on numerous azimuths and widths, different sets of audio signals are separated. After that, each audio set is categorized into a specific audio class using the CNN classifier. Then, in order to separate a desired audio signal, the azimuth and width with the highest similarity for a given class are selected. The performance of the CNN classifier is evaluated in terms of separation accuracy and objective measures such as signal-to-distortion ratio (SDR), signal-to-interference ratio (SIR), and signal-to-artifacts ratio (SAR). Consequently, the proposed method provides higher SDR, SAR, SIR, and separation accuracy than a minimum variance distortionless response (MVDR) beamformer as well as a method that only uses AF analysis.

机译：由于MPEG-H不仅支持基于频道而且基于对象的音频内容，因此需要一种声源分离技术，其将基于对象的音频转换为基于对象的音频。在各种声源分离技术中，已经提出了基于方位频（AF）的声源分离，用于将基于信道的音频转换为基于对象的音频。不幸的是，很难使用这种技术设置最佳方位角和宽度。在本文中，我们提出了一种基于卷积神经网络（CNN）分类器的最佳方位角和宽度的方法。首先，取决于许多方位角和宽度，分离不同的音频信号。之后，使用CNN分类器将每个音频集分类为特定的音频类。然后，为了分离所需的音频信号，选择具有对给定类的最高相似性的方位角和宽度。根据分离精度和客观度量评估CNN分类器的性能，例如信令对失真率（SDR），信号到干扰比（SIR）和信号到伪像比（SAR）。因此，所提出的方法提供更高的SDR，SAR，SAR和分离精度，而不是最小方差失真响应（MVDR）波束形成器以及仅使用AF分析的方法。

著录项

来源
《International Conference on Artificial Intelligence in Information and Communication》|2019年|584 p. :|共4页
会议地点
作者
Jung Min Moon; Chan Jun Chun; Jun Ho Kim; Hong Kook Kim; Tae Woo Kim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Source separation; Azimuth; Convolutional neural networks; Distortion measurement; Signal to noise ratio; Transform coding; Distortion;

机译：源分离;方位角;卷积神经网络;失真测量;信噪比;转换编码;失真;

相似文献

外文文献
中文文献
专利

1. An Emotion Analysis Method Using Multi-Channel Convolution Neural Network in Social Networks [J] . Lu Xinxin, Zhang Hong Computer Modeling in Engineering & Sciences . 2020,第1期

机译：一种在社交网络中使用多通道卷积神经网络的情感分析方法
2. Sentiment analysis in non-fixed length audios using a Fully Convolutional Neural Network [J] . Garcia-Ordas Maria Teresa, Alaiz-Moreton Hector, Benitez-Andrades Jose Alberto, Biomedical signal processing and control . 2021,第Auga期

机译：使用完全卷积神经网络的非固定长度Audios的情感分析
3. Audio Steganalysis based on collaboration of fractal dimensions and convolutional neural networks [J] . Mohtasham-zadeh Vahid, Mosleh Mohammad Multimedia Tools and Applications . 2019,第9期

机译：基于分形维和卷积神经网络协作的音频隐写分析
4. Multi-Channel Audio Source Separation Using Azimuth-Frequency Analysis and Convolutional Neural Network [C] . Jung Min Moon, Chan Jun Chun, Jun Ho Kim, The 1st International Conference onArtificial Intelligence in Information and Communication . 2019

机译：基于方位角频率分析和卷积神经网络的多声道音频源分离
5. Separation of a Known Speaker's Voice with a Convolutional Neural Network [D] . Threet, Michael. 2018

机译：用卷积神经网络分离说话人的语音
6. Configuration-Invariant Sound Localization Technique Using Azimuth-Frequency Representation and Convolutional Neural Networks [O] . Chanjun Chun, Kwang Myung Jeon, Wooyeol Choi 2020

机译：配置 - 不变的声音本地化技术使用方位频率表示和卷积神经网络
7. Mmdenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation [O] . Naoya Takahashi, Nabarun Goswami, Yuki Mitsufuji 2018

机译：MMDenSelstm：有效组合卷积和经常性神经网络，用于音频源分离
8. Neural Networks for Blind Separation with Unknown Number of Sources [R] . Cichocki, A., Karhunen, J., Kasprzak, W., 1998

机译：具有未知源数的盲分离神经网络

Multi-Channel Audio Source Separation Using Azimuth-Frequency Analysis and Convolutional Neural Network

摘要

著录项

相似文献

相关主题

期刊订阅