首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Configuration-Invariant Sound Localization Technique Using Azimuth-Frequency Representation and Convolutional Neural Networks

【2h】

Configuration-Invariant Sound Localization Technique Using Azimuth-Frequency Representation and Convolutional Neural Networks

机译：配置 - 不变的声音本地化技术使用方位频率表示和卷积神经网络

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep neural networks (DNNs) have achieved significant advancements in speech processing, and numerous types of DNN architectures have been proposed in the field of sound localization. When a DNN model is deployed for sound localization, a fixed input size is required. This is generally determined by the number of microphones, the fast Fourier transform size, and the frame size. if the numbers or configurations of the microphones change, the DNN model should be retrained because the size of the input features changes. in this paper, we propose a configuration-invariant sound localization technique using the azimuth-frequency representation and convolutional neural networks (CNNs). the proposed CNN model receives the azimuth-frequency representation instead of time-frequency features as the input features. the proposed model was evaluated in different environments from the microphone configuration in which it was originally trained. for evaluation, single sound source is simulated using the image method. Through the evaluations, it was confirmed that the localization performance was superior to the conventional steered response power phase transform (SRP-PHAT) and multiple signal classification (MUSIC) methods.

机译：深度神经网络（DNN）在语音处理方面取得了显着的进步，并且在声音定位领域已经提出了许多类型的DNN架构。部署DNN模型以进行声音定位时，需要固定输入大小。这通常由麦克风，快速傅里叶变换大小和帧大小决定。如果麦克风的数量或配置改变，则应扰断DNN模型，因为输入功能的大小发生变化。在本文中，我们提出了一种使用方位频表示和卷积神经网络（CNN）的配置 - 不变的声音本地化技术。所提出的CNN模型接收方位频表示而不是时频特征作为输入特征。从最初培训的麦克风配置中的不同环境中评估了所提出的模型。对于评估，使用图像方法模拟单声源。通过评估，证实了本地化性能优于传统的转向响应功率相变（SRP-PHAT）和多个信号分类（音乐）方法。

著录项

期刊名称 Sensors (Basel Switzerland)
作者
Chanjun Chun; Kwang Myung Jeon; Wooyeol Choi;
展开▼
作者单位

展开▼
年(卷),期 2020(20),13
年度 2020
页码 -1
总页数 10
原文格式 PDF
正文语种
中图分类
关键词
azimuth-frequency representation; configuration-invariant; convolutional neural network (CNN); sound localization;

相似文献

外文文献
中文文献
专利

1. Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks [J] . Adavanne Sharath, Politis Archontis, Nikunen Joonas, Selected Topics in Signal Processing, IEEE Journal of . 2019,第1期

机译：使用卷积递归神经网络进行声音事件定位和重叠源检测
2. Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks [J] . Adavanne Sharath, Politis Archontis, Nikunen Joonas, Selected Topics in Signal Processing, IEEE Journal of . 2019,第1期

机译：使用卷积经常性神经网络定位和检测重叠源的定位
3. DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization [J] . Alif Bin Abdul Qayyum, K. M. Naimul Hassan, Adrita Anika, EURASIP journal on audio, speech, and music processing . 2020,第1期

机译：Doanet：用无人机嵌入式声源定位搜索和救出深度扩张的卷积神经网络方法
4. Multi-Channel Audio Source Separation Using Azimuth-Frequency Analysis and Convolutional Neural Network [C] . Jung Min Moon, Chan Jun Chun, Jun Ho Kim, The 1st International Conference onArtificial Intelligence in Information and Communication . 2019

机译：基于方位角频率分析和卷积神经网络的多声道音频源分离
5. Object Detection Techniques Using Convolutional Neural Networks [D] . Panthula, Ganesh Anirudh 2018

机译：使用卷积神经网络的目标检测技术
6. EEG signal analysis using classification techniques: Logistic regression artificial neural networks support vector machines and convolutional neural networks [O] . Maria Camila Guerrero, Juan Sebastián Parada, Helbert Eduardo Espitia 2021

机译：EEG信号分析使用分类技术：Logistic回归人工神经网络支持向量机和卷积神经网络
7. Hierarchical Detection of Sound Events and their Localization Using Convolutional Neural Networks with Adaptive Thresholds [O] . Sotirios Panagiotis Chytas, Gerasimos Potamianos 2019

机译：使用具有自适应阈值的卷积神经网络分层检测声音事件及其本地化

Configuration-Invariant Sound Localization Technique Using Azimuth-Frequency Representation and Convolutional Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅