首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Configuration-Invariant Sound Localization Technique Using Azimuth-Frequency Representation and Convolutional Neural Networks
【2h】

Configuration-Invariant Sound Localization Technique Using Azimuth-Frequency Representation and Convolutional Neural Networks

机译:配置 - 不变的声音本地化技术使用方位频率表示和卷积神经网络

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Deep neural networks (DNNs) have achieved significant advancements in speech processing, and numerous types of DNN architectures have been proposed in the field of sound localization. When a DNN model is deployed for sound localization, a fixed input size is required. This is generally determined by the number of microphones, the fast Fourier transform size, and the frame size. if the numbers or configurations of the microphones change, the DNN model should be retrained because the size of the input features changes. in this paper, we propose a configuration-invariant sound localization technique using the azimuth-frequency representation and convolutional neural networks (CNNs). the proposed CNN model receives the azimuth-frequency representation instead of time-frequency features as the input features. the proposed model was evaluated in different environments from the microphone configuration in which it was originally trained. for evaluation, single sound source is simulated using the image method. Through the evaluations, it was confirmed that the localization performance was superior to the conventional steered response power phase transform (SRP-PHAT) and multiple signal classification (MUSIC) methods.
机译:深度神经网络(DNN)在语音处理方面取得了显着的进步,并且在声音定位领域已经提出了许多类型的DNN架构。部署DNN模型以进行声音定位时,需要固定输入大小。这通常由麦克风,快速傅里叶变换大小和帧大小决定。如果麦克风的数量或配置改变,则应扰断DNN模型,因为输入功能的大小发生变化。在本文中,我们提出了一种使用方位频表示和卷积神经网络(CNN)的配置 - 不变的声音本地化技术。所提出的CNN模型接收方位频表示而不是时频特征作为输入特征。从最初培训的麦克风配置中的不同环境中评估了所提出的模型。对于评估,使用图像方法模拟单声源。通过评估,证实了本地化性能优于传统的转向响应功率相变(SRP-PHAT)和多个信号分类(音乐)方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号