首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >A General Network Architecture for Sound Event Localization and Detection Using Transfer Learning and Recurrent Neural Network
【24h】

A General Network Architecture for Sound Event Localization and Detection Using Transfer Learning and Recurrent Neural Network

机译:使用传输学习和经常性神经网络的声音事件定位和检测的通用网络架构

获取原文

摘要

Polyphonic sound event detection and localization (SELD) task is challenging because it is difficult to jointly optimize sound event detection (SED) and direction-of-arrival (DOA) estimation in the same network. We propose a general network architecture for SELD in which the SELD network comprises sub-networks that are pre-trained to solve SED and DOA estimation independently, and a recurrent layer that combines the SED and DOA estimation outputs into SELD outputs. The recurrent layer does the alignment between the sound classes and DOAs of sound events while being unaware of how these outputs are produced by the upstream SED and DOA estimation algorithms. This simple network architecture is compatible with different existing SED and DOA estimation algorithms. It is highly practical since the sub-networks can be improved independently. The experimental results using the DCASE 2020 SELD dataset show that the performances of our proposed network architecture using different SED and DOA estimation algorithms and different audio formats are competitive with other state-of-the-art SELD algorithms. The source code for the proposed SELD network architecture is available at Github 1.
机译:Polyphonic Sound事件检测和定位(SELED)任务是具有挑战性的,因为很难在同一网络中共同优化声音事件检测(SED)和到达方向(DOA)估计。我们提出了一种用于SELED的一般网络架构,其中SELD网络包括预先训练以独立解决SED和DOA估计的子网,以及将SED和DOA估计输出的复制层与SELD输出相结合。复发层在声音类和DOAS之间进行对齐,同时不知道这些输出如何由上游SED和DOA估计算法产生。这个简单的网络架构与不同现有的SED和DOA估计算法兼容。由于子网可以独立改进,因此非常实用。使用DCASE 2020 SELD数据集的实验结果表明,使用不同的SED和DOA估计算法和不同的音频格式的所提出的网络架构的性能与其他最先进的SELD算法竞争。拟议的SELD网络架构的源代码可在Github上获得 1

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号