首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events
【24h】

Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events

机译:四元数卷积神经网络用于3D声音事件的检测和定位

获取原文

摘要

Learning from data in the quaternion domain enables us to exploit internal dependencies of 4D signals and treating them as a single entity. One of the models that perfectly suits with quaternion-valued data processing is represented by 3D acoustic signals in their spherical harmonics decomposition. In this paper, we address the problem of localizing and detecting sound events in the spatial sound field by using quaternion-valued data processing. In particular, we consider the spherical harmonic components of the signals captured by a first-order ambisonic microphone and process them by using a quaternion convolutional neural network. Experimental results show that the proposed approach exploits the correlated nature of the ambisonic signals, thus improving accuracy results in 3D sound event detection and localization.
机译:从四元数域中的数据学习使我们能够利用4D信号的内部依赖性并将其视为单个实体。其中一种非常适合四元数值数据处理的模型由3D声波信号在其球谐分解中表示。在本文中,我们通过使用四元数值数据处理解决了在空间声场中定位和检测声音事件的问题。特别是,我们考虑了由一阶歧义麦克风捕获的信号的球谐分量,并使用四元数卷积神经网络对其进行了处理。实验结果表明,所提出的方法利用了歧义信号的相关性质,从而提高了3D声音事件检测和定位的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号