首页> 外文期刊>Multimedia Tools and Applications >Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks
【24h】

Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks

机译:馈通神经网络估计声场录制的球面谐波系数

获取原文
获取原文并翻译 | 示例
           

摘要

Sound field recording using spherical harmonics (SH) has been widely used. However, too many microphones are needed when recording sound fields over large areas, due to the capture of the higher order of spherical harmonic coefficients. The theory of GO in deep learning inspired us. With training the data much less than all GO's legal positions data, the Alpha Go has defeated top GO players. According to the information learned from a specific dataset, the higher spherical harmonics coefficients may be estimated with few captured sound pressures. In this paper, a learning-based approach for estimation of the SH coefficients has been investigated. In the proposed approach, SH coefficients are estimated with a feed-forward neural network (FNN) based on measurements of a spherical array. We generate a uniformly distributed dataset, try to evaluate the method on an average situation. Moreover, with the real sound field data in the SOFiA dataset, we try to evaluate the performance of our method when the correlations of data are weak. Experimental results show that the proposed approach achieves higher estimation accuracy of SH coefficients than a previously reported method. In simulations, 9 microphones' performance using the proposed approach can approximate an array with 16 microphones. The experiments confirmed the feasibility of estimating the SH coefficients with the data-driven method. Thus in a specific application, it can be used to reduce the required number of microphones.
机译:使用球形谐波(SH)的声场录制已被广泛使用。然而,由于球形谐波系数的高阶捕获,在大面积上记录大面积的声场时需要太多麦克风。深入学习的理论激发了美国。通过培训数据远低于所有Go的法律职位数据,Alpha Go已击败了顶级游戏。根据来自特定数据集的信息,可以估计较少的球形谐波系数,据捕获的声压很少。本文研究了用于估计SH系数的基于学习的方法。在所提出的方法中,基于球面阵列的测量,通过前馈神经网络(FNN)估计SH系数。我们生成一个统一分布的数据集,尝试在平均情况下评估方法。此外,在Sofia数据集中的真实声场数据,我们尝试在数据的相关性弱时评估我们的方法的性能。实验结果表明,该方法达到了SH系数的估计精度,而不是先前报道的方法。在仿真中,使用所提出的方法的9个麦克风的性能可以近似具有16个麦克风的数组。实验证实了利用数据驱动方法估计SH系数的可行性。因此,在特定应用中,它可以用于减少所需数量的麦克风。

著录项

  • 来源
    《Multimedia Tools and Applications》 |2021年第4期|6187-6202|共16页
  • 作者单位

    National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University Wuhan 430072 China;

    National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University Wuhan 430072 China;

    National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University Wuhan 430072 China;

    National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Collaborative Innovation Center of Geospatial Technology Wuhan 430079 China;

    National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Collaborative Innovation Center of Geospatial Technology Wuhan 430079 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Spherical harmonics; Microphone array; Sound field recording; Sound field reproduction;

    机译:球形谐波;麦克风阵列;声场录音;声场再现;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号