Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks

Lingkun Zhang; Xiaochen Wang; Ruimin Hu; Dengshi Li; Weipin Tu

首页> 外文期刊>Multimedia Tools and Applications >Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks

【24h】

Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks

机译：馈通神经网络估计声场录制的球面谐波系数

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sound field recording using spherical harmonics (SH) has been widely used. However, too many microphones are needed when recording sound fields over large areas, due to the capture of the higher order of spherical harmonic coefficients. The theory of GO in deep learning inspired us. With training the data much less than all GO's legal positions data, the Alpha Go has defeated top GO players. According to the information learned from a specific dataset, the higher spherical harmonics coefficients may be estimated with few captured sound pressures. In this paper, a learning-based approach for estimation of the SH coefficients has been investigated. In the proposed approach, SH coefficients are estimated with a feed-forward neural network (FNN) based on measurements of a spherical array. We generate a uniformly distributed dataset, try to evaluate the method on an average situation. Moreover, with the real sound field data in the SOFiA dataset, we try to evaluate the performance of our method when the correlations of data are weak. Experimental results show that the proposed approach achieves higher estimation accuracy of SH coefficients than a previously reported method. In simulations, 9 microphones' performance using the proposed approach can approximate an array with 16 microphones. The experiments confirmed the feasibility of estimating the SH coefficients with the data-driven method. Thus in a specific application, it can be used to reduce the required number of microphones.

机译：使用球形谐波（SH）的声场录制已被广泛使用。然而，由于球形谐波系数的高阶捕获，在大面积上记录大面积的声场时需要太多麦克风。深入学习的理论激发了美国。通过培训数据远低于所有Go的法律职位数据，Alpha Go已击败了顶级游戏。根据来自特定数据集的信息，可以估计较少的球形谐波系数，据捕获的声压很少。本文研究了用于估计SH系数的基于学习的方法。在所提出的方法中，基于球面阵列的测量，通过前馈神经网络（FNN）估计SH系数。我们生成一个统一分布的数据集，尝试在平均情况下评估方法。此外，在Sofia数据集中的真实声场数据，我们尝试在数据的相关性弱时评估我们的方法的性能。实验结果表明，该方法达到了SH系数的估计精度，而不是先前报道的方法。在仿真中，使用所提出的方法的9个麦克风的性能可以近似具有16个麦克风的数组。实验证实了利用数据驱动方法估计SH系数的可行性。因此，在特定应用中，它可以用于减少所需数量的麦克风。

著录项

来源
《Multimedia Tools and Applications》 |2021年第4期|6187-6202|共16页
作者
Lingkun Zhang; Xiaochen Wang; Ruimin Hu; Dengshi Li; Weipin Tu;
展开▼
作者单位

National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University Wuhan 430072 China;

National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University Wuhan 430072 China;

National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University Wuhan 430072 China;

National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Collaborative Innovation Center of Geospatial Technology Wuhan 430079 China;

National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan 430072 China Collaborative Innovation Center of Geospatial Technology Wuhan 430079 China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Spherical harmonics; Microphone array; Sound field recording; Sound field reproduction;

机译：球形谐波;麦克风阵列;声场录音;声场再现;

相似文献

外文文献
中文文献
专利

1. Wave field synthesis of a sound field described by spherical harmonics expansion coefficients [J] . Ahrens J., Spors S. The Journal of the Acoustical Society of America . 2012,第3aPta1期

机译：用球谐函数展开系数描述的声场的波场合成
2. Translations of spherical harmonics expansion coefficients for a sound field using plane wave expansions (L) [J] . Wang Yan, Chen Kean The Journal of the Acoustical Society of America . 2018,第6aPta1期

机译：使用平面波扩建的球形谐波膨胀系数对声场的转换（L）
3. Translations of spherical harmonics expansion coefficients for a sound field using plane wave expansions (L) [J] . Wang Yan, Chen Kean The Journal of the Acoustical Society of America . 2018,第6aPta2期

机译：使用平面波扩建的球形谐波膨胀系数对声场的转换（L）
4. Data mining of spherical harmonic (SH) coefficients using artificial neural networks (ANN) [C] . Muhammad Athar Javaid, Wolfgang Keller 2017 Fifth International Conference on Aerospace Science amp; Engineering . 2017

机译：使用人工神经网络（ANN）进行球谐（SH）系数的数据挖掘
5. Analytic Spherical Harmonic Coefficients for Polygonal Area Lights [D] . Wang, Jingwen. 2018

机译：多边形区域灯的分析球形谐波系数
6. Sound Source Localization Using Non-Conformal Surface Sound Field Transformation Based on Spherical Harmonic Wave Decomposition [O] . Lanyue Zhang, Dandan Ding, Desen Yang, 2017

机译：基于球面谐波分解的非共形表面声场变换的声源定位
7. Classifying Heart Sound Recordings using Deep Convolutional Neural Networks and Mel:Frequency Cepstral Coefficients [O] . Jonathan Rubin, Rui Abreu, Anurag Ganguli, 2016

机译：使用深卷积神经网络和MEL进行分类心脏录音：频率肌肌系数

Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks

摘要

著录项

相似文献

相关主题

期刊订阅