首页> 外文会议>International Symposium on Communication Systems, Networks and Digital Signal Processing >Sound Source Localization by Proposed Subband Adaptive GEVD Algorithm Based on GammaTone Filter Bank in Undesirable Acoustical Conditions
【24h】

Sound Source Localization by Proposed Subband Adaptive GEVD Algorithm Based on GammaTone Filter Bank in Undesirable Acoustical Conditions

机译:基于伽马托滤波器组的基于伽马托滤波器组的提出的子带自适应GEVD算法声源定位

获取原文

摘要

Sound source localization methods are implemented with different algorithms. Some methods are based on received energy to microphones and some other methods estimate the Time Difference Of Arrival (TDOA) of sound sources. The energy-based methods have high computational complexity but the Time Delay Estimation (TDE)-based methods have low accuracy in undesirable acoustical conditions. Adaptive Generalized EigenValue Decomposition (GEVD) algorithm is a method for TDE that has appropriate accuracy in noisy conditions but it does not have an acceptable performance in noisy-reverberant scenarios. The proposed method in this paper is subband adaptive GEVD based on GammaTone filter bank for TDE. Since the speech signal information is different in frequency bands, then it is necessary to present a method to concentrate on low frequency components of speech signal to have better accuracy for sound source localization. In proposed method, firstly the two microphone signals are divided to different subbands with GammaTone filter bank. This filter bank is designed based on the human auditory. Then, the GEVD function is implemented on these subbands information. Finally, the output of GEVD function are weighted and combined based on the frequency spectrum energy in different subbands. The experiments on noisy and reverberant scenarios show the superiority of proposed method in comparison with GEVD algorithm in different scenarios. Although proposed method has more computational complexity because of subband processing, but the improvement in accuracy compensate this complexity.
机译:使用不同的算法实现声源定位方法。一些方法基于接收的能量对麦克风和一些其他方法估计声源的到达时间差(TDOA)。基于能量的方法具有高计算复杂性,但是在不希望的声学条件下,基于时间延迟估计(TDE)的方法具有低精度。自适应广义特征值分解(GEVD)算法是一种用于TDE的方法,其在嘈杂的条件下具有适当的准确性,但它在嘈杂的混响场景中没有可接受的性能。本文中所提出的方法是基于用于TDE的伽马酸胶滤波器组的子带自适应GEVD。由于语音信号信息在频带中不同,因此必须呈现一种专用于语音信号的低频分量的方法,以具有更好的声源定位精度。在提出的方法中,首先,两个麦克风信号被划分为具有伽马托滤波器组的不同子带。该过滤器银行基于人类听觉设计。然后,在这些子带信息上实现GEVD函数。最后,基于不同子带中的频谱能量,基于不同子带中的频谱能量来加权和组合GEVD函数的输出。关于嘈杂和混响场景的实验表明了与不同场景中的GEVD算法相比的提出方法的优势。尽管所提出的方法由于子带处理而具有更多的计算复杂性,但精度的提高补偿了这种复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号