Journal: IEEE Journal of Selected Topics in Signal Processing

Acoustic Source Localization Based on Geometric Projection in Reverberant and Noisy Environments

Abstract

Acoustic source localization (ASL) is a fundamental yet still challenging signal processing problem in sound acquisition, speech communication, and human-machine interfaces. Many ASL algorithms have been developed, such as the steered response power (SRP), the SRP-phase transform, the minimum variance distortionless response, the multiple signal classification (MUSIC), and the Householder transform-based methods, to name but a few. Most of those algorithms require hundreds or even thousands of snapshots to produce one reliable estimate, which makes it difficult for them to track moving sources. Moreover, little effort has been reported in the literature to show the intrinsic relationships among those methods. This paper addresses the ASL problem, focusing on how to achieve ASL with a short frame of acoustic signal (corresponding to a single snapshot in the frequency domain). It reformulates the ASL problem from the perspective of geometric projection. Four types of power functions are proposed, leading to several different algorithms for ASL. By analyzing those power functions, we show the equivalence between the popularly used conventional algorithms and our methods, which provides some new insights into the conventional algorithms. The relationships among the different algorithms are discussed, which makes it easy to comprehend the pros and cons of each method. Experiments in real acoustic environments corroborate the theoretical analysis, which in turn justifies the contribution of this paper.
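As a rough illustration of the single-snapshot, projection-based viewpoint described in the abstract, the sketch below scans candidate directions and scores each one by the squared norm of the projection of one frequency-domain snapshot onto the corresponding steering vector. This score coincides with the conventional steered response power at a single frequency; it is not one of the paper's four power functions, which the abstract does not specify. The far-field planar-array model, the speed of sound, and all function names are assumptions made for the example.

```python
import numpy as np

C = 343.0  # assumed speed of sound in m/s


def steering_vector(mic_xy, angle_rad, freq_hz):
    """Far-field, narrowband steering vector for a planar array (assumed model)."""
    direction = np.array([np.cos(angle_rad), np.sin(angle_rad)])
    delays = -(mic_xy @ direction) / C  # relative arrival delays in seconds
    return np.exp(-2j * np.pi * freq_hz * delays)


def projection_power(x, d):
    """Squared norm of the projection of snapshot x onto span{d}: |d^H x|^2 / (d^H d)."""
    return np.abs(np.vdot(d, x)) ** 2 / np.vdot(d, d).real


def localize(x, mic_xy, freq_hz, grid):
    """Return the candidate angle with the largest projection power, plus all powers."""
    powers = np.array(
        [projection_power(x, steering_vector(mic_xy, a, freq_hz)) for a in grid]
    )
    return grid[np.argmax(powers)], powers


# Hypothetical setup: 8 microphones on a circle of radius 5 cm, one 1 kHz snapshot.
theta = 2 * np.pi * np.arange(8) / 8
mics = 0.05 * np.column_stack([np.cos(theta), np.sin(theta)])
grid = np.deg2rad(np.arange(360))
snapshot = (0.7 - 0.2j) * steering_vector(mics, grid[60], 1000.0)  # noise-free source
estimate, powers = localize(snapshot, mics, 1000.0, grid)
```

In this noise-free setup the power peaks at the candidate direction whose steering vector is collinear with the snapshot, which follows from the Cauchy-Schwarz inequality. With noise and reverberation the peak broadens and may shift.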