首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering
【24h】

Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering

机译:基于TF-Wise空间谱聚类的多声源计数与定位

获取原文
获取原文并翻译 | 示例

摘要

This paper addresses the problem of multiple sound source counting and localization in adverse acoustic environments, using microphone array recordings. The proposed time-frequency (TF) wise spatial spectrum clustering based method contains two stages. First, given the received sensor signals, the spatial correlation matrix is computed and denoised in the TF domain. The TF-wise spatial spectrum is estimated based on the signal subspace information, and further enhanced by an exponential transform, which can increase the reliability of the source presence possibility reflected by spatial spectrum. Second, to jointly count and localize sound sources, the enhanced TF-wise spatial spectra are divided into several clusters with each cluster corresponding to one source. Sources are successively detected by searching the significant peaks of the remaining global spatial spectrum, which is formed using unassigned spatial spectra. After each new source detection, spatial spectra are reassigned to detected sources according to the dominance association between them. The interaction between sources is reduced by iteratively performing new source detection and spatial spectrum assignment. Experiments on both simulated data and real-world data demonstrate the superiority of the proposed method for multiple sound source counting and localization in the environment with different levels of noise and reverberation.
机译:本文使用麦克风阵列录音解决了在不利的声学环境中对多个声源进行计数和定位的问题。所提出的基于时频(TF)的空间频谱聚类方法包含两个阶段。首先,给定接收到的传感器信号,在TF域中计算空间相关矩阵并将其去噪。基于信号子空间信息估计TF方向的空间频谱,并通过指数变换进一步增强,这可以提高空间频谱反映的源存在可能性的可靠性。第二,为了共同计数和定位声源,将增强的TF方向空间频谱分为几个簇,每个簇对应一个声源。通过搜索剩余的全局空间光谱的重要峰值来连续检测源,该剩余空间光谱是使用未分配的空间光谱形成的。在每次进行新的源检测之后,根据空间光谱之间的优势关联,将空间光谱重新分配给它们。通过迭代执行新的源检测和空间频谱分配,可以减少源之间的交互。在模拟数据和真实数据上进行的实验表明,该方法在噪声和混响水平不同的环境中进行多声源计数和定位的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号