Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering

Yang Bing; Liu Hong; Pang Cheng; Li Xiaofei

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering

【24h】

Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering

机译：基于TF-Wise空间谱聚类的多声源计数与定位

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper addresses the problem of multiple sound source counting and localization in adverse acoustic environments, using microphone array recordings. The proposed time-frequency (TF) wise spatial spectrum clustering based method contains two stages. First, given the received sensor signals, the spatial correlation matrix is computed and denoised in the TF domain. The TF-wise spatial spectrum is estimated based on the signal subspace information, and further enhanced by an exponential transform, which can increase the reliability of the source presence possibility reflected by spatial spectrum. Second, to jointly count and localize sound sources, the enhanced TF-wise spatial spectra are divided into several clusters with each cluster corresponding to one source. Sources are successively detected by searching the significant peaks of the remaining global spatial spectrum, which is formed using unassigned spatial spectra. After each new source detection, spatial spectra are reassigned to detected sources according to the dominance association between them. The interaction between sources is reduced by iteratively performing new source detection and spatial spectrum assignment. Experiments on both simulated data and real-world data demonstrate the superiority of the proposed method for multiple sound source counting and localization in the environment with different levels of noise and reverberation.

机译：本文使用麦克风阵列录音解决了在不利的声学环境中对多个声源进行计数和定位的问题。所提出的基于时频（TF）的空间频谱聚类方法包含两个阶段。首先，给定接收到的传感器信号，在TF域中计算空间相关矩阵并将其去噪。基于信号子空间信息估计TF方向的空间频谱，并通过指数变换进一步增强，这可以提高空间频谱反映的源存在可能性的可靠性。第二，为了共同计数和定位声源，将增强的TF方向空间频谱分为几个簇，每个簇对应一个声源。通过搜索剩余的全局空间光谱的重要峰值来连续检测源，该剩余空间光谱是使用未分配的空间光谱形成的。在每次进行新的源检测之后，根据空间光谱之间的优势关联，将空间光谱重新分配给它们。通过迭代执行新的源检测和空间频谱分配，可以减少源之间的交互。在模拟数据和真实数据上进行的实验表明，该方法在噪声和混响水平不同的环境中进行多声源计数和定位的优越性。

著录项

来源
《Audio, Speech, and Language Processing, IEEE/ACM Transactions on》 |2019年第8期|1241-1255|共15页
作者
Yang Bing; Liu Hong; Pang Cheng; Li Xiaofei;
展开▼
作者单位

Peking Univ Shenzhen Grad Sch Key Lab Machine Percept Beijing 100871 Peoples R China;

INRIA Grenoble Rhone Alpes F-38330 Montbonnot St Martin France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Source counting; multiple sound source localization; TF-wise spatial spectrum clustering; signal subspace;

机译：源计数;多声源定位;TF-wise空间频谱聚类;信号子空间;

相似文献

外文文献
中文文献
专利

1. Sound Source Localization Based on SRP-PHAT Spatial Spectrum and Deep Neural Network [J] . Xiaoyan Zhao, Shuwen Chen, Lin Zhou, Computers, Materials & Continua . 2020,第1期

机译：基于SRP-PHAT空间谱和深神经网络的声源定位
2. Sound Source Localization Based on SRP-PHAT Spatial Spectrum and Deep Neural Network [J] . Journal of neurosurgical sciences . 2020,第1期

机译：基于SRP-PHAT空间谱和深神经网络的声源定位
3. Multiple Sound Source Position Estimation by Drone Audition Based on Data Association Between Sound Source Localization and Identification [J] . Wakabayashi Mizuho, Okuno Hiroshi G., Kumon Makoto IEEE Robotics and Automation Letters . 2020,第2期

机译：基于声音源定位与识别之间的数据关联的无人机验光的多声源位置估计
4. Near Field Sound Source Localization Based on Cross-power Spectrum Phase Analysis with Multiple Microphones [C] . Kohei Hayashida, Masanori Morise, Takanobu Nishiura Annual conference of the International Speech Communication Association;INTERSPEECH 2010 . 2011

机译：基于多麦克风跨功率谱相位分析的近场声源定位
5. Robust signal processing techniques for source localization and multisource spatial sound rendering for immersive environments. [D] . Georgiou, Panayiotis G. 2002

机译：强大的信号处理技术，可用于沉浸式环境中的源定位和多源空间声音渲染。
6. Detection and Localization of Early-Stage Multiple Brain Tumors Using a Hybrid Technique of Patch-Based Processing k-means Clustering and Object Counting [O] . Mohamed Nasor, Walid Obaid 2020

机译：使用基于补丁的处理k均值聚类和对象计数的混合技术对早期多发性脑肿瘤进行检测和定位
7. Multiple Sound Source Position Estimation by Drone Audition Based on Data Association Between Sound Source Localization and Identification [O] . Mizuho Wakabayashi, Hiroshi G. Okuno, Makoto Kumon 2020

机译：基于声音源定位与识别之间的数据关联的无人机验光的多声源位置估计

Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅