首页> 外文会议>European Signal Processing Conference >Blind spatial sound source clustering and activity detection using uncalibrated microphone array

【24h】

Blind spatial sound source clustering and activity detection using uncalibrated microphone array

机译：使用未校准的麦克风阵列进行盲空间声源聚类和活动检测

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a method for estimating the number, as well as the activity periods of spatially distributed sound sources using an uncalibrated microphone array. This methodology is applied for the purposes of speaker diarization. In general, speaker diarization has difficulty with: 1) estimating the number of sound sources (speakers), and 2) activity detection of multiple sound sources including overlap of utterances. Several microphone array based techniques have already tackled these challenges. However, existing methods mainly assume that the steering vectors for the microphone array are calibrated in advance to identify sound sources, which is difficult to satisfy when ad-hoc or flexible microphone arrays are used. Thus our approach estimates the number of sound sources blindly in two steps. First, Time Delay of Arrival (TDOA) of the observed signal is clustered. Second, the sound source activity is detected by clustering the long-term spatial spectrum using the TDOA based steering vector for each cluster. The validity of the algorithm is confirmed by both synthesized signals and a real-world flexible microphone array application.

机译：本文提出了一种使用未校准的麦克风阵列估计数量以及空间分布声源的活动时间的方法。此方法适用于说话人二分法的目的。通常，说话人区分存在以下困难：1）估计声源（说话者）的数量，以及2）多个声源的活动检测，包括话语重叠。几种基于麦克风阵列的技术已经解决了这些挑战。然而，现有方法主要假设用于麦克风阵列的转向矢量被预先校准以识别声源，这在使用临时或柔性麦克风阵列时难以满足。因此，我们的方法分两步盲目估算声源的数量。首先，将观测信号的到达时延（TDOA）进行聚类。其次，通过为每个群集使用基于TDOA的导向向量对长期空间频谱进行群集，来检测声源活动。该算法的有效性由合成信号和现实世界的灵活麦克风阵列应用程序共同证实。

著录项

来源
《European Signal Processing Conference》|2017年|2438-2442|共5页
会议地点
作者
Keisuke Nakamura; Takeshi Mizumoto;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Microphone arrays; Estimation; Robots; Histograms; Reverberation; Robustness;

机译：麦克风阵列;估计;机器人;直方图;混响;稳健性;

相似文献

外文文献
中文文献
专利

1. Extraction of multiple sound sources using a two-dimensional microphone array system for near-field based on blind deconvolution [J] . Yoshifumi Chisaki, Tsuyoshi Eiza, Ayumi Hashimoto, 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics . 2002,第322期

机译：基于盲反卷积的二维麦克风阵列系统近场提取多个声源
2. Extraction of multiple sound sources using a two-dimensional microphone array system for near-field based on blind deconvolution [J] . Yoshifumi Chisaki, Tsuyoshi Eiza, Ayumi Hashimoto, 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics . 2002,第322期

机译：基于盲反卷积的使用二维麦克风阵列系统，用于近场多个声源提取
3. An evaluation of low-power microphone array sound source localization for deforestation detection [J] . Petrica Lucian Applied Acoustics . 2016,第deca期

机译：用于毁林检测的低功率麦克风阵列声源定位评估
4. Blind spatial sound source clustering and activity detection using uncalibrated microphone array [C] . Keisuke Nakamura, Takeshi Mizumoto European Signal Processing Conference . 2017

机译：使用未校准麦克风阵列的盲空间声源聚类和活动检测
5. Development and use of a spherical microphone array for measurement of spatial properties of reverberant sound fields. [D] . Gover, Bradford Noel. 2002

机译：球形麦克风阵列的开发和使用，用于测量混响声场的空间特性。
6. Design of UAV-Embedded Microphone Array System for Sound Source Localization in Outdoor Environments [O] . Kotaro Hoshiba, Kai Washizaki, Mizuho Wakabayashi, 2017

机译：用于室外环境声源定位的无人机嵌入式麦克风阵列系统设计
7. ROBUST TRACKING OF MULTIPLE SOUND SOURCES BY SPATIAL INTEGRATION OF ROOM AND ROBOT MICROPHONE ARRAYS [O] . Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, 2008

机译：通过机房和机器人麦克风阵列的空间集成实现多声源的鲁棒跟踪
8. Blind Adaptive Dereverberation of Speech Signals Using a Microphone Array [R] . Bakir, T. S. , Mersereau, R. M. 2003

机译：使用麦克风阵列进行语音信号的盲自适应去混响

Blind spatial sound source clustering and activity detection using uncalibrated microphone array

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅