首页> 外文会议>IEEE SoutheastCon >Computational strategy for accelerating robust sound source detection in dynamic scenes

【24h】

Computational strategy for accelerating robust sound source detection in dynamic scenes

机译：在动态场景中加速鲁棒声源检测的计算策略

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Efficient sound source detection and location with microphone arrays is important for many applications, including teleconferencing, surveillance, and smart rooms. While the steered response power algorithms exhibit robust performance relative to other approaches, their applications are limited by the high computational load required. For dynamic auditory scenes, the entire space must be scanned at regular intervals due to moving sound sources switching between active and inactive states. This paper introduces a time segmentation and parallelization strategy to speed up the steered response power algorithm for dynamic auditory scenes with multiple speech sources. The primary application targeted by this work is for immersive arrays and off-line auditory scene analysis with beamforming for speaker separation in cocktail party environments. Results from a Monte Carlo simulation with 6 speech sources in a mildly reverberant environment demonstrate a speed-up factor of 45, with a modest loss in the number of detections and a significant reduction in anomalous detections. Experimental results with real recordings demonstrate a performance consistent with those of the simulation.

机译：麦克风阵列的高效声源检测和定位对于许多应用（包括电话会议，监视和智能室）很重要。尽管转向响应功率算法相对于其他方法表现出强大的性能，但其应用受到所需的高计算负荷的限制。对于动态听觉场景，由于移动声源在活动状态和非活动状态之间切换，因此必须按规则的间隔扫描整个空间。本文介绍了一种时间分割和并行化策略，以加快具有多个语音源的动态听觉场景的转向响应功率算法。这项工作的主要应用是用于沉浸式阵列和带有波束形成的离线听觉场景分析，用于在鸡尾酒会环境中分离扬声器。在轻微混响环境中使用6个语音源进行的蒙特卡洛模拟的结果表明，加速因子为45，检测次数损失适中，异常检测次数显着减少。带有真实记录的实验结果表明，其性能与模拟结果一致。

著录项

来源
《IEEE SoutheastCon》|2014年|1-8|共8页
会议地点 Lexington KT(US)
作者
Donohue Kevin D.; Griffioen Paul M.;
展开▼
作者单位

Electrical Computer Engineering Department University of Kentucky Lexington USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Acoustic arrays; Acoustics; Estimation; MATLAB; Teleconferencing; Transforms; Vectors; MATLAB; Steered Response Power; cocktail party; microphone arrays; parallel processing; sound source detection;

机译：声学阵列；声学;估计； MATLAB;电话会议；转换；向量; MATLAB;转向响应能力；酒会;麦克风阵列；并行处理;声源检测;

相似文献

外文文献
中文文献
专利

1. Cellular Computations Underlying Detection of Gaps in Sounds and Lateralizing Sound Sources [J] . Oertel Donata, Cao Xiao-Jie, Ison James R., Trends in Neurosciences . 2017,第10期

机译：蜂窝计算潜在地检测声音和横向化声源的间隙
2. A Computational Auditory Scene Analysis-Enhanced Beamforming Approach for Sound Source Separation [J] . L. A. Drake, J. C. Rutledge, J. Zhang, EURASIP journal on advances in signal processing . 2009,第14期

机译：一种计算听觉场景分析增强波束形成的声源分离方法
3. A Computational Auditory Scene Analysis-Enhanced Beamforming Approach for Sound Source Separation [J] . L. A. Drake, J. C. Rutledge, J. Zhang, EURASIP journal on advances in signal processing . 2009,第1期

机译：一种计算听觉场景分析增强波束形成的声源分离方法
4. Computational Auditory Scene Analysis by using statistics of high-dimensional speech dynamics and sound source direction [C] . Johannes Nix, Michael Kleinschmidt, Volker Hohmann, European Conference on Speech Communication and Technology . 2003

机译：使用高维语音动态和声源方向的统计数据计算听觉场景分析
5. Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming. [D] . Drake, Laura Ann. 2001

机译：通过计算听觉场景分析（CASA）增强的波束形成进行声源分离。
6. Cellular Computations Underlying Detection of Gaps in Sounds and Lateralizing Sound Sources [O] . Donata Oertel, Xiao-Jie Cao, James R. Ison, -1

机译：声音中的间隙检测和声源横向化的基础细胞计算
7. Cellular Computations Underlying Detection of Gaps in Sounds and Lateralizing Sound Sources [O] . Donata Oertel, Xiao-Jie Cao, James R. Ison, 2017

机译：蜂窝计算潜在地检测声音和横向化声源的间隙

Computational strategy for accelerating robust sound source detection in dynamic scenes

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅