首页> 外文会议>International Congress and Exposition on Noise Control Engineering >Sound source localization based on Gaussian mixture models using ITD with diffuseness mask in a neck-band microphone array module
【24h】

Sound source localization based on Gaussian mixture models using ITD with diffuseness mask in a neck-band microphone array module

机译:基于高斯混合模型的声源定位在颈带麦克风阵列模块中使用ITD具有扩散掩模的ITD

获取原文

摘要

Sound source localization (SSL) using multiple array microphones is recently required in various fields, especially in wearable devices such as neck-bands that recognize the direction of dangerous sound sources for the hearing impaired. In this paper, we propose an algorithm based on Gaussian mixture models (GMMs) using the interaural time difference (ITD) as a method for sound source localization in neck-band devices. In addition, we apply a diffuseness mask for robust sound source localization in diffuse noise environments. The ITD feature to be learned in GMMs is extracted through generalized cross-correlation with phase transform (GCC-PHAT), and the diffuseness mask is estimated using coherent-to-diffuse power ratio (CDR) based on the spatial coherence between microphone observations to select time-frequency (t-f) bins providing ITDs robust to diffuse noise. Experimental results using real recorded data show that the GMM-based sound source localization with the diffuseness mask is robust in diffuse noise environments.
机译:在各种领域最近需要使用多个阵列麦克风的声源定位(SSL),尤其是可穿戴设备(例如颈带)识别听力障碍的危险声源方向的颈带。在本文中,我们提出了一种基于高斯混合模型(GMMS)的算法,使用互连时间差(ITD)作为颈带设备中声源定位的方法。此外,我们在漫反射噪声环境中应用用于强大声源定位的扩散掩码。通过与相变(GCC-PHAT)的广义交叉相关来提取要在GMMS中学习的ITD特征,并且基于麦克风观察之间的空间相干性,使用相干电力比(CDR)估计扩散掩模。选择时间频率(TF)箱,为ITDS膨胀噪声提供宽度。使用实际记录数据的实验结果表明,基于GMM的声源定位与扩散掩模在漫反射环境中具有稳健性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号