首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Direction of Arrival Estimation for Reverberant Speech Based on Enhanced Decomposition of the Direct Sound
【24h】

Direction of Arrival Estimation for Reverberant Speech Based on Enhanced Decomposition of the Direct Sound

机译:基于直接声音增强分解的混响语音到达方向估计

获取原文
获取原文并翻译 | 示例
           

摘要

Direction of arrival (DOA) estimation for speech sources is an important task in audio signal processing. This task becomes a challenge in reverberant environments, which are typical to real scenarios. Several methods of DOA estimation for speech sources have been developed recently, in an attempt to overcome the effect of reverberation. One effective approach aims to identify time-frequency bins in the short time Fourier transform domain that are dominated by the direct sound. This approach was shown to be particularly adequate for spherical arrays, with processing in the spherical harmonics domain. The direct-path dominance (DPD) test, and a method which is based on the directivity of the sound field are recent examples. While these methods seem to perform well, high reverberation conditions may degrade their performance. In this paper, the structure of the spatial correlation matrix is comprehensively studied, showing that under some well-defined conditions, the DOA of the direct sound can be correctly extracted from its dominant eigenvector, even when contaminated by reflections. This new insight leads to the development of a new test, performing an enhanced decomposition of the direct sound (EDS), denoted the DPD-EDS test. The proposed test is compared to previous DPD tests, and to other recently proposed reverberation-robust methods, using computer simulations and an experimental study, demonstrating its potential advantage. The studies include multiple speakers in highly reverberant environments, therefore representing challenging real-life acoustics scenes.
机译:语音源的到达方向(DOA)估计是音频信号处理中的重要任务。在真实环境中常见的混响环境中,此任务成为一项挑战。为了克服混响的影响,最近已经开发了几种用于语音源的DOA估计方法。一种有效的方法旨在在短时傅立叶变换域中识别由直接声音主导的时频点。事实证明,这种方法特别适合球形阵列,并在球形谐波域中进行处理。最近的示例是直接路径优势(DPD)测试以及基于声场方向性的方法。尽管这些方法表现良好,但高混响条件可能会降低其性能。在本文中,对空间相关矩阵的结构进行了全面研究,表明在某些明确定义的条件下,即使受到反射污染,直接声音的DOA也可以从其主要特征向量中正确提取。这种新见解导致了新测试的发展,该测试执行了直接声音(EDS)的增强分解,称为DPD-EDS测试。使用计算机仿真和实验研究,将拟议的测试与先前的DPD测试以及最近提出的其他混响鲁棒方法进行了比较,证明了其潜在的优势。这些研究包括在高混响环境中的多个发言人,因此代表了具有挑战性的现实生活中的声学场景。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号