首页> 外文OA文献 >Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization
【2h】

Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization

机译:利用混响改善小阵列声源定位的距离和高程识别

摘要

Sound source localization (SSL) is an essential task in many applications involving speech capture and enhancement. As such, speaker localization with microphone arrays has received significant research attention. Nevertheless, existing SSL algorithms for small arrays still have two significant limitations: lack of range resolution, and accuracy degradation with increasing reverberation. The latter is natural and expected, given that strong reflections can have amplitudes similar to that of the direct signal, but different directions of arrival. Therefore, correctly modeling the room and compensating for the reflections should reduce the degradation due to reverberation. In this paper, we show a stronger result. If modeled correctly, early reflections can be used to provide more information about the source location than would have been available in an anechoic scenario. The modeling not only compensates for the reverberation, but also significantly increases resolution for range and elevation. Thus, we show that under certain conditions and limitations, reverberation can be used to improve SSL performance. Prior attempts to compensate for reverberation tried to model the room impulse response (RIR). However, RIRs change quickly with speaker position, and are nearly impossible to track accurately. Instead, we build a 3-D model of the room, which we use to predict early reflections, which are then incorporated into the SSL estimation. Simulation results with real and synthetic data show that even a simplistic room model is sufficient to produce significant improvements in range and elevation estimation, tasks which would be very difficult when relying only on direct path signal components.
机译:在涉及语音捕获和增强的许多应用程序中,声源本地化(SSL)是一项必不可少的任务。这样,利用麦克风阵列的扬声器定位已经引起了广泛的研究关注。然而,现有的用于小型阵列的SSL算法仍然存在两个重大局限性:缺乏范围分辨率,以及随着混响的增加而导致精度降低。考虑到强反射的幅度可能类似于直接信号,但到达的方向不同,因此后者是自然且预期的。因此,正确地对房间建模并补偿反射应减少由于混响引起的降级。在本文中,我们显示了更强大的结果。如果建模正确,与无回声场景相比,可以使用早期反射来提供有关源位置的更多信息。建模不仅可以补偿混响,还可以显着提高范围和仰角的分辨率。因此,我们表明在某些条件和限制下,混响可用于提高SSL性能。先前的补偿混响的尝试试图对房间脉冲响应(RIR)进行建模。但是,RIR随扬声器位置的变化而迅速变化,几乎不可能准确跟踪。取而代之的是,我们建立房间的3-D模型,用于预测早期反射,然后将其合并到SSL估计中。真实和合成数据的仿真结果表明,即使是简单的房间模型也足以显着改善距离和仰角估计,而仅依靠直接路径信号分量时,这些任务将非常困难。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号