首页> 外文OA文献 >Biologically inspired binaural sound source localization and tracking for mobile robots
【2h】

Biologically inspired binaural sound source localization and tracking for mobile robots

机译:受生物启发的双耳声源定位和移动机器人跟踪

摘要

This thesis proposes biologically inspired methods of binaural sound source localization for mobile robots. We also propose a method for modulating a robot's attention inspired from the barn owl and last, a tracking system which makes it possible for a robot to track objects emitting sounds. Regarding sound source localization, the method that was best understood and evaluated is a method based on the evaluation of interaural time differences (ITDs). There is a simple reason for this state of affairs. Interaural time differences are influenced mainly by the inter-microphone distance, provided there is no major obstruction between them. This would make the sound waves bend around the structure and thus increase path length and ITD in a frequency-specific manner. With no obstruction between the microphones and under the far-field assumption, the interaural time difference relates to azimuth through a simple equation, where only inter-microphone distance (constant) and speed of sound (can be regarded constant) are required additionally. Under these conditions, it is easy to adapt ITD localization to different hardware platforms. The method we use for ITD based localization relies on detecting phase coincidence for individual frequencies in the frequency domain and subsequent frequency integration to eliminate phase ambiguities. Overall, the results are excellent. Broadband signals can be localized with an accuracy of ±2°. Localization of pure tones is erratic, as was to be expected. The only unexpected behavior was a low accuracy in localizing 100 Hz – 1 kHz bandpass noise. Simulations in which the room acoustics could be controlled showed that this is caused by sound reflections from the environment. In larger rooms or, equivalently, rooms with a lower direct-to-reverberant ratio, localization precision of broadband signals also degrades significantly, which becomes evident in experiments on a real robot. All in all, care has to be taken as to the acoustic environment in which the ITD based localization is deployed, in order to achieve best performance. Interaural level differences based sound source localization relies on the acoustical properties of the microphone mount assembly and supporting structures. This means that adapting ILD localization to a new platform is more difficult. It requires mounting the microphones and then calibrating the whole setup to record the resulting azimuth/elevation/frequency dependent ILD values, which can then be used by the sound source localization algorithm. This is a quite elaborate, time-consuming procedure which has to be repeated every time something changes in the way the microphones are mounted - or if the microphones themselves are changed. Experiments with artificial owl ruffs illustrate this: even small changes in the ruff can have a huge impact on the ILDs (and, to a lesser degree, on the ITDs). The method for ILD based sound source localization relies on a neuronal model of the barn owl's auditory intensity pathway. Specifically, the neuronal responses in the VLVp and the ICc ls as well as the connections between these areas are modeled. The results of the experiments with the algorithm are encouraging. First tests showed that the system was able to accurately localize broadband sound sources in the range of -30°...+30°. More elaborate artificial ruffs experiments confirmed these results. Furthermore, with the correct acoustic design of the artificial ruff, it is possible to use the ILDs for various purposes as for example localization in elevation and/or verification/correction of the ITD based azimuth estimates. With the attentional module based on a neuronal saliency map it is possible to preactivate a robot's attention to a specific region of interest. It was possible to successfully reproduce with a robotic pan-tilt unit attentional latency experiments that were performed with barn owls. But the system we propose can easily be generalized to modulate (in several instances) the attention of the robot at various levels, from basic sensor level up to planning level. The Markov chain Monte Carlo based combined sound source and dynamic object tracking had a few problems accurately tracking simulated entities. Although the general viability of the method could be shown, the algorithm still has several shortcomings. MCMCDA with a virtual sensor is able to correctly track sound sources and objects alone, but the combination of both modalities in one track proved to be difficult. As long as individual entities are in clearly distinct positions, correct tracks are produced, but if they approach each other or - even worse - cross paths, tracking breaks down. This seems to be caused mainly by the lack of distance information in the sound source localization modality. As long as these shortcomings are not addressed, it makes little sense to test the method on a real robot. This is why the MCMCDA experiments in this thesis were limited to simulations.
机译:本文提出了生物启发的移动机器人双耳声源定位方法。我们还提出了一种用于调制受仓from启发的机器人注意力的方法,最后是一种跟踪系统,该跟踪系统使机器人可以跟踪发出声音的对象。关于声源定位,最好地理解和评估的方法是基于对耳间时间差(ITD)进行评估的方法。这种情况有一个简单的原因。耳间时间差主要受麦克风之间的距离影响,前提是它们之间没有大的障碍。这将使声波在结构周围弯曲,从而以特定于频率的方式增加路径长度和ITD。在麦克风之间没有障碍物且在远场假设下,耳间时间差通过一个简单的方程式与方位角有关,其中仅需要额外的麦克风间距离(恒定)和声速(可以视为恒定)。在这种情况下,很容易使ITD本地化适应不同的硬件平台。我们用于基于ITD的定位的方法依赖于检测频域中各个频率的相位重合以及随后的频率积分以消除相位模糊性。总体而言,结果非常好。宽带信号的定位精度为±2°。可以预期,纯音的本地化是不稳定的。唯一的意外行为是定位100 Hz – 1 kHz带通噪声时的精度较低。可以控制室内声学的模拟表明,这是由环境中的声音反射引起的。在较大的房间中,或等效地,在具有较低的直接混响比的房间中,宽带信号的定位精度也会显着降低,这在实际机器人上进行的实验中很明显。总而言之,必须注意在其中部署基于ITD的本地化的声学环境,以获得最佳性能。基于听觉水平差异的声源定位取决于麦克风支架组件和支撑结构的声学特性。这意味着使ILD本地化适应新平台更加困难。它需要安装麦克风,然后校准整个设置,以记录最终的方位角/仰角/频率相关的ILD值,然后可将其用于声源定位算法。这是一个非常复杂,耗时的过程,每当麦克风的安装方式发生变化或麦克风本身发生变化时,都必须重复执行此过程。人造猫头鹰r的实验说明了这一点:即使uff的微小变化也会对ILD(对ITD产生较小的影响)产生巨大影响。基于ILD的声源定位方法依赖于仓the的听觉强度通路的神经元模型。具体而言,对VLVp和ICcls中的神经元反应以及这些区域之间的联系进行了建模。该算法的实验结果令人鼓舞。初步测试表明,该系统能够准确定位-30°... + 30°范围内的宽带声源。更详尽的人造衣领实验证实了这些结果。此外,利用人造颈巾的正确的声学设计,可以将ILD用于各种目的,例如基于ITD的方位角估计的仰角定位和/或验证/校正。使用基于神经元显着性图的注意力模块,可以预先激活机器人对特定兴趣区域的注意力。可以使用机器人横摇倾斜装置成功地复制对仓n进行的注意力潜伏期实验。但是我们提出的系统可以很容易地推广到各个级别,从基本传感器级别到计划级别,以调制(在某些情况下)机器人的注意力。基于马尔可夫链蒙特卡罗的组合声源和动态对象跟踪存在一些问题,无法准确跟踪模拟实体。尽管可以显示该方法的一般可行性,但是该算法仍存在一些缺点。带有虚拟传感器的MCMCDA能够单独正确地跟踪声源和物体,但是事实证明,将两种模态组合在一条轨道上非常困难。只要各个实体处于明显不同的位置,就会产生正确的轨迹,但是如果它们彼此靠近甚至(甚至更糟)是交叉路径,则会破坏轨迹。这似乎主要是由于声源定位方式中距离信息的缺乏。只要不解决这些缺点,在实际的机器人上测试该方法就没有意义。这就是为什么本文中的MCMCDA实验仅限于模拟的原因。

著录项

  • 作者

    Calmes Laurent;

  • 作者单位
  • 年度 2009
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号