首页> 外文会议>International Workshop of Physical Agents >Attentional Mechanism Based on a Microphone Array for Embedded Devices and a Single Camera
【24h】

Attentional Mechanism Based on a Microphone Array for Embedded Devices and a Single Camera

机译:基于嵌入式设备和单个摄像机的麦克风阵列的注意机制

获取原文

摘要

This work presents an attentional mechanism with the capability of detecting the localization of a speaker for interaction purposes, based on audio and video information. The localization is computed in terms of azimuth and elevation angles, to be used as input values for controlling mobile systems such as a pan-tilt videocamera or a robotic head. For this purpose the SRP-PHAT AT algorithm has been implemented with a commercial array of microphones for embedded devices, in order to estimate the localization of a sound source in the surroundings of the array. In order to improve the limitations of the SRP-PHAT algorithm in the estimation of the z coordinate, the elevation angle is corrected via video information by using Haar cascade classifiers for face detection. Simulations and experiments show the accuracy of the system, as well as the application for controlling a pan-tilt videocamera in a real scenario with speakers and ambient noise.
机译:该工作提出了一种注意力机制,其能够基于音频和视频信息来检测扬声器的定位以进行互动目的的能力。 本地化在方位角和高度角度计算,以用作控制诸如Pan-Tilt Videocamera或机器人头的移动系统的输入值。 为此目的,算法的SRP-Phat已经用商业麦克风阵列用于嵌入式设备来实现,以估计阵列周围环境中的声源的定位。 为了提高SRP-PHAT算法在Z坐标的估计中的局限下,通过使用哈尔级联分类器来校正仰角,用于面部检测。 仿真和实验表明了系统的准确性,以及在具有扬声器和环境噪声的真实场景中控制泛倾斜录像机的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号