首页> 外文期刊>EURASIP journal on advances in signal processing >Audiovisual Head Orientation Estimation with Particle Filtering in Multisensor Scenarios
【24h】

Audiovisual Head Orientation Estimation with Particle Filtering in Multisensor Scenarios

机译:多传感器场景中带粒子滤波的视听头部定位估计

获取原文
获取原文并翻译 | 示例
           

摘要

This article presents a multimodal approach to head pose estimation of individuals in environments equipped with multiple cameras and microphones, such as SmartRooms or automatic video conferencing. Determining the individuals head orientation is the basis for many forms of more sophisticated interactions between humans and technical devices and can also be used for automatic sensor selection (camera, microphone) in communications or video surveillance systems. The use of particle filters as a unified framework for the estimation of the head orientation for both monomodal and multimodal cases is proposed. In video, we estimate head orientation from color information by exploiting spatial redundancy among cameras. Audio information is processed to estimate the direction of the voice produced by a speaker making use of the directivity characteristics of the head radiation pattern. Furthermore, two different particle filter multimodal information fusion schemes for combining the audio and video streams are analyzed in terms of accuracy and robustness. In the first one, fusion is performed at a decision level by combining each monomodal head pose estimation, while the second one uses a joint estimation system combining information at data level. Experimental results conducted over the CLEAR 2006 evaluation database are reported and the comparison of the proposed multimodal head pose estimation algorithms with the reference monomodal approaches proves the effectiveness of the proposed approach.
机译:本文介绍了一种多模态方法,用于在配备有多个摄像头和麦克风的环境中(例如SmartRooms或自动视频会议)估算个人的头部姿势。确定个人的头部方位是人与技术设备之间许多形式的更复杂交互的基础,并且还可用于通信或视频监视系统中的自动传感器选择(摄像头,麦克风)。提出了使用粒子滤波器作为统一框架,用于估计单峰和多峰情况下的头部方向。在视频中,我们通过利用摄像机之间的空间冗余从颜色信息估计头部的方向。利用头部辐射图的方向性特性,对音频信息进行处理以估计扬声器产生的语音方向。此外,从准确性和鲁棒性方面分析了两种不同的组合音频和视频流的粒子滤波器多模式信息融合方案。在第一个中,通过组合每个单峰头部姿态估计在决策级别执行融合,而第二个使用联合估计系统在数据级别组合信息。报告了在CLEAR 2006评估数据库上进行的实验结果,所提出的多峰头部姿态估计算法与参考单峰方法的比较证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号