首页> 外文会议>International Workshop on Machine Learning for Multimodal Interaction >Estimating the Lecturer’s Head Pose in Seminar Scenarios - A Multi-view Approach
【24h】

Estimating the Lecturer’s Head Pose in Seminar Scenarios - A Multi-view Approach

机译:估计讲师的头部姿势在研讨会场景中 - 一种多视图方法

获取原文

摘要

In this paper, we present a system to track the horizontal head orientation of a lecturer in a smart seminar room, which is equipped with several cameras. We automatically detect and track the face of the lecturer and use neural networks to classify his or her face orientation in each camera view. By combining the single estimates of the speaker’s head orientation from multiple cameras into one joint hypothesis, we improve overall head pose estimation accuracy. We conducted experiments on annotated recordings from real seminars. Using the proposed fully automatic system we are able to correctly determine the lecturer’s head pose in 59% of the time and for 8 orientation classes. In 92% of the time, the correct pose class or a neighbouring pose class (i.e. a 45 degree error) were estimated.
机译:在本文中,我们提出了一种在智能研讨会上跟踪讲师的水平头定向的系统,该系统配备有多个摄像头。我们自动检测并跟踪讲师的面部,并使用神经网络在每个相机视图中对他或她的脸部取向进行分类。通过将扬声器的头向从多个相机的头向估计结合到一个联合假设中,我们提高了整体头部姿态估计精度。我们对真实研讨会的注释录音进行了实验。使用所提出的全自动系统,我们能够在59%的时间和8个方向课程中正确地确定讲师的头部姿势。在92%的时间内,估计正确的姿势类别或相邻的姿势类(即45度误差)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号