首页> 外文会议>International conference on multimodal interfaces and workshop on machine learning for multimodal interfaces 2009 >Realtime meeting analysis and 3D meeting viewer based on omnidirectional multimodal sensors
【24h】

Realtime meeting analysis and 3D meeting viewer based on omnidirectional multimodal sensors

机译:基于全向多模态传感器的实时会议分析和3D会议查看器

获取原文

摘要

This demo presents a realtime system for analyzing group meetings. Targeting round-table meetings, this system employs an omnidirectional camera-microphone system. The goal of this system is to automatically discover "who is talking to whom and when". To that purpose, the face pose/position of meeting participants are tracked on panorama images acquired from fisheye-based omnidirectional cameras. From audio signals obtained with microphone array, speaker diarization, i.e. the estimation of "who is speaking and when", is carried out. The visual focus of attention, i.e. "who is looking at whom", is esimated from the result of face tracking. The results are displayed based on a 3D visualization scheme. The advantage of our system is its realtimeness. We will demonstrate the portable version of the system consisting of two laptop PCs. In addition, we will showcase our meeting playback viewer with man-machine interfaces that allow users to freely control space and time of meeting scenes. With this viewer, users can also experince 3D positional sound effect linked with 3D viewpoint, using enhanced audio tracks for each participant.
机译:该演示演示了用于分析小组会议的实时系统。该系统针对圆桌会议,采用了全向摄像头-麦克风系统。该系统的目标是自动发现“谁在和谁说话,何时说话”。为此,在从基于鱼眼的全向摄像机获取的全景图像上跟踪会议参与者的面部姿势/位置。从利用麦克风阵列获得的音频信号中,进行扬声器二值化,即“谁在说话和何时”的估计。从面部跟踪的结果可以模拟出注意力的视觉焦点,即“谁在看谁”。基于3D可视化方案显示结果。我们系统的优势在于它的实时性。我们将演示由两台笔记本电脑组成的系统的便携式版本。此外,我们还将展示带有人机界面的会议回放查看器,允许用户自由控制会议场景的空间和时间。使用此查看器,用户还可以为每个参与者使用增强的音轨,体验与3D视点链接的3D位置声音效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号