首页> 外文会议>Conference on multimedia computing and networking >Integrated audiovisual processing for object localization and tracking
【24h】

Integrated audiovisual processing for object localization and tracking

机译:集成的视听处理,用于对象定位和跟踪

获取原文

摘要

Abstract: This paper presents a system that combines audio and visual cues for locating and tracking an object, typically a person, in real time. It is shown that combining a speech source localization algorithm with a video-based head tracking algorithm results in a more accurate and robust tracker than that obtained using any one of the audio or visual modalities. Performance evaluation results are presented with a system that runs in real time on a general purpose processor. The multimodal tracker has several applications such as teleconferencing, multimedia kiosks and interactive games. !26
机译:摘要:本文提出了一种结合音频和视频提示的系统,用于实时定位和跟踪对象(通常是人)。结果表明,语音源定位算法与基于视频的头部跟踪算法相结合所产生的跟踪器比使用任何一种音频或视频模态获得的跟踪器更为准确和健壮。系统将在通用处理器上实时运行的系统上提供性能评估结果。多模式跟踪器具有多种应用程序,例如电话会议,多媒体信息亭和交互式游戏。 !26

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号