...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >A graphical model for audiovisual object tracking
【24h】

A graphical model for audiovisual object tracking

机译:用于视听对象跟踪的图形模型

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We present a new approach to modeling and processing multimedia data. This approach is based on graphical models that combine audio and video variables. We demonstrate it by developing a new algorithm for tracking a moving object in a cluttered, noisy scene using two microphones and a camera. Our model uses unobserved variables to describe the data in terms of the process that generates them. It is therefore able to capture and exploit the statistical structure of the audio and video data separately, as well as their mutual dependencies. Model parameters are learned from data via an EM algorithm, and automatic calibration is performed as part of this procedure. Tracking is done by Bayesian inference of the object location from data. We demonstrate successful performance on multimedia clips captured in real world scenarios using off-the-shelf equipment.
机译:我们提出了一种用于建模和处理多媒体数据的新方法。这种方法基于结合了音频和视频变量的图形模型。我们通过开发一种新算法来演示它,该算法使用两个麦克风和一个摄像头在混乱,嘈杂的场景中跟踪运动物体。我们的模型使用不可观察的变量按照生成它们的过程来描述数据。因此,它能够分别捕获和利用音频和视频数据的统计结构及其相互依赖性。通过EM算法从数据中学习模型参数,并在此过程中执行自动校准。跟踪是通过贝叶斯从数据中推断出对象的位置来完成的。我们演示了使用现成的设备在现实场景中捕获的多媒体剪辑上的成功表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号