首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Bayesian Modeling of Temporal Coherence in Videos for Entity Discovery and Summarization
【24h】

Bayesian Modeling of Temporal Coherence in Videos for Entity Discovery and Summarization

机译:用于实体发现和汇总的视频中时间相干性的贝叶斯建模

获取原文
获取原文并翻译 | 示例
           

摘要

A video is understood by users in terms of entities present in it. Entity Discovery is the task of building appearance model for each entity (e.g., a person), and finding all its occurrences in the video. We represent a video as a sequence of tracklets, each spanning 10-20 frames, and associated with one entity. We pose Entity Discovery as tracklet clustering, and approach it by leveraging Temporal Coherence (TC): the property that temporally neighboring tracklets are likely to be associated with the same entity. Our major contributions are the first Bayesian nonparametric models for TC at tracklet-level. We extend Chinese Restaurant Process (CRP) to TC-CRP, and further to Temporally Coherent Chinese Restaurant Franchise (TC-CRF) to jointly model entities and temporal segments using mixture components and sparse distributions. For discovering persons in TV serial videos without meta-data like scripts, these methods show considerable improvement over state-of-the-art approaches to tracklet clustering in terms of clustering accuracy, cluster purity and entity coverage. The proposed methods can perform online tracklet clustering on streaming videos unlike existing approaches, and can automatically reject false tracklets. Finally we discuss entity-driven video summarization- where temporal segments of the video are selected based on the discovered entities, to create a semantically meaningful summary.
机译:用户根据视频中存在的实体来理解视频。实体发现是为每个实体(例如,一个人)建立外观模型,并查找其在视频中所有出现的任务。我们将视频表示为一系列小轨迹,每个小轨迹跨越10-20帧,并与一个实体相关联。我们将实体发现作为小波聚类,并通过利用时间相干性(TC)进行处理:时间相邻小波很可能与同一实体相关联的属性。我们的主要贡献是在小波级的TC的第一个贝叶斯非参数模型。我们将中餐厅流程(CRP)扩展到TC-CRP,并进一步扩展到临时连贯中餐厅特许经营(TC-CRF),以使用混合成分和稀疏分布共同对实体和时间段进行建模。为了在电视连续视频中发现没有脚本之类的元数据的人,这些方法在聚类精度,聚类纯度和实体覆盖率方面,都比最新的小波聚类方法有了显着改进。与现有方法不同,所提出的方法可以在流视频上执行在线Tracklet聚类,并且可以自动拒绝错误的Tracklet。最后,我们讨论了实体驱动的视频摘要-根据发现的实体选择视频的时间段,以创建语义上有意义的摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号