首页> 外文会议>ACM multimedia conference >Ensemble Multi-Instance Multi-Label Learning Approach for Video Annotation Task
【24h】

Ensemble Multi-Instance Multi-Label Learning Approach for Video Annotation Task

机译:集成视频注释任务的多实例多标签学习方法

获取原文

摘要

Automatic video annotation is an important ingredient for video indexing, browsing, and retrieval. Traditional studies represent one video clip with a flat feature vector; however, video data usually has natural structure. Moreover, a video clip is generally relevant to multiple concepts. Indeed, the video annotation task is inherently a Multi-Instance Multi-Label (MIML) learning problem. In this paper, we propose the En-MIMLSVM approach for the video annotation task. It considers the class imbalance and long time training problems of most video annotation tasks. In addition, a temporally consistent weighted multi-instance kernel is developed to take into account both the temporal consistency in video data and the significance of instances of different levels in pyramid representation. The En-MIMLSVM is evaluated on TRECVID 2005 data set, and the results show that it outperforms several state-of-the-art methods.
机译:自动视频注释是视频索引,浏览和检索的重要成分。传统研究代表一个具有平坦特征向量的视频剪辑;但是,视频数据通常具有自然结构。此外,视频剪辑通常与多个概念相关。实际上,视频注释任务本质上是一个多实例多标签(MIML)学习问题。在本文中,我们提出了用于视频注释任务的en-mimlsvm方法。它考虑了大多数视频注释任务的类别不平衡和长期训练问题。另外,开发了一个时间上一致的加权多实例内核,以考虑视频数据中的时间一致性以及金字塔表示中不同级别的实例的重要性。在Trecvid 2005数据集上评估EN-MIMLSVM,结果表明它优于几种最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号