首页> 外文会议>International Conferences on Advances in Multimedia >Video Object Detection by Classification Using String Kernels
【24h】

Video Object Detection by Classification Using String Kernels

机译:视频对象通过使用字符串内核进行分类检测

获取原文

摘要

Video object detection is one of the most important research problems for video event detection, indexing, and retrieval. For a variety of applications such as video surveillance and event annotation, the spatial-temporal boundaries between video objects are required for annotating visual content with high-level semantics. In this paper, we define spatial-temporal sampling as a unified process of extracting video objects and computing their spatial-temporal boundaries using a learnt video object model. We first provide a learning approach to build a class-specific video object model from a set of training video clips. Then the learnt model is used to locate the video objects with precise spatial-temporal boundaries from a test video clip using graph kernels. A frame sorting process as a preprocessing is also proposed to transform the graph, modeling the shot configuration of a video clip, into a string of shots. Thus, the computation of graph kernels is simplified to be string kernels. The string kernels for support vector machine (SVM) classification are finally adopted to train the SVM classifiers from a set of training samples and detect the video objects in a test video clip by classification. A human action detection and recognition system is finally constructed to verify the performance of the proposed method. Experimental results show that the proposed method gives good performance on several publicly available datasets in terms of detection accuracy and recognition rate.
机译:视频对象检测是视频事件检测,索引和检索最重要的研究问题之一。对于视频监控和事件注释等各种应用,视频对象之间的空间 - 时间边界是用高电平语义注释的视觉内容所必需的。在本文中,我们将空间时间采样定义为使用学习视频对象模型提取视频对象并计算其空间 - 时间边界的统一过程。我们首先提供一种从一组训练视频剪辑构建特定于特定视频对象模型的学习方法。然后,学习的模型用于使用图形内核从测试视频剪辑中使用精确的空间 - 时间边界定位视频对象。还提出了一种作为预处理的帧分类过程来转换图形,将视频剪辑的拍摄配置建模为一串拍摄。因此,图形内核的计算被简化为字符串内核。用于支持向量机(SVM)分类的字符串内核将采用从一组培训样本培训SVM分类器,并通过分类检测测试视频剪辑中的视频对象。最终构建人的行动检测和识别系统以验证所提出的方法的性能。实验结果表明,该方法在检测准确性和识别率方面对几个公开的数据集提供了良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号