Video Object Detection by Classification Using String Kernels


Abstract

Video object detection is one of the most important research problems in video event detection, indexing, and retrieval. Many applications, such as video surveillance and event annotation, require the spatial-temporal boundaries between video objects in order to annotate visual content with high-level semantics. In this paper, we define spatial-temporal sampling as a unified process of extracting video objects and computing their spatial-temporal boundaries using a learnt video object model. We first provide a learning approach that builds a class-specific video object model from a set of training video clips. The learnt model is then used to locate the video objects, with precise spatial-temporal boundaries, in a test video clip using graph kernels. A frame-sorting preprocessing step is also proposed to transform the graph that models the shot configuration of a video clip into a string of shots. This reduces the computation of graph kernels to that of string kernels. The string kernels are then used to train support vector machine (SVM) classifiers from a set of training samples and to detect the video objects in a test video clip by classification. Finally, a human action detection and recognition system is constructed to verify the performance of the proposed method. Experimental results show that the proposed method performs well on several publicly available datasets in terms of detection accuracy and recognition rate.
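As a rough illustration of how a string kernel can compare two clips once each has been reduced to a string of shots, the sketch below implements a p-spectrum kernel, a common and simple string kernel. The abstract does not say which string kernel the authors use, so this choice is an assumption, as are the single-letter shot labels, the example strings, and every function name.

```python
from collections import Counter
from math import sqrt


def spectrum_features(s, p=2):
    """Count every contiguous substring of length p in s."""
    return Counter(s[i:i + p] for i in range(len(s) - p + 1))


def spectrum_kernel(s, t, p=2):
    """Inner product of the p-gram count vectors of s and t."""
    fs, ft = spectrum_features(s, p), spectrum_features(t, p)
    return sum(count * ft[gram] for gram, count in fs.items())


def normalized_kernel(s, t, p=2):
    """Cosine-normalized spectrum kernel, so that k(s, s) == 1."""
    denom = sqrt(spectrum_kernel(s, s, p) * spectrum_kernel(t, t, p))
    return spectrum_kernel(s, t, p) / denom if denom else 0.0


def gram_matrix(strings, p=2):
    """Pairwise kernel values for a list of shot strings."""
    return [[normalized_kernel(a, b, p) for b in strings] for a in strings]


# Hypothetical shot strings: each letter stands for one shot type.
clips = ["AABBC", "AABBD", "CDCDC"]
K = gram_matrix(clips)
print(K[0][1])  # 0.75: the first two clips share three of four 2-grams
print(K[0][2])  # 0.0: no 2-gram in common
```

A Gram matrix of this kind is the form that SVM implementations accepting a precomputed kernel expect (for example, scikit-learn's `SVC(kernel="precomputed")`), which is one way the classification step described in the abstract could be reproduced.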


Bibliographic details

Source
International Conferences on Advances in Multimedia | 2013 | 6 pages
Authors
Wan-Hsuan Yu; Chi-Han Chuang; Shyi-Chyi Cheng
Original format: PDF
Chinese Library Classification: TP37-53
Keywords
Video objects; String kernels; Dynamic programming; Video object modeling; SVM classification

Date indexed: 2022-08-20 22:18:27

Similar literature

1. Video event classification using string kernels [J]. Lamberto Ballan, Marco Bertini, Alberto Del Bimbo. Multimedia Tools and Applications. 2010, No. 1

2. Video Stream Analysis in Clouds: An Object Detection and Classification Framework for High Performance Video Analytics [J]. Anjum Ashiq, Abdullah Tariq, Tariq M. Fahim. Cloud Computing, IEEE Transactions on. 2019, No. 4

3. Detection and classification of wipe transitions in sport videos in presence of object motion [J]. Salim Chavan, M Narayana, L Koteswara Rao. International Journal of Engineering & Technology. 2018, No. 2

4. Video Object Detection by Classification Using String Kernels [C]. Wan-Hsuan Yu, Chi-Han Chuang, Shyi-Chyi Cheng. International Conferences on Advances in Multimedia. 2013

5. High Performance Video Stream Analytics System for Object Detection and Classification [D]. Yaseen, Muhammad Usman. 2021

6. Video based object representation and classification using multiple covariance matrices [O]. Yurong Zhang, Quan Liu

7. Learning and Classification of car trajectories in road video by string kernels [O]. Brun, Luc; Saggese, Alessia; Vento, Mario. 2013
