...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Video classification and retrieval through spatio-temporal Radon features
【24h】

Video classification and retrieval through spatio-temporal Radon features

机译:视频分类和通过时空氡功能检索

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The rise in the availability of video content for access via the Internet and the medium of television has resulted in the development of automatic search procedures to retrieve the desired video. Searches can be simplified and hastened by employing automatic classification of videos. This paper proposes a descriptor called the Spatio-Temporal Histogram of Radon Projections (STHRP) for representing the temporal pattern of the contents of a video and demonstrates its application to video classification and retrieval. The first step in STHRP pattern computation is to represent any video as Three Orthogonal Planes (TOPs), i.e., XY, XT and YT, signifying the spatial and temporal contents. Frames corresponding to each plane are partitioned into overlapping blocks. Radon projections are obtained over these blocks at different orientations, resulting in weighted transform coefficients that are normalized and grouped into bins. Linear Discriminant Analysis (LDA) is performed over these coefficients of the TOPs to arrive at a compact description of STHRP pattern. Compared to existing classification and retrieval approaches, the proposed descriptor is highly robust to translation, rotation and illumination variations in videos. To evaluate the capabilities of the invariant STHRP pattern, we analyse the performance by conducting experiments on the UCF-101, HMDB51, 10contexts and TRECVID data sets for classification and retrieval using a bagged tree model. Experimental evaluation of video classification reveals that STHRP pattern can achieve classification rates of 96.15%, 71.7%, 93.24% and 97.3% for the UCF-101, HMDB51,10contexts and TRECVID 2005 data sets respectively. We conducted retrieval experiments on the TRECVID 2005, JHMDB and 10contexts data sets and the results revealed that STHRP pattern is able to provide the videos relevant to the user's query in minimal time (0.05s) with a good precision rate. (C) 2019 Elsevier Ltd. All rights reserved.
机译:通过Internet和电视媒体访问的视频内容可用性的增加导致自动搜索过程的开发来检索所需的视频。通过使用视频自动分类,可以简化和加速搜索。本文提出了一种称为氡投影的时空直方图(STHRP)的描述符,用于表示视频的内容的时间模式,并演示其在视频分类和检索中的应用。 STHRP模式计算的第一步是将任何视频表示为三个正交平面(上部),即XY,XT和YT,表示空间和时间内容。对应于每个平面的帧被划分为重叠块。在不同取向的这些块上获得氡突起,导致加权变换系数,其被归一化并分组成箱。线性判别分析(LDA)在顶部的这些系数上进行,以确定STHRP图案的紧凑描述。与现有分类和检索方法相比,所提出的描述符是对视频中的翻译,旋转和照明变化的高度稳健。为了评估不变的STHRP模式的能力,我们通过使用袋装树模型进行分类和检索的UCF-101,HMDB51,10Contexts和Trecvid数据集的实验来分析性能。视频分类的实验评估显示,STHRP模式分别可以分别实现UCF-101,HMDB51,10Contexts和Trecvid 2005数据集的96.15%,71.7%,93.24%和97.3%的分类率。我们在TRECVID 2005,JHMDB和10Contexts数据集上进行了检索实验,结果显示,STHRP模式能够以良好的精度率(0.05s)以良好的时间(0.05s)提供与用户查询相关的视频。 (c)2019年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号