首页> 外文OA文献 >An Analytic Training Approach for Recognition in Still Images and Videos
【2h】

An Analytic Training Approach for Recognition in Still Images and Videos

机译:一种用于静止图像和视频识别的解析训练方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

This dissertation proposes a general framework to efficiently identify the objects of interest (OI) in still images and its application can be further extended to human action recognition in videos. The frameworks utilized in this research to process still images and videos are similar in architecture except they have different content representations. Initially, global level analysis is employed to extract distinctive feature sets from an input data. For the global analysis of data the bidirectional two dimensional principal component analysis (2D-PCA) is employed to preserve correlation amongst neighborhood pixels. Furthermore, to cope with the inherent limitations within the holistic approach local information is introduced into the framework. The local information of OI is identified utilizing FERNS and affine SIFT (ASIFT) approaches for spatial and temporal datasets, respectively. For supportive local information, the feature detection is followed by an effective pruning strategy to divide these features into inliers and outliers. A cluster of inliers represents local features which exhibit stable behavior and geometric consistency. Incremental learning is a significant but often overlooked problem in action recognition. The final part of this dissertation proposes a new action recognition algorithm based on sequential learning and adaptive representation of the human body using Pyramid of Histogram of Oriented Gradients (PHOG) features. The changing shape and appearance of human body parts is tracked based on the weak appearance constancy assumption. The constantly changing shape of an OI is maximally covered by the small blocks to approximate the body contour of a segmented foreground object. In addition, the analytically determined learning phase guarantees lower computational burden for classification. The utilization of a minimum number of video frames in a causal way to recognize an action is also explored in this dissertation. The use of PHOG features adaptively extracted from individual frames allows the recognition of an incoming action video using a small group of frames which eliminates the need of large look-ahead.
机译:本文提出了一种有效识别静止图像中感兴趣对象的通用框架,并将其应用扩展到视频中的人为识别。本研究中用于处理静态图像和视频的框架在架构上相似,只是它们具有不同的内容表示形式。最初,使用全局级别分析从输入数据中提取独特的特征集。对于数据的全局分析,采用双向二维主成分分析(2D-PCA)来保留邻域像素之间的相关性。此外,为了应对整体方法中的固有局限性,将本地信息引入到框架中。分别使用FERNS和仿射SIFT(ASIFT)方法识别OI的本地信息,以用于空间和时间数据集。对于支持性的本地信息,在特征检测之后是有效的修剪策略,可将这些特征分为内部值和离群值。一连串的inliers代表具有稳定行为和几何一致性的局部特征。在动作识别中,增量学习是一个重要的但通常被忽略的问题。本文的最后部分提出了一种新的动作识别算法,该算法基于顺序学习和利用梯度直方图金字塔特征对人体的自适应表示。基于弱外观恒定性假设,可以跟踪人体部位不断变化的形状和外观。小块最大程度地覆盖了OI不断变化的形状,以近似分割前景对象的身体轮廓。此外,通过分析确定的学习阶段可以保证较低的分类计算负担。本文还探讨了以因果关系利用最小数量的视频帧来识别动作的方法。从各个帧中自适应提取的PHOG功能的使用允许使用一小组帧识别传入的动作视频,从而无需大前瞻。

著录项

  • 作者

    Minhas Rashid;

  • 作者单位
  • 年度 2010
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号