首页> 外文学位 >Towards a local-global visual feature-based framework for recognition.
【24h】

Towards a local-global visual feature-based framework for recognition.

机译:迈向基于局部全局视觉特征的识别框架。

获取原文
获取原文并翻译 | 示例

摘要

General object and activity recognition is a fundamental problem in computer vision, which has been the subject of much research. Traditional approaches include model-based and appearance template-based methods. Recently, inspired by methods from the text retrieval literature, local visual feature-based models have shown a lot of success for recognition of objects or activities with large within-class geometric variability.;There are several challenges in this approach, namely feature selection and target modeling using these features. This thesis proposes a local-global visual feature-based framework for general object and activity recognition with novel methods for these problems: (1) Combinatorial and statistical methods for selecting informative parts to build statistical models for part-based object recognition. First a combinatorial optimization formulation is used for clustering on a weighted multipartite graph. Second, a statistical method for selecting discriminative parts from positive images is used to localize objects. (2) An entropy based vocabulary selection method for "bag-of-words" models for activity recognition. (3) Integrating both spatial and temporal information with appearance features for human activity recognition. This method models the human motions with the distribution of local motion features and their spatial-temporal arrangements.;The effectiveness of the proposed methods is demonstrated by several object recognition and activity recognition data sets, which include human facial expressions and hand gestures, etc.;This thesis also covers an interesting project regarding a framework of applying Discrete Fourier Transform to detect salient regions in images and video sequences. This framework generalizes the previous saliency detection methods and can be applied for saliency detection in the video sequences.
机译:通用对象和活动识别是计算机视觉中的一个基本问题,已经成为许多研究的主题。传统方法包括基于模型的方法和基于外观模板的方法。最近,受文本检索文献的方法启发,基于局部视觉特征的模型在识别具有较大类内几何可变性的对象或活动方面已显示出许多成功。这种方法存在一些挑战,即特征选择和特征选择。使用这些功能进行目标建模。本文提出了一种基于局部全局视觉特征的通用对象和活动识别框架,并针对这些问题提出了新颖的方法:(1)选择组合信息和统计方法,建立基于零件的信息识别统计模型。首先,使用组合优化公式在加权多部分图上进行聚类。其次,使用一种从正像中选择区分部分的统计方法来定位对象。 (2)用于活动识别的“词袋”模型的基于熵的词汇选择方法。 (3)将时空信息与外观特征相结合,以进行人类活动识别。该方法利用局部运动特征的分布及其时空排列对人体运动进行建模。;该方法的有效性通过多种物体识别和活动识别数据集得到了证明,其中包括人类的面部表情和手势等。 ;本文还涉及一个有趣的项目,该项目涉及应用离散傅立叶变换来检测图像和视频序列中的显着区域的框架。该框架概括了先前的显着性检测方法,可用于视频序列中的显着性检测。

著录项

  • 作者

    Zhao, Zhipeng.;

  • 作者单位

    Rutgers The State University of New Jersey - New Brunswick.;

  • 授予单位 Rutgers The State University of New Jersey - New Brunswick.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 119 p.
  • 总页数 119
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号