首页> 外文学位 >Joint appearance and locality image representation by gaussianization .
【24h】

Joint appearance and locality image representation by gaussianization .

机译:高斯化的联合外观与局部图像表示。

获取原文
获取原文并翻译 | 示例

摘要

A novel image representation is proposed in this thesis to capture both the appearance and locality information for image classification applications. First, we model the feature vectors, from various granularity levels including the corpus level, the image level and image patch level, in a hierarchical Bayesian framework using mixtures of Gaussians. After such a hierarchical Gaussianization, each image is represented as a Gaussian mixture model (GMM) for its appearance, and several Gaussian maps for its spatial layout. Then we extract the appearance information from the GMMparameters, and the locality information from the global and the local statistics over Gaussian maps. Finally, we employ a supervised dimension reduction technique called DAP (discriminant adaptive projection) to remove noise directions and to further enhance the discriminating power of our representation.;To validate the argument that the new representation is a general representation for images and video frames, we evaluate the representation on several important applications. Firstly, we apply the new presentation to classification and regression tasks taking whole images as inputs. These tasks include object recognition, scene category classification, face recognition, age estimation, pose estimation, gender recognition, and video event recognition. Then we test it for the object detection and image parsing tasks, where the new representation takes partial images as inputs. The experimental results show that, for various types of images and tasks, the performances using the proposed representation were the best in all the applications compared with other state-of-the-art algorithms.
机译:本文提出了一种新颖的图像表示方法来捕获图像分类的外观和位置信息。首先,我们在使用高斯混合的层次贝叶斯框架中,从各种粒度级别(包括语料库级别,图像级别和图像补丁级别)对特征向量进行建模。经过这样的分层高斯化后,每个图像的外观都表示为高斯混合模型(GMM),而其空间布局则表示为多个高斯图。然后,我们从GMM参数中提取外观信息,并从高斯地图上的全局和本地统计信息中提取位置信息。最后,我们采用一种称为DAP(判别式自适应投影)的监督降维技术来消除噪声方向并进一步增强表示的区分能力。为了验证新表示是图像和视频帧的一般表示的说法,我们评估了几个重要应用程序上的表示形式。首先,我们将新演示文稿应用于以整个图像为输入的分类和回归任务。这些任务包括对象识别,场景类别分类,面部识别,年龄估计,姿势估计,性别识别和视频事件识别。然后,我们测试它的对象检测和图像解析任务,其中新的表示形式将部分图像作为输入。实验结果表明,与其他最新算法相比,对于各种类型的图像和任务,使用所提出的表示形式的性能在所有应用程序中均为最佳。

著录项

  • 作者

    Zhou, Xi.;

  • 作者单位

    University of Illinois at Urbana-Champaign.;

  • 授予单位 University of Illinois at Urbana-Champaign.;
  • 学科 Engineering Computer.;Computer Science.;Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 119 p.
  • 总页数 119
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号