Learning Hierarchical Feature Extractors For Image Recognition.


Abstract

Telling cow from sheep is effortless for most animals, but requires much engineering for computers. In this thesis, we seek to tease out basic principles that underlie many recent advances in image recognition. First, we recast many methods into a common unsupervised feature extraction framework based on an alternation of coding steps, which encode the input by comparing it with a collection of reference patterns, and pooling steps, which compute an aggregation statistic summarizing the codes within some region of interest of the image. Within that framework, we conduct extensive comparative evaluations of many coding or pooling operators proposed in the literature. Our results demonstrate a robust superiority of sparse coding (which decomposes an input as a linear combination of a few visual words) and max pooling (which summarizes a set of inputs by their maximum value). We also propose macrofeatures, which import into the popular spatial pyramid framework the joint encoding of nearby features commonly practiced in neural networks, and obtain significantly improved image recognition performance. Next, we analyze the statistical properties of max pooling that underlie its better performance, through a simple theoretical model of feature activation. We then present results of experiments that confirm many predictions of the model. Beyond the pooling operator itself, an important parameter is the set of pools over which the summary statistic is computed. We propose locality in feature configuration space as a natural criterion for devising better pools. Finally, we propose ways to make coding faster and more powerful through fast convolutional feedforward architectures, and examine how to incorporate supervision into feature extraction schemes. Overall, our experiments offer insights into what makes current systems work so well, and state-of-the-art results on several image recognition benchmarks.
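The coding/pooling alternation described in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: for brevity it swaps in hard vector quantization (a one-hot nearest-codeword code) instead of sparse coding, and the names `hard_code`, `max_pool`, `average_pool`, the toy `codebook`, and the toy `descriptors` are all hypothetical choices made for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]


def hard_code(x: Vector, codebook: Sequence[Vector]) -> List[float]:
    """Coding step (assumed variant: hard vector quantization).

    Compares the input with every reference pattern in the codebook and
    returns a one-hot code marking the nearest one (squared Euclidean distance).
    """
    dists = [sum((xi - ci) ** 2 for xi, ci in zip(x, c)) for c in codebook]
    nearest = dists.index(min(dists))
    return [1.0 if k == nearest else 0.0 for k in range(len(codebook))]


def max_pool(codes: Sequence[Vector]) -> List[float]:
    """Pooling step: per-codeword maximum over all codes in the region."""
    return [max(column) for column in zip(*codes)]


def average_pool(codes: Sequence[Vector]) -> List[float]:
    """Pooling step: per-codeword mean over all codes in the region."""
    return [sum(column) / len(column) for column in zip(*codes)]


# Toy data (hypothetical): three 2-D reference patterns and four local
# descriptors assumed to come from one region of interest.
codebook = [[0.0, 0.0], [1.0, 1.0], [4.0, 0.0]]
descriptors = [[0.1, -0.1], [0.9, 1.2], [0.0, 0.2], [1.1, 0.8]]

codes = [hard_code(d, codebook) for d in descriptors]
pooled_max = max_pool(codes)      # which codewords occur at least once
pooled_avg = average_pool(codes)  # how often each codeword occurs
```

With one-hot codes, max pooling records only whether a codeword appears anywhere in the region, while average pooling records its relative frequency. Replacing `hard_code` with a sparse coding solver would give real-valued codes, and the two pooling functions would apply unchanged.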


Bibliographic record

Author: Boureau, Y-Lan
Author affiliation: New York University
Degree-granting institution: New York University
Subject: Artificial Intelligence; Computer Science
Degree: Ph.D.
Year: 2012
Pages: 195 p.
Total pages: 195
Original format: PDF
Language of text: English (eng)

Similar documents

1. 3D-SSD: Learning hierarchical features from RGB-D images for amodal 3D object detection [J]. Luo Qianhui, Ma Huifang, Tang Li. Neurocomputing, 2020, Feb. 22 issue.

2. Recognition on Images from Internet Street View Based on Hierarchical Features Learning with CNNs [J]. Jian-min Liu, Min-hua Yang. Journal of Information Technology Research, 2018, No. 3.

3. Extracting features from infrared images using convolutional neural networks and transfer learning [J]. Infrared Physics and Technology, 2020.

4. And-Or Graph Grammar for Architectural Floor Plan Representation, Learning and Recognition. A Semantic, Structural and Hierarchical Model [C]. Lluis-Pere de las Heras, Gemma Sanchez. Pattern Recognition and Image Analysis, 2011.

5. Hierarchical learning of discriminative features and classifiers for large-scale visual recognition [D]. Zhou, Ning. 2014.

6. Development of a Machine Learning Classifier Based on Radiomic Features Extracted From Post-Contrast 3D T1-Weighted MR Images to Distinguish Glioblastoma From Solitary Brain Metastasis [O]. Alix de Causans, Alexandre Carré, Alexandre Roux. 2021.

7. 3D Facial Feature Extraction and Recognition. An investigation of 3D face recognition: correction and normalisation of the facial data, extraction of facial features and classification using machine learning techniques [O]. Al-Qatawneh Sokyna M.S. 2010.

8. Learning Hierarchical Feature Extractors for Image Recognition [R]. Boureau, Y. 2012.
