Beyond SIFT for image classification

机译：超越SIFT进行图像分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In classifying images, scenes or objects, the most popular approach is based on the features extraction-coding-pooling framework allowing to generate discriminative and robust image representations from densely extracted local patches, mainly some SIFT/HOG ones. The majority of the latest research is focused on how to improve successfully these coding and pooling parts. In this work, we show that substantial improvements can be also obtained by coding information closer to the pixel values level in the same way that deep-learning architectures do. We introduce a two layer, stacked, coder-pooler architecture where the first layer is specifically dedicated to extract, from our so-called Differential Vectors (DV) patches, some efficient, local low-level features more discriminative and efficient that their classic handcrafted counterpart. This first layer can advantageously replace any classic dense SIFT/HOG patches extraction stage. We demonstrate the effectiveness of our approach on three datasets: UIUC-Sports, Scene 15 and Caltech 101. We achieve excellent performances with simple linear classification while using basic coding and pooling schemes for both layers, i.e. Sparse Coding (SC) and Max-Pooling (MP) respectively.

机译：在对图像，场景或对象进行分类时，最流行的方法是基于特征提取-编码-合并框架，该框架允许从密集提取的局部斑块（主要是一些SIFT / HOG斑块）中生成具有判别力和鲁棒性的图像表示形式。最新研究的大部分集中在如何成功地改进这些编码和合并部分上。在这项工作中，我们表明，以与深度学习体系结构相同的方式，通过编码更接近像素值级别的信息也可以获得实质性的改进。我们引入了两层堆叠的编码器-池结构，其中第一层专门用于从所谓的“差分矢量”（DV）补丁中提取一些比传统的手工制作更具歧视性和效率的高效本地低层功能。对应。该第一层可以有利地代替任何经典的密集SIFT / HOG补丁提取阶段。我们在三个数据集上展示了我们的方法的有效性：UIUC-Sports，Scene 15和Caltech101。我们通过简单的线性分类实现了出色的性能，同时在两个层上都使用了基本的编码和池化方案，即稀疏编码（SC）和最大池化（MP）。

著录项

来源
《International Conference on Computer Vision Theory and Applications》||542-548|共7页
会议地点
作者
Paris Sebastien; Halkias Xanadu; Glotin Herve;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Computer architecture; Dictionaries; Encoding; Feature extraction; Image coding; Robustness; Semantics; Dictionary Learning; Differential Vectors; Max-pooling; Scenes Categorization; Sparse Coding; Sparse-coding;

机译：计算机体系结构;词典;编码;特征提取;图像编码;稳健性;语义;词典学习;差分向量;最大池;场景分类;稀疏编码;稀疏编码;

相似文献

外文文献
中文文献
专利

1. SiftingGAN: Generating and Sifting Labeled Samples to Improve the Remote Sensing Image Scene Classification Baseline In Vitro [J] . Ma Dongao, Tang Ping, Zhao Lijun IEEE Geoscience and Remote Sensing Letters . 2019,第7期

机译：SiftingGAN：生成和筛选带标签的样本以改善遥感影像场景分类基准体外
2. Classification of Hematoxylin and Eosin Images Using Local Binary Patterns and 1-D SIFT Algorithm [J] . O??uz O??uzhan, ??etin A. Enis, Atalay Rengul ??etin Proceedings . 2018,第2期

机译：使用局部二元模式和一维SIFT算法对苏木和曙红图像进行分类
3. Application of Sparse Coded SIFT Features for Classification of Plant Images [J] . Suchit Purohit, Savita R. Gandhi International Journal of Image, Graphics and Signal Processing . 2017,第10期

机译：稀疏编码SIFT特征在植物图像分类中的应用
4. Brain tumor detection and classification using SIFT in MRI images [C] . Mohammed Sahib Mahdi Altaei, Sura Yarub Kamil International Conference on Sustainable Manufacturing, Materials and Technologies . 2020

机译：脑肿瘤检测与分类使用SIFT在MRI图像中
5. Image classification with dense sift sampling: An exploration of optimal parameters. [D] . Chavez, Aaron J. 2012

机译：密集筛分图像分类：最佳参数的探索。
6. Towards the automated localisation of targets in rapid image-sifting by collaborative brain-computer interfaces [O] . Ana Matran-Fernandez, Riccardo Poli -1

机译：通过协作式人机界面实现快速图像筛选中目标的自动定位
7. Speech classification using SIFT features on spectrogram images [O] . Quang Trung Nguyen, The Duy Bui 2016

机译：在频谱图图像上使用SIFT功能进行语音分类
8. Improved Quality of Reconstructed Images Through Sifting of Data in StatisticalImage Reconstruction [R] . Stoudt, C. A. 1993

机译：通过统计图像重建中的数据筛选提高重建图像质量

Beyond SIFT for image classification

摘要

著录项

相似文献

相关主题

期刊订阅