A scene recognition method using sparse features with layout-sensitive pooling and extreme learning machine

机译：一种基于稀疏特征的场景识别方法及布局敏感池和极限学习机

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Scene recognition aims to find a semantic explanation of a scene, i.e., it helps intelligent machines to know where they are. It can be widely applied into various tasks in computer vision and robotics. Most of pioneer methods extracted a set of low-level features and put them into classifier directly to identify scene category. But it has been proved that low-level features do not work well. Currently researchers aim to overcome the semantic gap between the low-level vision features and high-level semantic categories to improve the recognition performance. Therefore, much attention has been put on transforming low-level descriptors into richer intermediate representations. This paper proposed a novel method based on intermediate feature representation to solve the problem of recognizing the semantic category of scene image. This proposed method uses sparse coding on SIFT features and presents a spatial layout sensitive pooling method. The space layout for pooling is based on three rectangles with size of 1*1,1*4 and 4*1 in each image. They are derived from inherent characteristics of the scene images by regularly dividing the image in horizontal and vertical direction. This spatial pooling strategy is easier and it can get optimal representation of scene images. Extreme learning machine (ELM) is used as a classifier. ELM has shown great ability to fit nonlinear classification boundaries. Experimental results have shown that this proposed method not only extracts lower dimension image feature but also outperforms other similar state-of-the-art methods in terms of recognition performance.

机译：场景识别旨在找到场景的语义解释，即，它可以帮助智能机器知道它们在哪里。它可以广泛地应用于计算机视觉和机器人技术中的各种任务。大多数先驱方法都提取了一组低级特征，并将其直接放入分类器中以识别场景类别。但事实证明，低级功能无法正常工作。当前，研究人员旨在克服低视觉特征与高语义类别之间的语义鸿沟，以提高识别性能。因此，将低级描述符转换为更丰富的中间表示形式已经引起了很多关注。提出了一种基于中间特征表示的新方法，解决了场景图像语义类别的识别问题。该提议的方法对SIFT特征使用了稀疏编码，并提出了一种对空间布局敏感的合并方法。池的空间布局基于每个图像中大小分别为1 * 1、1 * 4和4 * 1的三个矩形。它们是通过在水平和垂直方向上均匀划分图像而从场景图像的固有特性派生而来的。这种空间池化策略更容易，并且可以获得场景图像的最佳表示。极限学习机（ELM）用作分类器。 ELM已显示出强大的能力来拟合非线性分类边界。实验结果表明，该方法不仅在提取低维图像特征方面，而且在识别性能方面也优于其他类似技术。

著录项

来源
《IEEE International Conference on Information and Automation》|2016年|178-183|共6页
会议地点
作者
Lingying Wu; Yuanlong Yu; Jason Gu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Histograms; Encoding; Semantics; Training; Image recognition; Layout;

机译：特征提取;直方图;编码;语义;训练;图像识别;布局;

相似文献

外文文献
中文文献
专利

1. Traffic Sign Recognition Method Integrating Multi-Layer Features and Kernel Extreme Learning Machine Classifier [J] . Wei Sun, Hongji Du, Shoubai Nie, Computers, Materials & Continua . 2019,第1期

机译：交通标志识别方法集成多层功能和内核极端学习机分类器
2. Unsupervised feature learning with sparse Bayesian auto-encoding based extreme learning machine [J] . Zhang Guanghao, Cui Dongshun, Mao Shangbo, International journal of machine learning and cybernetics . 2020,第7期

机译：基于稀疏的贝叶斯自动编码基于极端学习机的无监督功能
3. An Improved Recognition Approach for Noisy Multispectral Palmprint by Robust L2 Sparse Representation with a Tensor-Based Extreme Learning Machine [J] . Cheng Dongxu, Zhang Xinman, Xu Xuebin Nature reviews Cancer . 2019,第2期

机译：通过张力的基于极端学习机的鲁棒L2稀疏表示，改进了嘈杂多光谱掌纹的识别方法
4. A scene recognition method using sparse features with layout-sensitive pooling and extreme learning machine [C] . Lingying Wu, Yuanlong Yu, Jason Gu IEEE International Conference on Information and Automation . 2016

机译：一种使用稀疏功能的场景识别方法，具有布局敏感池和极限学习机
5. A neural model of scene understanding: Multiple-scale spatial and feature-based attention in scene search, learning, and recognition. [D] . Huang, Tsung-Ren. 2010

机译：场景理解的神经模型：场景搜索，学习和识别中多尺度基于空间和基于特征的注意力。
6. Sparse Feature Learning of Hyperspectral Imagery via Multiobjective-Based Extreme Learning Machine [O] . Xiaoping Fang, Yaoming Cai, Zhihua Cai, 2020

机译：基于多目标极限学习机的高光谱图像稀疏特征学习
7. Sparse Feature Learning of Hyperspectral Imagery via Multiobjective-Based Extreme Learning Machine [O] . Xiaoping Fang, Yaoming Cai, Zhihua Cai, 2020

机译：基于多目标的极端学习机的高光谱图像稀疏特征学习

A scene recognition method using sparse features with layout-sensitive pooling and extreme learning machine

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅