首页> 外文会议>IEEE International Conference on Information and Automation >A scene recognition method using sparse features with layout-sensitive pooling and extreme learning machine
【24h】

A scene recognition method using sparse features with layout-sensitive pooling and extreme learning machine

机译:一种基于稀疏特征的场景识别方法及布局敏感池和极限学习机

获取原文
获取外文期刊封面目录资料

摘要

Scene recognition aims to find a semantic explanation of a scene, i.e., it helps intelligent machines to know where they are. It can be widely applied into various tasks in computer vision and robotics. Most of pioneer methods extracted a set of low-level features and put them into classifier directly to identify scene category. But it has been proved that low-level features do not work well. Currently researchers aim to overcome the semantic gap between the low-level vision features and high-level semantic categories to improve the recognition performance. Therefore, much attention has been put on transforming low-level descriptors into richer intermediate representations. This paper proposed a novel method based on intermediate feature representation to solve the problem of recognizing the semantic category of scene image. This proposed method uses sparse coding on SIFT features and presents a spatial layout sensitive pooling method. The space layout for pooling is based on three rectangles with size of 1*1,1*4 and 4*1 in each image. They are derived from inherent characteristics of the scene images by regularly dividing the image in horizontal and vertical direction. This spatial pooling strategy is easier and it can get optimal representation of scene images. Extreme learning machine (ELM) is used as a classifier. ELM has shown great ability to fit nonlinear classification boundaries. Experimental results have shown that this proposed method not only extracts lower dimension image feature but also outperforms other similar state-of-the-art methods in terms of recognition performance.
机译:场景识别旨在找到场景的语义解释,即,它可以帮助智能机器知道它们在哪里。它可以广泛地应用于计算机视觉和机器人技术中的各种任务。大多数先驱方法都提取了一组低级特征,并将其直接放入分类器中以识别场景类别。但事实证明,低级功能无法正常工作。当前,研究人员旨在克服低视觉特征与高语义类别之间的语义鸿沟,以提高识别性能。因此,将低级描述符转换为更丰富的中间表示形式已经引起了很多关注。提出了一种基于中间特征表示的新方法,解决了场景图像语义类别的识别问题。该提议的方法对SIFT特征使用了稀疏编码,并提出了一种对空间布局敏感的合并方法。池的空间布局基于每个图像中大小分别为1 * 1、1 * 4和4 * 1的三个矩形。它们是通过在水平和垂直方向上均匀划分图像而从场景图像的固有特性派生而来的。这种空间池化策略更容易,并且可以获得场景图像的最佳表示。极限学习机(ELM)用作分类器。 ELM已显示出强大的能力来拟合非线性分类边界。实验结果表明,该方法不仅在提取低维图像特征方面,而且在识别性能方面也优于其他类似技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号