International conference on digital image processing

A New Method of 3D Scene Recognition from Still Images

Abstract

Most methods of monocular visual three-dimensional (3D) scene recognition involve supervised machine learning. However, these methods often rely on prior knowledge: they learn the image scene as part of a training dataset. For this reason, monocular visual 3D scene recognition may fail when the sampling equipment or the scene is changed. To cope with this problem, a new unsupervised learning method for monocular visual 3D scene recognition is proposed here. First, the image is partitioned by superpixel segmentation based on the CIELAB color-space values L, a, and b and on the pixel coordinates x and y, forming a superpixel image of a specified density. Second, a spectral clustering algorithm based on the superpixels' color characteristics and neighboring relationships is used to reduce the dimensionality of the superpixel image. Third, fuzzy distribution density functions representing sky, ground, and facade are multiplied with the pixels of each segment to obtain the segments' expectations, which yields a preliminary classification into sky, ground, and facade. Fourth, the most accurate classification images of sky, ground, and facade are extracted using tier-1 wavelet sampling and the Manhattan direction feature. Finally, a depth perception map is generated from the pinhole imaging model and the linear perspective information of the ground surface. The algorithm was tested on 400 images of the Make3D Image dataset from the Cornell University website. The experimental results show that this unsupervised learning method provides a more effective monocular visual 3D scene recognition model than other methods.
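
The first two stages described above (superpixel segmentation on (L, a, b, x, y) features, then spectral clustering of the superpixels) can be sketched roughly as follows. This is a minimal illustration that assumes scikit-image's SLIC and scikit-learn's SpectralClustering as stand-ins for the paper's components; the affinity here uses only mean superpixel color (the neighboring-relationship term is omitted), and all parameter values such as superpixel density and cluster count are illustrative, not taken from the paper.

```python
# Sketch of stages 1-2: SLIC superpixels in CIELAB, then spectral clustering
# of superpixels by mean color. Parameters are illustrative assumptions.
import numpy as np
from skimage import color, segmentation
from sklearn.cluster import SpectralClustering

def cluster_superpixels(image_rgb, n_superpixels=400, n_regions=8):
    # Stage 1: SLIC clusters pixels on (L, a, b, x, y), matching the
    # feature vector described in the abstract.
    labels = segmentation.slic(image_rgb, n_segments=n_superpixels,
                               compactness=10, start_label=0)

    # Mean CIELAB color of each superpixel.
    lab = color.rgb2lab(image_rgb)
    n = labels.max() + 1
    means = np.array([lab[labels == i].mean(axis=0) for i in range(n)])

    # Stage 2: spectral clustering on a color-similarity affinity between
    # superpixels (adjacency weighting omitted for brevity).
    dist = np.linalg.norm(means[:, None, :] - means[None, :, :], axis=-1)
    affinity = np.exp(-(dist ** 2) / (2 * dist.std() ** 2 + 1e-8))
    sc = SpectralClustering(n_clusters=n_regions, affinity='precomputed',
                            random_state=0)
    region_of_superpixel = sc.fit_predict(affinity)

    # Map each pixel back to its region to obtain the dimension-reduced image.
    return region_of_superpixel[labels]
```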
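
The final stage, assigning depth to ground pixels from the pinhole imaging model and the ground plane's linear perspective, reduces to the standard flat-ground relation depth = f * h / (v - v_horizon) for image rows v below the horizon. A minimal sketch under that assumption is given below; the focal length, camera height, and horizon row are placeholder values, not parameters reported in the paper.

```python
# Sketch of the final stage: flat-ground depth from the pinhole model.
import numpy as np

def ground_depth_map(height, width, v_horizon, f=800.0, cam_height=1.6):
    # Row index of every pixel, broadcast across columns.
    rows = np.arange(height, dtype=float)[:, None] * np.ones((1, width))
    # Flat-ground pinhole relation: depth = f * h / (v - v_horizon) for rows
    # below the horizon; other pixels are left at infinity here (in the full
    # method they would be filled from the sky/facade classification).
    return np.where(rows > v_horizon,
                    f * cam_height / np.maximum(rows - v_horizon, 1e-6),
                    np.inf)
```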
