IEEE Sensors Journal

Visual Object Recognition and Pose Estimation Based on a Deep Semantic Segmentation Network

Abstract

In recent years, deep learning-based object recognition algorithms have become prominent in robotic vision applications. This paper addresses the design of a novel deep learning-based visual object recognition and pose estimation system that enables a robot manipulator to handle random object picking tasks. The proposed visual control system consists of a visual perception module, an object pose estimation module, a data augmentation module, and a robot manipulator controller. The visual perception module combines deep convolutional neural networks (CNNs) with a fully connected conditional random field layer to perform image semantic segmentation, providing stable and accurate object classification results in cluttered environments. The object pose estimation module implements a model-based method to estimate the 3D pose of the target for picking control. In addition, the proposed data augmentation module automatically generates training data for the deep CNN. Experimental results show that the scene segmentation method used in the data augmentation module reaches an average accuracy of 97.10%, higher than other state-of-the-art segmentation methods. Moreover, with the proposed data augmentation module, the visual perception module achieves accuracies above 80% and 72% when detecting and recognizing one object and three objects, respectively. The proposed model-based pose estimation method also provides accurate 3D pose estimates: the average translation and rotation errors along the three axes are all smaller than 0.52 cm and 3.95 degrees, respectively. These advantages make the proposed visual control system suitable for random object picking and manipulation applications.
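
The abstract describes the visual perception module as a deep CNN whose per-pixel class scores are refined by a fully connected conditional random field (CRF). The sketch below is a minimal illustration of that idea, not the authors' implementation: it assumes an off-the-shelf FCN-ResNet50 from torchvision as a stand-in for the paper's segmentation network and uses the pydensecrf package for the dense-CRF refinement; the CRF parameters are placeholder values.

# Minimal sketch (illustrative only): CNN softmax scores refined by a dense CRF.
import numpy as np
import torch
import torchvision.transforms.functional as TF
from torchvision.models.segmentation import fcn_resnet50
import pydensecrf.densecrf as dcrf
from pydensecrf.utils import unary_from_softmax


def segment_with_crf(rgb_uint8: np.ndarray, n_crf_iters: int = 5) -> np.ndarray:
    """Return a per-pixel label map for an RGB image (H, W, 3), dtype uint8."""
    # Stand-in CNN with pretrained weights (torchvision >= 0.13 API).
    model = fcn_resnet50(weights="DEFAULT").eval()

    # CNN forward pass -> per-class probabilities (softmax scores).
    x = torch.from_numpy(rgb_uint8).permute(2, 0, 1).float() / 255.0
    x = TF.normalize(x, mean=[0.485, 0.456, 0.406],
                     std=[0.229, 0.224, 0.225]).unsqueeze(0)
    with torch.no_grad():
        logits = model(x)["out"][0]               # (C, H, W)
    probs = torch.softmax(logits, dim=0).numpy()  # (C, H, W)

    # Fully connected CRF: unary term from the CNN, pairwise terms from
    # pixel position (Gaussian) and position + colour (bilateral).
    n_classes, h, w = probs.shape
    crf = dcrf.DenseCRF2D(w, h, n_classes)
    crf.setUnaryEnergy(np.ascontiguousarray(unary_from_softmax(probs)))
    crf.addPairwiseGaussian(sxy=3, compat=3)
    crf.addPairwiseBilateral(sxy=80, srgb=13,
                             rgbim=np.ascontiguousarray(rgb_uint8), compat=10)
    q = crf.inference(n_crf_iters)
    return np.argmax(q, axis=0).reshape(h, w)     # refined label map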
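
For the pose estimation module, the abstract states only that a model-based method recovers the target's 3D pose for picking control. The following is a hedged sketch of one common realization, assuming a reference point cloud of the target object and a depth image registered to the segmented RGB frame; ICP registration from Open3D stands in for the paper's estimator, and all names and thresholds are illustrative.

# Minimal sketch (illustrative only): segmentation mask -> scene points -> ICP pose.
import numpy as np
import open3d as o3d


def mask_to_point_cloud(depth_m, mask, fx, fy, cx, cy):
    """Back-project the masked depth pixels (in metres) into camera-frame points."""
    v, u = np.nonzero(mask)
    z = depth_m[v, u]
    pts = np.stack([(u - cx) * z / fx, (v - cy) * z / fy, z], axis=1)
    pcd = o3d.geometry.PointCloud()
    pcd.points = o3d.utility.Vector3dVector(pts)
    return pcd


def estimate_object_pose(model_pcd, scene_pcd, max_corr_dist=0.01):
    """Align the object's reference cloud to the segmented scene points; return a 4x4 pose."""
    result = o3d.pipelines.registration.registration_icp(
        model_pcd, scene_pcd, max_corr_dist, np.eye(4),
        o3d.pipelines.registration.TransformationEstimationPointToPoint())
    return result.transformation  # rotation + translation of the target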
