From RGB-D Images to RGB Images: Single Labeling for Mining Visual Models

Zhang Quanshi; Song Xuan; Shao Xiaowei; Zhao Huijing; Shibasaki Ryosuke

首页> 外文期刊>ACM transactions on intelligent systems >From RGB-D Images to RGB Images: Single Labeling for Mining Visual Models

【24h】

From RGB-D Images to RGB Images: Single Labeling for Mining Visual Models

机译：从RGB-D图像到RGB图像：单一标签用于挖掘视觉模型

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mining object-level knowledge, that is, building a comprehensive category model base, from a large set of cluttered scenes presents a considerable challenge to the field of artificial intelligence. How to initiate model learning with the least human supervision (i.e., manual labeling) and how to encode the structural knowledge are two elements of this challenge, as they largely determine the scalability and applicability of any solution. In this article, we propose a model-learning method that starts from a single-labeled object for each category, and mines further model knowledge from a number of informally captured, cluttered scenes. However, in these scenes, target objects are relatively small and have large variations in texture, scale, and rotation. Thus, to reduce the model bias normally associated with less supervised learning methods, we use the robust 3D shape in RGB-D images to guide our model learning, then apply the properly trained category models to both object detection and recognition in more conventional RGB images. In addition to model training for their own categories, the knowledge extracted from the RGB-D images can also be transferred to guide model learning for a new category, in which only RGB images without depth information in the new category are provided for training. Preliminary testing shows that the proposed method performs as well as fully supervised learning methods.

机译：从一大堆混乱的场景中挖掘对象级别的知识，即建立一个综合的类别模型库，对人工智能领域提出了相当大的挑战。如何在最少的人工监督下（即手动标记）启动模型学习以及如何对结构知识进行编码是此挑战的两个要素，因为它们在很大程度上决定了任何解决方案的可扩展性和适用性。在本文中，我们提出了一种模型学习方法，该方法从每个类别的单个标签对象开始，并从许多非正式捕获的，混乱的场景中挖掘更多的模型知识。但是，在这些场景中，目标对象相对较小，并且在纹理，比例和旋转方面具有较大的变化。因此，为了减少通常与较少监督学习方法相关的模型偏差，我们在RGB-D图像中使用鲁棒的3D形状来指导我们的模型学习，然后将经过适当训练的类别模型应用于更常规的RGB图像中的对象检测和识别。除了针对自己类别的模型训练之外，从RGB-D图像中提取的知识也可以用于指导新类别的模型学习，其中仅提供新类别中没有深度信息的RGB图像进行训练。初步测试表明，该方法的性能和完全监督的学习方法一样好。

著录项

来源
《ACM transactions on intelligent systems》 |2015年第2期|16.1-16.29|共29页
作者
Zhang Quanshi; Song Xuan; Shao Xiaowei; Zhao Huijing; Shibasaki Ryosuke;
展开▼
作者单位

Univ Calif Los Angeles, Los Angeles, CA 90089 USA|Univ Tokyo, Tokyo 1138654, Japan;

Univ Tokyo, Ctr Spatial Informat Sci, Tokyo, Japan;

Univ Tokyo, Ctr Spatial Informat Sci, Tokyo, Japan;

Peking Univ, Key Lab Machine Percept MoE, Beijing 100871, Peoples R China;

Univ Tokyo, Ctr Spatial Informat Sci, Tokyo, Japan|Univ Tokyo, Tokyo 1138654, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Design; Algorithms; Performance; Theory; Data mining; computer vision; big visual data; visual mining; transfer learning; visual knowledge base; RGB-D sensor;

机译：设计;算法;性能;理论;数据挖掘;计算机视觉;大视觉数据;视觉挖掘;转移学习;视觉知识库;RGB-D传感器;

相似文献

外文文献
中文文献
专利

1. RDBN: Visual relationship detection with inaccurate RGB-D images [J] . Liu Xiaozhou, Gan Ming-Gang Knowledge-Based Systems . 2020,第Sepa27期

机译：RDBN：具有不准确的RGB-D图像的视觉关系检测
2. Visual saliency detection for RGB-D images under a Bayesian framework [J] . Songtao Wang, Zhen Zhou, Wei Jin, IPSJ Transactions on Computer Vision and Applications . 2018,第1期

机译：贝叶斯框架下RGB-D图像的视觉显着性检测
3. SymmetryNet: Learning to Predict Reflectional and Rotational Symmetries of 3D Shapes from Single-View RGB-D Images [J] . YIFEI SHI, JUNWEN HUANG, HONGJIA ZHANG, ACM Transactions on Graphics . 2020,第6CD期

机译：对称性：从单视图RGB-D图像中学习预测3D形状的反射和旋转对称
4. 3D Object Discovery and Modeling Using Single RGB-D Images Containing Multiple Object Instances [C] . Wim Abbeloos, Esra Ataer-Cansizoglu, Sergio Caccamo, International Conference on 3D Vision . 2017

机译：使用包含多个对象实例的单个RGB-D图像进行3D对象发现和建模
5. Object Localization from RGB-D Images and Spatial Referring Expressions [D] . Mauceri, Cecilia. 2021

机译：来自RGB-D图像和空间引用表达式的对象本地化
6. RGB-D Image Processing Algorithm for Target Recognition and Pose Estimation of Visual Servo System [O] . Shipeng Li, Di Li, Chunhua Zhang, 2020

机译：视觉伺服系统目标识别和姿态估计的RGB-D图像处理算法
7. 3D Object Discovery and Modeling Using Single RGB-D Images Containing Multiple Object Instances [O] . Abbeloos Wim, Ataer-Cansizoglu Esra, Caccamo Sergio, 2017

机译：使用包含多个对象实例的单个RGB-D图像进行3D对象发现和建模
8. Application of Data Mining and Knowledge Discovery Techniques to Enhance Binary Target Detection and Decision-Making for Compromised Visual Images [R] . Repperger, D. W. , Phillips, C. A. , Schrider, C. D. , 2004

机译：数据挖掘与知识发现技术在受损视觉图像二值目标检测与决策中的应用

From RGB-D Images to RGB Images: Single Labeling for Mining Visual Models

摘要

著录项

相似文献

相关主题

期刊订阅