首页> 外文期刊>ACM transactions on intelligent systems >From RGB-D Images to RGB Images: Single Labeling for Mining Visual Models
【24h】

From RGB-D Images to RGB Images: Single Labeling for Mining Visual Models

机译:从RGB-D图像到RGB图像:单一标签用于挖掘视觉模型

获取原文
获取原文并翻译 | 示例
           

摘要

Mining object-level knowledge, that is, building a comprehensive category model base, from a large set of cluttered scenes presents a considerable challenge to the field of artificial intelligence. How to initiate model learning with the least human supervision (i.e., manual labeling) and how to encode the structural knowledge are two elements of this challenge, as they largely determine the scalability and applicability of any solution. In this article, we propose a model-learning method that starts from a single-labeled object for each category, and mines further model knowledge from a number of informally captured, cluttered scenes. However, in these scenes, target objects are relatively small and have large variations in texture, scale, and rotation. Thus, to reduce the model bias normally associated with less supervised learning methods, we use the robust 3D shape in RGB-D images to guide our model learning, then apply the properly trained category models to both object detection and recognition in more conventional RGB images. In addition to model training for their own categories, the knowledge extracted from the RGB-D images can also be transferred to guide model learning for a new category, in which only RGB images without depth information in the new category are provided for training. Preliminary testing shows that the proposed method performs as well as fully supervised learning methods.
机译:从一大堆混乱的场景中挖掘对象级别的知识,即建立一个综合的类别模型库,对人工智能领域提出了相当大的挑战。如何在最少的人工监督下(即手动标记)启动模型学习以及如何对结构知识进行编码是此挑战的两个要素,因为它们在很大程度上决定了任何解决方案的可扩展性和适用性。在本文中,我们提出了一种模型学习方法,该方法从每个类别的单个标签对象开始,并从许多非正式捕获的,混乱的场景中挖掘更多的模型知识。但是,在这些场景中,目标对象相对较小,并且在纹理,比例和旋转方面具有较大的变化。因此,为了减少通常与较少监督学习方法相关的模型偏差,我们在RGB-D图像中使用鲁棒的3D形状来指导我们的模型学习,然后将经过适当训练的类别模型应用于更常规的RGB图像中的对象检测和识别。除了针对自己类别的模型训练之外,从RGB-D图像中提取的知识也可以用于指导新类别的模型学习,其中仅提供新类别中没有深度信息的RGB图像进行训练。初步测试表明,该方法的性能和完全监督的学习方法一样好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号