首页> 外文会议>International Conference on Soft Computing Models in Industrial and Environmental Applications >Generating 2.5D Photorealistic Synthetic Datasets for Training Machine Vision Algorithms
【24h】

Generating 2.5D Photorealistic Synthetic Datasets for Training Machine Vision Algorithms

机译:用于训练机视觉算法的2.5D光电型合成数据集

获取原文

摘要

The continued success of deep convolution neural networks (CNN) in computer vision can be directly linked to vast amounts of data and tremendous processing resources for training such non-linear models. However, depending on the task, the available amount of data varies significantly. Particularly robotic systems usually rely on small amounts of data, as producing and annotating them is extremely robot and task specific (e.g. grasping) and therefore prohibitive. Recently, in order to address the aforementioned problem of small datasets in robotic vision, a common practice is to reuse features that are already learned by a CNN within a large-scale task and apply them to different small scale ones. This transfer of learning shows some promising results as an alternative, but nevertheless it can not be compared with the performance of a CNN that is specifically trained from the beginning for that specific task. Thus, many researchers turned to synthetic datasets for training, since they can be produced easily and cost effectively. The main issue of such datasets that already exist, is the lack of photorealism both in terms of background and lighting. Herein, we are proposing a framework for the generation of completely synthetic datasets that includes all types of data that state-of-the-art algorithms in object recognition, and tracking need for their training. Thus, we can improve robotic perception without deploying the robot in time-consuming real-world scenarios.
机译:计算机愿景中的深度卷积神经网络(CNN)的持续成功可以直接与大量数据和巨大的处理资源直接相关,以训练这种非线性模型。但是,根据任务,可用的数据量显着变化。特别是机器人系统通常依赖于少量数据,因为产生和注释它们是极其机器人和特定的任务(例如抓握),因此令人望而却步。最近,为了解决机器人视觉中的小型数据集的上述问题,常识是重用在大规模任务中由CNN学习的功能,并将它们应用于不同的小规模。这种学习的转移显示了一些有希望的结果作为替代方案,但尽管如此,它不能与从开始为该特定任务的开始专门培训的CNN的性能进行比较。因此,许多研究人员转向了培训的合成数据集,因为它们可以容易且成本有效地生产。已经存在的此类数据集的主要问题是在背景和照明方面缺乏光容。这里,我们提出了一种用于生成完全合成数据集的框架,该数据集包括对象识别中最先进的算法以及对其训练的最新算法的所有类型的数据。因此,我们可以提高机器人感知,而不在耗时的现实情景中部署机器人。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号