首页> 外文会议>IEEE International Conference on Computer Vision >Multi-view Convolutional Neural Networks for 3D Shape Recognition
【24h】

Multi-view Convolutional Neural Networks for 3D Shape Recognition

机译:用于3D形状识别的多视图卷积神经网络

获取原文

摘要

A longstanding question in computer vision concerns the representation of 3D shapes for recognition: should 3D shapes be represented with descriptors operating on their native 3D formats, such as voxel grid or polygon mesh, or can they be effectively represented with view-based descriptors? We address this question in the context of learning to recognize 3D shapes from a collection of their rendered views on 2D images. We first present a standard CNN architecture trained to recognize the shapes' rendered views independently of each other, and show that a 3D shape can be recognized even from a single view at an accuracy far higher than using state-of-the-art 3D shape descriptors. Recognition rates further increase when multiple views of the shapes are provided. In addition, we present a novel CNN architecture that combines information from multiple views of a 3D shape into a single and compact shape descriptor offering even better recognition performance. The same architecture can be applied to accurately recognize human hand-drawn sketches of shapes. We conclude that a collection of 2D views can be highly informative for 3D shape recognition and is amenable to emerging CNN architectures and their derivatives.
机译:计算机视觉中一个长期存在的问题涉及用于识别的3D形状的表示:3D形状是否应使用以其本机3D格式(例如体素网格或多边形网格)运行的描述符来表示,还是可以有效地使用基于视图的描述符来表示?我们在学习从3D形状在2D图像上的渲染视图集合中识别3D形状的背景下解决了这个问题。我们首先提出一种标准的CNN体​​系结构,该体系结构经过训练可以彼此独立地识别形状的渲染视图,并显示即使从单个视图中也可以识别3D形状,其准确度远高于使用最新的3D形状描述符。提供多个形状的视图时,识别率会进一步提高。此外,我们提出了一种新颖的CNN架构,该架构将来自3D形状多个视图的信息组合到一个紧凑的形状描述符中,从而提供了更好的识别性能。可以应用相同的体系结构来准确识别人类手绘的形状草图。我们得出的结论是,2D视图的集合对于3D形状识别可能非常有用,并且适用于新兴的CNN架构及其派生类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号