首页> 外文期刊>Multimedia Tools and Applications >3D model retrieval based on multi-view attentional convolutional neural network
【24h】

3D model retrieval based on multi-view attentional convolutional neural network

机译:基于多视图注意力卷积神经网络的3D模型检索

获取原文
获取原文并翻译 | 示例
       

摘要

We propose a discriminative Multi-View Attentional Convolutional Neural Network, dubbed as MVA-CNN, which takes the multiple views of an shape as input and output the object category. Unlike previous view-based approaches that simply "compile" the view features into a compact 3D descriptors, our method can discover the context among multiple views in both the visual and spatial domain. First, we extract multiple rendered images from a 3D object by virtual cameras, and then we use Convolutional Neural Network (CNN) to abstract the information of the views. Second, we aggregate the visual views by two steps: 1). an element-wise maximum operation across the view features is adopted to discover discriminative features. 2). a soft attention mechanism is used to dynamically adjust the shape descriptors for better representing the spatial information. The entire network can be trained in an end-to-end way with the standard backpropagation. We verify the effectiveness of MVA-CNN on two widely used datasets: ModelNetlO, ModelNet40 by comparing our method with state-of-the-art methods.
机译:我们提出了一种辨别性的多视图注意力卷积神经网络,称为MVA-CNN,其将形状的多视图作为输入和输出对象类别。与以前的视图的方法不同,即将视图功能“编译”到Compact 3D描述符中,我们的方法可以在Visual和Spatial域中发现多个视图之间的上下文。首先,我们通过虚拟摄像机从3D对象中提取多个渲染图像,然后我们使用卷积神经网络(CNN)来抽象视图的信息。其次,我们通过两个步骤汇总视图:1)。采用跨视图特征的元素明智的最大操作来发现歧视特征。 2)。用于动态调整形状描述符的软注意机制,以便更好地代表空间信息。整个网络可以以端到端的方式培训,以标准的反向化。我们通过将我们的方法与最先进的方法进行比较,我们验证了MVA-CNN在两个广泛使用的数据集:ModelNetLo,ModelNet40上的有效性。

著录项

  • 来源
    《Multimedia Tools and Applications》 |2020年第8期|4699-4711|共13页
  • 作者单位

    School of Electrical and Information Engineering Tianjin University Tianjin 300072 China;

    School of Electrical and Information Engineering Tianjin University Tianjin 300072 China;

    School of Electrical and Information Engineering Tianjin University Tianjin 300072 China;

    School of Electrical and Information Engineering Tianjin University Tianjin 300072 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    3D model retrieval; Multi-view; CNN; LSTM;

    机译:3D模型检索;多视图;CNN;LSTM.;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号