首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Learning Convolutional Neural Networks using Hybrid Orthogonal Projection and Estimation
【24h】

Learning Convolutional Neural Networks using Hybrid Orthogonal Projection and Estimation

机译:使用混合正交投影和估计学习卷积神经网络

获取原文
           

摘要

Convolutional neural networks (CNNs) have yielded the excellent performance in a variety of computer vision tasks, where CNNs typically adopt a similar structure consisting of convolution layers, pooling layers and fully connected layers. In this paper, we propose to apply a novel method, namely Hybrid Orthogonal Projection and Estimation (HOPE), to CNNs in order to introduce orthogonality into the CNN structure. The HOPE model can be viewed as a hybrid model to combine feature extraction using orthogonal linear projection with mixture models. It is an effective model to extract useful information from the original high-dimension feature vectors and meanwhile filter out irrelevant noises. In this work, we present three different ways to apply the HOPE models to CNNs, i.e., em HOPE-Input, em single-HOPE-Block and em multi-HOPE-Blocks. For em HOPE-Input CNNs, a HOPE layer is directly used right after the input to de-correlate high-dimension input feature vectors. Alternatively, in em single-HOPE-Block and em multi-HOPE-Blocks CNNs, we consider to use HOPE layers to replace one or more blocks in the CNNs, where one block may include several convolutional layers and one pooling layer. The experimental results on CIFAR-10, CIFAR-100 and ImageNet databases have shown that the orthogonal constraints imposed by the HOPE layers can significantly improve the performance of CNNs in these image classification tasks (we have achieved one of the best performance when image augmentation has not been applied, and top 5 performance with image augmentation).
机译:卷积神经网络(CNN)在各种计算机视觉任务中均具有出色的性能,其中CNN通常采用由卷积层,池化层和完全连接的层组成的相似结构。在本文中,我们建议对CNN应用一种新方法,即混合正交投影和估计(HOPE),以将正交性引入CNN结构中。 HOPE模型可以看作是一种混合模型,可以将使用正交线性投影的特征提取与混合模型相结合。这是一种有效的模型,可以从原始的高维特征向量中提取有用的信息,同时滤除无关的噪声。在这项工作中,我们介绍了三种将HOPE模型应用于CNN的方法,即 em HOPE-Input, em single-HOPE-Block和 em multi-HOPE-Blocks。对于 em HOPE-Input CNN,在输入之后立即直接使用HOPE层对高维输入特征向量进行去相关。或者,在单个HOPE块和多个HOPE块CNN中,我们考虑使用HOPE层来替换CNN中的一个或多个块,其中一个块可能包括多个卷积层和一个池化层。在CIFAR-10,CIFAR-100和ImageNet数据库上的实验结果表明,由HOPE层施加的正交约束可以显着改善这些图像分类任务中CNN的性能(当图像增强具有尚未应用,并且在图像增强方面排名前5位)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号