首页> 外文会议>International Conference on Pattern Recognition >Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks
【24h】

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

机译:卷积神经网络中的可训练谱初始化矩阵变换

获取原文

摘要

In this work, we introduce a new architectural component to Neural Network (NN), i.e., trainable and spectrally initializable matrix transformations on feature maps. While previous literature has already demonstrated the possibility of adding static spectral transformations as feature processors, our focus is on more general trainable transforms. We study the transforms in various architectural configurations on four datasets of different nature: from medical (ColorectalHist, HAM10000) and natural (Flowers) images to historical documents (CB55). With rigorous experiments that control for the number of parameters and randomness, we show that networks utilizing the introduced matrix transformations outperform vanilla neural networks. The observed accuracy increases appreciably across all datasets. In addition, we show that the benefit of spectral initialization leads to significantly faster convergence, as opposed to randomly initialized matrix transformations. The transformations are implemented as auto-differentiable PyTorch modules that can be incorporated into any neural network architecture. The entire code base is open-source.
机译:在这项工作中,我们向神经网络(NN),即在特征映射上的可训练和频谱初始化矩阵变换引入了新的架构组件。虽然以前的文献已经证明了作为特征处理器的静态光谱变换的可能性,但我们的重点是更一般的培训变换。我们在不同性质的四个数据集中研究了各种架构配置的变换:从医疗(Collectalhist,Ham10000)和自然(花)图像到历史文档(CB55)。对于控制参数和随机性的次数的严格实验,我们显示利用引入的矩阵变换来赢得VANILLA神经网络的网络。所有数据集中观察到的准确性会显着增加。此外,我们表明光谱初始化的益处导致了更快的收敛性,而不是随机初始化的矩阵变换。变换实现为可将自动微分的Pytorch模块结合到任何神经网络架构中。整个代码库是开源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号