...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A generic shift-norm-activation approach for deep learning
【24h】

A generic shift-norm-activation approach for deep learning

机译:深度学习的通用移位 - 规范激活方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Deep learning has received increasing attention in the last decade. Its amazing success, is partly attributed to the evolution of normalization and activation techniques. However, less works have devoted to explore both modules together. This work, therefore, aims at pushing for a deeper understanding on the effect of normalization and activation together analytically. We design a generic method which integrates both normalization and activation together as a whole, named as the Generic Shift-Normalization Activation Approach (GSNA), in reserving richer information propagation in neural networks. A rigorous mathematical analysis was performed to investigate the benefits of the designed method, such as its computation complexity, performance potential as well as optimization over trainable parameter initialization. Further, extensive experiments are conducted to demonstrate the superiority and generality of the designed method in many computer vision benchmarking tasks, such as CIFAR-10/100, SVHN, ImageNet32 x 32, etc. To explore its generality, we also conduct some experiments on natural language understanding tasks like text classification, natural language inference, and some variational generative task as well. More interestingly, GSNA can be naturally incorporated into the existing neural networks with arbitrary architectures, demonstrating its generic effectiveness in deep learning field. (C) 2020 Elsevier Ltd. All rights reserved.
机译:在过去十年中,深度学习受到了越来越多的关注。其惊人的成功部分归功于规范化和激活技术的发展。然而,很少有人致力于同时探索这两个模块。因此,这项工作旨在从分析的角度推动对正常化和激活效应的更深入理解。为了在神经网络中保留更丰富的信息传播,我们设计了一种将归一化和激活作为一个整体集成在一起的通用方法,称为通用移位归一化激活方法(GSNA)。通过严格的数学分析,研究了所设计方法的优点,如计算复杂度、性能潜力以及相对于可训练参数初始化的优化。此外,还进行了大量实验,以证明所设计的方法在许多计算机视觉基准测试任务中的优越性和通用性,如CIFAR-10/100、SVHN、ImageNet32 x 32等。为了探索其通用性,我们还对文本分类、自然语言推理、,还有一些可变的生成任务。更有趣的是,GSNA可以自然地融入到现有的任意结构的神经网络中,证明了它在深度学习领域的通用有效性。(C) 2020爱思唯尔有限公司版权所有。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号