首页> 外文会议>Conference on Neural Information Processing Systems >How to Initialize your Network? Robust Initialization for WeightNorm ResNets
【24h】

How to Initialize your Network? Robust Initialization for WeightNorm ResNets

机译:如何初始化网络? 重量ω和Resnets的鲁棒初始化

获取原文

摘要

Residual networks (ResNet) and weight normalization play an important role in various deep learning applications. However, parameter initialization strategies have not been studied previously for weight normalized networks and, in practice, initialization methods designed for un-normalized networks are used as a proxy. Similarly, initialization for ResNets have also been studied for un-normalized networks and often under simplified settings ignoring the shortcut connection. To address these issues, we propose a novel parameter initialization strategy that avoids explosion/vanishment of information across layers for weight normalized networks with and without residual connections. The proposed strategy is based on a theoretical analysis using mean field approximation. We run over 2,500 experiments and evaluate our proposal on image datasets showing that the proposed initialization outperforms existing initialization methods in terms of generalization performance, robustness to hyper-parameter values and variance between seeds, especially when networks get deeper in which case existing methods fail to even start training. Finally, we show that using our initialization in conjunction with learning rate warmup is able to reduce the gap between the performance of weight normalized and batch normalized networks.
机译:剩余网络(Reset)和重量标准化在各种深度学习应用中起着重要作用。然而,前面尚未研究参数初始化策略,以便在实践中,设计用于未归一化网络的初始化方法用作代理。类似地,还研究了未归一化网络的初始化,并且通常在简化的设置下忽略快捷方式连接。为了解决这些问题,我们提出了一种新颖的参数初始化策略,避免了跨层爆炸/消失的信息,以获得重量标准化网络,没有残余连接。拟议的策略基于使用平均场近似的理论分析。我们运行超过2,500个实验,并评估我们在图像数据集上的提议,显示所提出的初始化在泛化性能方面优于现有的初始化方法,鲁棒性对种子之间的超参数值和方差,特别是当网络更深的情况下,现有方法无法进行甚至开始培训。最后,我们表明,使用我们的初始化与学习速率预热,能够降低重量归一化和批量标准化网络的性能之间的差距。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号