首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation
【24h】

Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation

机译:嘈杂的Softmax:通过推迟早期的Softmax饱和度来提高DCNN的泛化能力

获取原文

摘要

Over the past few years, softmax and SGD have become a commonly used component and the default training strategy in CNN frameworks, respectively. However, when optimizing CNNs with SGD, the saturation behavior behind softmax always gives us an illusion of training well and then is omitted. In this paper, we first emphasize that the early saturation behavior of softmax will impede the exploration of SGD, which sometimes is a reason for model converging at a bad local-minima, then propose Noisy Softmax to mitigating this early saturation issue by injecting annealed noise in softmax during each iteration. This operation based on noise injection aims at postponing the early saturation and further bringing continuous gradients propagation so as to significantly encourage SGD solver to be more exploratory and help to find a better local-minima. This paper empirically verifies the superiority of the early softmax desaturation, and our method indeed improves the generalization ability of CNN model by regularization. We experimentally find that this early desaturation helps optimization in many tasks, yielding state-of-the-art or competitive results on several popular benchmark datasets.
机译:在过去的几年中,softmax和SGD分别成为CNN框架中的常用组件和默认培训策略。但是,当使用SGD优化CNN时,softmax背后的饱和行为总是给我们一种训练良好的幻觉,因此被省略。在本文中,我们首先强调softmax的早期饱和行为将阻碍SGD的探索,这有时是模型在较差的局部最小值处收敛的原因,然后提出Noisy Softmax通过注入退火噪声来缓解此早期饱和问题在每次迭代期间在softmax中。此基于噪声注入的操作旨在延迟早期饱和并进一步带来连续的梯度传播,从而显着鼓励SGD求解器更具探索性,并有助于找到更好的局部最小值。本文通过经验验证了早期softmax去饱和的优越性,我们的方法确实通过正则化提高了CNN模型的泛化能力。我们通过实验发现,这种早期去饱和有助于在许多任务中进行优化,从而在几个流行的基准数据集上产生最先进的或具有竞争力的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号