【24h】

How to Train a Compact Binary Neural Network with High Accuracy?

机译:如何高精度训练紧凑的二元神经网络?

获取原文

摘要

How to train a binary neural network (BinaryNet) with both high compression rate and high accuracy on large scale datasets? We answer this question through a careful analysis of previous work on BinaryNets, in terms of training strategies, regularization, and activation approximation. Our findings first reveal that a low learning rate is highly preferred to avoid frequent sign changes of the weights, which often makes the learning of BinaryNets unstable. Secondly, we propose to use PReLU instead of ReLU in a BinaryNet to conveniently absorb the scale factor for weights to the activation function, which enjoys high computation efficiency for binarized layers while maintains high approximation accuracy. Thirdly, we reveal that instead of imposing L2 regularization, driving all weights to zero which contradicts with the setting of BinaryNets, we introduce a regularization term that encourages the weights to be bipolar. Fourthly, we discover that the failure of binarizing the last layer, which is essential for high compression rate, is due to the improper output range. We propose to use a scale layer to bring it to normal. Last but not least, we propose multiple binarizations to improve the approximation of the activations. The composition of all these enables us to train BinaryNets with both high compression rate and high accuracy, which is strongly supported by our extensive empirical study.
机译:如何在大规模数据集中培训二元神经网络(BinaryNet),高压缩率和高精度?在培训策略,正常化和激活近似方面,通过仔细分析之前的Binarynets对前一项工作的仔细分析来回答这个问题。我们的研究结果首先揭示了低学习率是非常优选的,以避免频繁的重量变化,这通常会使BinaryNets的学习不稳定。其次,我们建议在BinaryNet中使用PRELU而不是Relu,方便地将重量的比例因子用于激活功能,这对于二值化层来说享有高计算效率,同时保持高近似精度。第三,我们揭示了它而不是强加L2正则化,驱动所有重量到零,这与BinaryNets的设置相矛盾,我们介绍了一个促进重量为双极的正则化术语。第四,我们发现二值化最后一层的失败,这对高压缩率至关重要,是由于输出范围不当。我们建议使用秤层将其带到正常情况。最后但并非最不重要的是,我们提出了多种二进制度来改善激活的近似。所有这些的组合使我们能够以高压缩率和高精度训练BinaryNets,这是我们广泛的实证研究强烈支持的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号