首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops >Highway Network Block with Gates Constraints for Training Very Deep Networks
【24h】

Highway Network Block with Gates Constraints for Training Very Deep Networks

机译:高速公路网络块,带盖训练非常深网络的限制

获取原文

摘要

In this paper, we propose to reformulate the learning of the highway network block to realize both early optimization and improved generalization of very deep networks while preserving the network depth. Gate constraints are duly employed to improve optimization, latent representations and parameterization usage in order to efficiently learn hierarchical feature transformations which are crucial for the success of any deep network. One of the earliest very deep models with over 30 layers that was successfully trained relied on highway network blocks. Although, highway blocks suffice for alleviating optimization problem via improved information flow, we show for the first time that further in training such highway blocks may result into learning mostly untransformed features and therefore a reduction in the effective depth of the model; this could negatively impact model generalization performance. Using the proposed approach, 15-layer and 20-layer models are successfully trained with one gate and a 32-layer model using three gates. This leads to a drastic reduction of model parameters as compared to the original highway network. Extensive experiments on CIFAR-10, CIFAR-100, Fashion-MNIST and USPS datasets are performed to validate the effectiveness of the proposed approach. Particularly, we outperform the original highway network and many state-of-the-art results. To the best our knowledge, on the Fashion-MNIST and USPS datasets, the achieved results are the best reported in literature.
机译:在本文中,我们建议重构公路网络块的学习,以实现早期优化和改进非常深网络的泛化,同时保留网络深度。栅极约束正式用于改善优化,潜在表示和参数化使用,以便有效地学习对于任何深网络成功至关重要的分层特征转换。最早的最深层模型之一,拥有超过30层的,在公路网络块上依赖于成功培训。虽然,公路块通过改进的信息流量来缓解优化问题,但我们首次显示在训练此类公路块中可能导致大多数未转化的特征,从而降低了模型的有效深度的降低;这可能会对模型泛化性能产生负面影响。使用所提出的方法,使用三个门和32层模型成功培训了15层和20层模型。与原始公路网络相比,这导致模型参数的急剧减少。进行广泛的CiFar-10,CiFar-100,Fashion-Mnist和USPS数据集的实验,以验证所提出的方法的有效性。特别是,我们优于原始的公路网络和许多最先进的结果。为了最好的知识,在时尚 - Mnist和USPS数据集上,所达到的结果是文学中最好的报道。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号