Highway Network Block with Gates Constraints for Training Very Deep Networks

机译：高速公路网络块，带盖训练非常深网络的限制

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose to reformulate the learning of the highway network block to realize both early optimization and improved generalization of very deep networks while preserving the network depth. Gate constraints are duly employed to improve optimization, latent representations and parameterization usage in order to efficiently learn hierarchical feature transformations which are crucial for the success of any deep network. One of the earliest very deep models with over 30 layers that was successfully trained relied on highway network blocks. Although, highway blocks suffice for alleviating optimization problem via improved information flow, we show for the first time that further in training such highway blocks may result into learning mostly untransformed features and therefore a reduction in the effective depth of the model; this could negatively impact model generalization performance. Using the proposed approach, 15-layer and 20-layer models are successfully trained with one gate and a 32-layer model using three gates. This leads to a drastic reduction of model parameters as compared to the original highway network. Extensive experiments on CIFAR-10, CIFAR-100, Fashion-MNIST and USPS datasets are performed to validate the effectiveness of the proposed approach. Particularly, we outperform the original highway network and many state-of-the-art results. To the best our knowledge, on the Fashion-MNIST and USPS datasets, the achieved results are the best reported in literature.

机译：在本文中，我们建议重构公路网络块的学习，以实现早期优化和改进非常深网络的泛化，同时保留网络深度。栅极约束正式用于改善优化，潜在表示和参数化使用，以便有效地学习对于任何深网络成功至关重要的分层特征转换。最早的最深层模型之一，拥有超过30层的，在公路网络块上依赖于成功培训。虽然，公路块通过改进的信息流量来缓解优化问题，但我们首次显示在训练此类公路块中可能导致大多数未转化的特征，从而降低了模型的有效深度的降低;这可能会对模型泛化性能产生负面影响。使用所提出的方法，使用三个门和32层模型成功培训了15层和20层模型。与原始公路网络相比，这导致模型参数的急剧减少。进行广泛的CiFar-10，CiFar-100，Fashion-Mnist和USPS数据集的实验，以验证所提出的方法的有效性。特别是，我们优于原始的公路网络和许多最先进的结果。为了最好的知识，在时尚 - Mnist和USPS数据集上，所达到的结果是文学中最好的报道。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops》|2018年|813p|共500页
会议地点
作者
Oyebade K. Oyedotun; Abd El Rahman Shabayek; Djamila Aouada; Bj?rn Ottersten;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Road transportation; Training; Logic gates; Optimization; Error analysis; Routing; Task analysis;

机译：道路运输;培训;逻辑门;优化;误差分析;路由;任务分析;

相似文献

外文文献
中文文献
专利

1. Block layer decomposition schemes for training deep neural networks [J] . Palagi Laura, Seccia Ruggiero Journal of Global Optimization . 2020,第1期

机译：深度神经网络的块层分解方案
2. Training deep neural networks for wireless sensor networks using loosely and weakly labeled images [J] . Zhou Qianwei, Chen Yuhang, Li Baoqing, Neurocomputing . 2021,第Feba28期

机译：使用松散和弱标记的图像训练无线传感器网络的深度神经网络
3. Multi-way backpropagation networks for training compact deep neural networks [J] . Neural Networks: The Official Journal of the International Neural Network Society . 2020,第期

机译：用于训练紧凑型深神经网络的多路背部经历网络
4. Highway Network Block with Gates Constraints for Training Very Deep Networks [C] . Oyebade K. Oyedotun, Abd El Rahman Shabayek, Djamila Aouada, IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops . 2018

机译：带有门约束的高速公路网模块，用于训练非常深的网络
5. Network design under budget constraints with application to the railroad blocking problem. [D] . Newton, Harry Nelson. 1996

机译：预算约束下的网络设计及其在铁路阻塞问题中的应用。
6. Automatically Linking Registered Clinical Trials to their Published Results with Deep Highway Networks [O] . Travis R. Goodwin, Michael A. Skinner, Sanda M. Harabagiu 2018

机译：使用深层高速公路网络自动将注册的临床试验链接到其已发表的结果
7. Improved Highway Network Block for Training Very Deep Neural Networks [O] . Oyebade K. Oyedotun, Abd El Rahman Shabayek, Djamila Aouada, 2020

机译：改进的高速公路网络块训练非常深神经网络

Highway Network Block with Gates Constraints for Training Very Deep Networks

摘要

著录项

相似文献

相关主题

期刊订阅