IEEE International Workshop on Signal Processing Systems

Adaptive Learning Rate Adjustment with Short-Term Pre-Training in Data-Parallel Deep Learning



Abstract

This paper introduces a method to adaptively choose a learning rate (LR) with short-term pre-training (STPT), which is useful for quick model prototyping in data-parallel deep learning. For an unknown model, numerous hyperparameters must be tuned. The proposed method reduces computational time and increases efficiency in finding an appropriate LR: multiple LRs are evaluated in parallel by STPT, where STPT refers to training only on the initial iterations of an epoch. When eight LRs are evaluated on eight parallel workers, the proposed method reduces computational time by 87.5% compared with the conventional method. Accuracy also improves by 4.8% over the conventional method with a reference LR of 0.1, so no deterioration in accuracy is observed. For an unknown model, this method yields a better training-curve trend than runs with fixed LRs.
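The selection scheme the abstract describes can be sketched as follows. This is an illustrative toy, not the authors' implementation: it stands in a tiny NumPy logistic-regression model for a deep network, runs each candidate LR for only a handful of iterations (the "short-term pre-training"), and picks the LR with the lowest resulting loss. In the paper each candidate would run on its own data-parallel worker; here the candidates are evaluated sequentially. The function names, the candidate LR set, and the loss-based selection criterion are assumptions for illustration.

```python
import numpy as np

def short_term_pretrain(lr, X, y, iters=10, seed=0):
    """Stand-in for one worker's short-term pre-training:
    train a toy logistic-regression model for a few iterations
    and return the final training loss."""
    rng = np.random.default_rng(seed)  # same init for every candidate LR
    w = rng.normal(scale=0.01, size=X.shape[1])
    for _ in range(iters):
        p = 1.0 / (1.0 + np.exp(-X @ w))      # sigmoid predictions
        grad = X.T @ (p - y) / len(y)          # cross-entropy gradient
        w -= lr * grad
    p = 1.0 / (1.0 + np.exp(-X @ w))
    eps = 1e-12                                # guard against log(0)
    return -np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))

def select_lr(candidates, X, y):
    """Evaluate every candidate LR by STPT and keep the one with
    the lowest short-run loss (run in parallel in the paper's setting)."""
    losses = {lr: short_term_pretrain(lr, X, y) for lr in candidates}
    best = min(losses, key=losses.get)
    return best, losses

# Synthetic linearly separable data for the toy model.
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 5))
y = (X @ np.array([1.0, -2.0, 0.5, 0.0, 1.5]) > 0).astype(float)

best_lr, losses = select_lr([0.001, 0.01, 0.1, 1.0], X, y)
```

Because every candidate starts from the same initialization and sees the same data, the short-run losses are directly comparable, which is what makes selection after only a few iterations meaningful.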
