首页> 外文会议>IEEE International Conference on Big Data Computing Service and Applications >AutoDLCon: An Approach for Controlling the Automated Tuning for Deep Learning Networks
【24h】

AutoDLCon: An Approach for Controlling the Automated Tuning for Deep Learning Networks

机译:AutoDLCon:一种控制深度学习网络自动调整的方法

获取原文

摘要

Neural networks have become the main building block on revolutionizing the field of artificial intelligence aided applications. With the wide availability of data and the increasing capacity of computing resources, they triggered a new era of state-of-the-art results in diverse directions. However, building neural network models is domain-specific, and figuring out the best architecture and hyper-parameters in each problem is still an art. In practice, it is a highly iterative process that is very time-consuming, requires substantial computing resources, and needs deep knowledge and solid technical skills. To tackle this challenge, we introduce a new gradient-based technique, AutoDLCon, that aims to automate the design process of neural network architecture for the given classification task and dataset within a specified time budget using a controller neural network. In particular, the controller network predicts how good a model is and suggests trying an optimized model by back-propagating from a loss function through the controller network to the controller’s weights one time and to the controller’s inputs at another time. This approach mimics the exploration of the search space in a more efficient way by reducing the number of trials to the minimum. As a consequence, it can significantly reduce the time budget and computing resources to a minimum and controllable level. We evaluate our approach using MNIST and CIFAR-10 datasets with different settings based on the difficulty of the problem. The results of our experiments show that our approach is able to generate child models that are good enough to obtain competitive results on the validation data after only a few trials.
机译:神经网络已成为革新人工智能辅助应用领域的主要基础。随着数据的广泛可用性和计算资源的不断增长,它们引发了朝着不同方向发展最新结果的新时代。但是,建立神经网络模型是特定于领域的,并且找出每个问题中的最佳架构和超参数仍然是一门艺术。实际上,这是一个高度迭代的过程,非常耗时,需要大量的计算资源,并且需要深厚的知识和扎实的技术技能。为了解决这一挑战,我们引入了一种新的基于梯度的技术AutoDLCon,该技术旨在使用控制器神经网络在指定的时间预算内针对给定的分类任务和数据集自动化神经网络体系结构的设计过程。特别是,控制器网络会预测模型的质量,并建议通过从损耗函数通过控制器网络向控制器权重一次反向传播,并在另一时间向控制器输入反向传播,从而尝试优化模型。这种方法通过将试验次数减少到最少,以更有效的方式模仿了对搜索空间的探索。结果,它可以将时间预算和计算资源显着减少到最小且可控制的水平。我们根据问题的难度,使用具有不同设置的MNIST和CIFAR-10数据集来评估我们的方法。我们的实验结果表明,我们的方法能够生成子模型,这些子模型仅经过几次试验就足以在验证数据上获得竞争性结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号