首页> 外文会议>European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases >Deep Learning Architecture Search by Neuro-Cell-Based Evolution with Function-Preserving Mutations
【24h】

Deep Learning Architecture Search by Neuro-Cell-Based Evolution with Function-Preserving Mutations

机译:基于神经细胞的进化与功能保存突变的深度学习建筑

获取原文

摘要

The design of convolutional neural network architectures for a new image data set is a laborious and computational expensive task which requires expert knowledge. We propose a novel neuro-evolutionary technique to solve this problem without human interference. Our method assumes that a convolutional neural network architecture is a sequence of neuro-cells and keeps mutating them using function-preserving operations. This novel combination of approaches has several advantages. We define the network architecture by a sequence of repeating neurocells which reduces the search space complexity. Furthermore, these cells are possibly transferable and can be used in order to arbitrarily extend the complexity of the network. Mutations based on function-preserving operations guarantee better parameter initialization than random initialization such that less training time is required per network architecture. Our proposed method finds within 12 GPU hours neural network architectures that can achieve a classification error of about 4% and 24% with only 5.5 and 6.5 million parameters on CIFAR-10 and CIFAR-100, respectively. In comparison to competitor approaches, our method provides similar competitive results but requires orders of magnitudes less search time and in many cases less network parameters.
机译:卷积神经网络架构的一个新的图像数据集的设计是一个费力又昂贵的计算任务,这需要专业知识。我们提出了一个新颖的神经进化技术,无需人为干预来解决这个问题。我们的方法假定卷积神经网络结构是神经细胞的序列,并让他们使用的功能保留操作变异。这种新颖的方式组合有几个优点。我们通过重复neurocells这减少了搜索空间复杂度的序列定义了网络架构。此外,这些细胞可能转让,也以任意延长了网络的复杂性来使用。基于功能保留操作的突变保证更好的参数初始化不是随机的初始化,从而减少培训时间为每一个网络架构需要。我们提出的方法在12小时的GPU神经网络结构,可以实现的约4%和24%的分类误差与仅5.5和6.5百万在CIFAR-10和CIFAR-100,分别参数认定。相较于竞争对手的方法,我们的方法提供了类似的竞争的结果,但需要数量级较少的搜索时间,而且在许多情况下,减少了网络参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号