首页> 外文期刊>Procedia Computer Science >System Architecting Approach for Designing Deep Learning Models
【24h】

System Architecting Approach for Designing Deep Learning Models

机译:设计深度学习模型的系统架构方法

获取原文
           

摘要

Deep Learning (DL) models have proven to be very effective in solving many challenging problems, especially, those related to computer vision, text, and speech. However, the design of such models is challenging because of the vast search space and computational complexity that needs to be explored. Our goal in this paper is to reduce the human effort required to design architectures by using a system architecture development process that allows the exploration of large design space by automating certain model construction, alternative generation, and assessment. The proposed framework is generic and targeted at all deep learning architectures that can be expressed by logical models with certain numeric properties. The implementation of the proposed approach is presented, along with the test results achieved on CIFAR-10 dataset using a convolutional neural network (CNN). We show that the architecture generated by our approach achieves 5.23% error rate with only 1.2M parameters, which shows the capability to design high performing architectures.
机译:事实证明,深度学习(DL)模型对于解决许多具有挑战性的问题非常有效,尤其是与计算机视觉,文本和语音相关的问题。但是,由于需要探索巨大的搜索空间和计算复杂性,因此此类模型的设计具有挑战性。本文的目标是通过使用系统架构开发过程来减少设计架构所需的人力,该过程允许通过自动执行某些模型构建,替代生成和评估来探索大型设计空间。提出的框架是通用的,针对所有可以由具有某些数字属性的逻辑模型表示的深度学习架构。与卷积神经网络(CNN)在CIFAR-10数据集上获得的测试结果一起,介绍了该方法的实现。我们表明,通过我们的方法生成的体系结构仅使用1.2M参数即可达到5.23%的错误率,这表明了设计高性能体系结构的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号