首页> 外文会议>Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications >Efficient Compiler Code Generation for Deep Learning Snowflake Co-Processor
【24h】

Efficient Compiler Code Generation for Deep Learning Snowflake Co-Processor

机译:深度学习雪花协处理器的高效编译器代码生成

获取原文

摘要

Deep Neural Networks (DNNs) are widely used in various applications including image classification, semantic segmentation and natural language processing. Various DNN models were developed to achieve high accuracy on different tasks. Efficiently mapping the workflow of those models onto custom accelerators requires a programmable hardware and a custom compiler. In this work, we use Snowflake, which is a programmable DNN targeted accelerator. We also present a compiler that correctly generated code for Snowflake. Our system were evaluated on various convolution layers present in AlexNet, ResNet and LightCNN. Snowflake with 256 processing units was implemented on Xilinx FPGA, and it achieved 70 frames/s for AlexNet without linear layers.
机译:深度神经网络(DNN)被广泛用于各种应用程序,包括图像分类,语义分割和自然语言处理。开发了各种DNN模型以在不同任务上实现高精度。将这些模型的工作流程有效地映射到定制加速器上需要可编程硬件和定制编译器。在这项工作中,我们使用Snowflake,这是一个针对DNN的可编程加速器。我们还提供了可以正确生成Snowflake代码的编译器。我们的系统在AlexNet,ResNet和LightCNN中存在的各种卷积层上进行了评估。在Xilinx FPGA上实现了具有256个处理单元的Snowflake,它在没有线性层的AlexNet上达到了70帧/秒的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号