Efficient Compiler Code Generation for Deep Learning Snowflake Co-Processor

Abstract

Deep Neural Networks (DNNs) are widely used in applications including image classification, semantic segmentation and natural language processing. Various DNN models have been developed to achieve high accuracy on different tasks. Efficiently mapping the workflow of those models onto custom accelerators requires programmable hardware and a custom compiler. In this work, we use Snowflake, a programmable accelerator targeted at DNNs. We also present a compiler that generates correct code for Snowflake. Our system was evaluated on various convolution layers present in AlexNet, ResNet and LightCNN. Snowflake with 256 processing units was implemented on a Xilinx FPGA, and it achieved 70 frames/s for AlexNet without linear layers.

Bibliographic details

Source
Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications | 2018 | pp. 24-28 | 5 pages
Conference location: Williamsburg (US)
Authors
Andre Xian Ming Chang; Aliasger Zaidy; Eugenio Culurciello
Original format: PDF
Keywords
Kernel; Copper; Three-dimensional displays; Task analysis; Field programmable gate arrays
