首页> 外国专利> SYSTOLIC-CNN: AN OPENCL-DEFINED SCALABLE RUNTIME-FLEXIBLE PROGRAMMABLE ACCELERATOR ARCHITECTURE FOR ACCELERATING CONVOLUTIONAL NEURAL NETWORK INFERENCE IN CLOUD/EDGE COMPUTING

SYSTOLIC-CNN: AN OPENCL-DEFINED SCALABLE RUNTIME-FLEXIBLE PROGRAMMABLE ACCELERATOR ARCHITECTURE FOR ACCELERATING CONVOLUTIONAL NEURAL NETWORK INFERENCE IN CLOUD/EDGE COMPUTING

机译：Systolic-CNN：一个用于在云/边缘计算中加速卷积神经网络推断的OpenCL定义可扩展的运行时灵活的可编程加速器架构

页面导航

摘要
著录项
相似文献

摘要

An OpenCL-defined scalable runtime-flexible programmable accelerator architecture for accelerating convolutional neural network (CNN) inference in cloud/edge computing is provided, referred to herein as Systolic-CNN. Existing OpenCL-defined programmable accelerators (e.g., field-programmable gate array (FPGA)-based accelerators) for CNN inference are insufficient due to limited flexibility for supporting multiple CNN models at runtime and poor scalability resulting in underutilized accelerator resources and limited computational parallelism. Systolic-CNN adopts a highly pipelined and paralleled one-dimensional (1-D) systolic array architecture, which efficiently explores both spatial and temporal parallelism for accelerating CNN inference on programmable accelerators (e.g., FPGAs). Systolic-CNN is highly scalable and parameterized, and can be easily adapted by users to efficiently utilize the coarse-grained computation resources for a given programmable accelerator. In addition, Systolic-CNN is runtime-flexible and can be time-shared to accelerate a variety of CNN models at runtime without the need to recompile the programmable accelerator kernel hardware or reprogram the programmable accelerator.

机译：提供用于加速云/边缘计算的卷积神经网络（CNN）推断的OpenCL定义的可扩展运行时间柔性可编程加速器架构，在此称为Systolic-CNN。对于CNN推断，现有的OpenCL定义的可编程加速器（例如，基于现场可编程门阵列（FPGA）的加速器）由于用于在运行时支撑多个CNN模型的灵活性并且可扩展性差，导致未充分利用的加速度资源和有限的计算并行性。 Systolic-CNN采用高度流水线和平行的一维（1-D）收缩系统阵列架构，其有效地探索用于加速对可编程加速器（例如，FPGA）的CNN推断的空间和时间并行性。 Systolic-CNN是高度可扩展和参数化的，并且可以通过用户容易地调整，以有效地利用给定可编程加速器的粗粒计算资源。此外，Systolic-CNN是运行时灵活的，可以时间共享，以在运行时加速各种CNN模型，而无需重新编译可编程加速器内核硬件或重新编程可编程加速器。

著录项

公开/公告号US2021334636A1

专利类型
公开/公告日2021-10-28

原文格式PDF
申请/专利权人 AKSHAY DUA;FENGBO REN;
展开▼

申请/专利号US202117243136
发明设计人 AKSHAY DUA;FENGBO REN;
展开▼

申请日2021-04-28
分类号G06N3/063;G06N3/04;G06N3/08;
国家 US
入库时间 2022-08-24 21:57:20

相似文献

专利
外文文献
中文文献