PULP-NN: accelerating quantized neural networks on parallel ultra-low-power RISC-V processors

Journal: Philosophical transactions of the Royal Society. Mathematical, physical, and engineering sciences
Abstract

We present PULP-NN, an optimized computing library for a parallel, ultra-low-power, tightly coupled cluster of RISC-V processors. The key innovation in PULP-NN is a set of kernels for quantized neural network inference, targeting byte and sub-byte data types, down to INT-1, tuned for the recent trend toward aggressive quantization in deep neural network inference. The proposed library exploits both the digital signal processing extensions available in the PULP RISC-V processors and the cluster's parallelism, achieving up to 15.5 MACs/cycle on INT-8 and improving performance by up to 63× with respect to a sequential implementation on a single RISC-V core implementing the baseline RV32IMC ISA. Using PULP-NN, a CIFAR-10 network on an octa-core cluster runs in 30× and 19.6× fewer clock cycles than the current state-of-the-art ARM CMSIS-NN library running on STM32L4 and STM32H7 MCUs, respectively. When running on a GAP-8 processor at maximum frequency, the proposed library outperforms execution on energy-efficient MCUs such as the STM32L4 by 36.8× and on high-end MCUs such as the STM32H7 by 7.45×. At the maximum-efficiency operating point, the energy efficiency on GAP-8 is 14.1× higher than on the STM32L4 and 39.5× higher than on the STM32H7. This article is part of the theme issue 'Harmonizing energy-autonomous computing and intelligence'.
