IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

XNOR Neural Engine: A Hardware Accelerator IP for 21.6-fJ/op Binary Neural Network Inference

Abstract

Binary neural networks (BNNs) promise to deliver accuracy comparable to that of conventional deep neural networks at a fraction of the memory and energy cost. In this paper, we introduce the XNOR neural engine (XNE), a fully digital, configurable hardware accelerator IP for BNNs, integrated within a microcontroller unit (MCU) equipped with an autonomous I/O subsystem and a hybrid SRAM/standard-cell memory. The XNE can compute convolutional and dense layers either fully autonomously or in cooperation with the MCU core to realize more complex behaviors. We present post-synthesis results in 65- and 22-nm technology for the XNE IP, and post-layout results in 22 nm for the full MCU, indicating that this system can drop the energy cost per binary operation to 21.6 fJ at 0.4 V while remaining flexible and performant enough to execute state-of-the-art BNN topologies such as ResNet-34 in less than 2.2 mJ per frame at 8.9 frames/s.
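The efficiency of an XNOR engine rests on the standard BNN reduction: with activations and weights constrained to {-1, +1}, each multiplication becomes a bitwise XNOR and each accumulation a population count over packed words. The following C sketch illustrates that reduction for one 32-bit word; bdot32 is a hypothetical helper shown for illustration only (using the GCC/Clang __builtin_popcount intrinsic), not the XNE datapath itself.

#include <stdint.h>

/* Binary dot product of 32 {-1,+1} values, each packed as one bit
 * (bit 1 encodes +1, bit 0 encodes -1). A product is +1 exactly when
 * the two bits agree, which is XNOR; the accumulated sum is then
 * (+1)*matches + (-1)*(32 - matches) = 2*matches - 32.
 * Hypothetical illustration, not the paper's implementation. */
static inline int32_t bdot32(uint32_t activations, uint32_t weights)
{
    uint32_t agree = ~(activations ^ weights);   /* XNOR: 1 where signs match */
    int32_t matches = __builtin_popcount(agree); /* count of +1 products */
    return 2 * matches - 32;                     /* matches minus mismatches */
}

A dense layer, or a convolution lowered to matrix form, then reduces to a loop of such XNOR-popcount steps over packed words, which is what lets a fixed-function binary datapath reach per-operation energies far below those of full-precision multiply-accumulate.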
