ChewBaccaNN: A Flexible 223 TOPS/W BNN Accelerator


Abstract

Binary Neural Networks enable smart IoT devices, as they significantly reduce the required memory footprint and computational complexity while retaining high network performance and flexibility. This paper presents ChewBaccaNN, a 0.7 mm² binary convolutional neural network (CNN) accelerator designed in GlobalFoundries 22 nm technology. By exploiting efficient data re-use, data buffering, latch-based memories, and voltage scaling, it achieves a throughput of 241 GOPS while consuming just 1.1 mW at 0.4 V/154 MHz during inference of binary CNNs with up to 7×7 kernels, leading to a peak core energy efficiency of 223 TOPS/W. ChewBaccaNN's flexibility allows it to run a much wider range of binary CNNs than other accelerators, drastically improving the accuracy-energy trade-off beyond what can be captured by the TOPS/W metric. In fact, it can perform CIFAR-10 inference at 86.8% accuracy with merely 1.3 µJ, thus exceeding the accuracy while at the same time lowering the energy cost by 2.8× compared to even the most efficient and much larger analog processing-in-memory devices, while keeping the flexibility of running larger CNNs for higher accuracy when needed. It also runs a binary ResNet-18 trained on the 1000-class ILSVRC dataset and improves the energy efficiency by 4.4× over accelerators of similar flexibility. Furthermore, it can perform inference on a binarized ResNet-18 trained with 8-bases Group-Net to achieve a 67.5% Top-1 accuracy with only 3.0 mJ/frame, at an accuracy drop of merely 1.8% from the full-precision ResNet-18.
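The headline efficiency figure can be sanity-checked from the throughput, power, and clock values quoted in the abstract. The sketch below is illustrative only: the function names are hypothetical, and the inputs are the rounded numbers from the abstract, so the result (about 219 TOPS/W) lands slightly below the reported 223 TOPS/W, presumably because of that rounding.

```python
# Back-of-the-envelope check of the abstract's figures.
# Function names are hypothetical; inputs are the rounded values
# quoted in the abstract (241 GOPS, 1.1 mW, 154 MHz).

def tops_per_watt(throughput_gops: float, power_mw: float) -> float:
    """Energy efficiency in TOPS/W.

    GOPS / mW = (1e9 op/s) / (1e-3 W) = 1e12 op/s/W = TOPS/W,
    so the ratio needs no extra scaling factor.
    """
    return throughput_gops / power_mw


def ops_per_cycle(throughput_gops: float, clock_mhz: float) -> float:
    """Average number of operations completed per clock cycle."""
    return (throughput_gops * 1e9) / (clock_mhz * 1e6)


def femtojoules_per_op(throughput_gops: float, power_mw: float) -> float:
    """Energy per operation in femtojoules (1 fJ = 1e-15 J)."""
    joules_per_op = (power_mw * 1e-3) / (throughput_gops * 1e9)
    return joules_per_op * 1e15


print(f"{tops_per_watt(241, 1.1):.1f} TOPS/W")
print(f"{ops_per_cycle(241, 154):.0f} ops/cycle")
print(f"{femtojoules_per_op(241, 1.1):.2f} fJ/op")
```

Under these rounded inputs, each operation costs a few femtojoules and the core completes on the order of 1,500 operations per clock cycle.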

Bibliographic details

Source
IEEE International Symposium on Circuits and Systems, 2021, pp. 1-5 (5 pages)
Authors
Renzo Andri; Geethan Karunaratne; Lukas Cavigelli; Luca Benini
Original format
PDF
Keywords
Performance evaluation; Voltage measurement; Memory management; Throughput; Energy efficiency; System-on-chip; Kernel
