IEEE International Conference on Consumer Electronics - Taiwan

High Throughput Hardware Implementation for Deep Learning AI Accelerator



Abstract

In this paper, a high-throughput hardware accelerator for deep learning neural networks is proposed. Since deep learning workloads generate heavy data traffic to DRAM, we design a high-data-reuse architecture that reduces direct accesses to external DRAM, together with a pipeline scheme that meets high-throughput requirements. The proposed architecture uses INT8 arithmetic, a 128-bit AXI bus protocol, and parallel processing with 16 processing units, and achieves real-time operation at a 125 MHz clock frequency with a throughput of 8 GOPS.
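The headline numbers in the abstract can be sanity-checked with back-of-the-envelope arithmetic. As a minimal sketch (the per-unit MAC count and op-counting convention below are assumptions, not stated in the paper): if each of the 16 processing units performs 2 INT8 multiply-accumulates per cycle, and each MAC counts as 2 operations, the stated 8 GOPS at 125 MHz follows, and the 128-bit AXI bus at the same clock would supply 2 GB/s of DRAM bandwidth.

```python
# Back-of-the-envelope check of the stated figures.
# Assumed (not from the paper): 2 MACs per processing unit per cycle,
# with each MAC counted as 2 operations (multiply + accumulate).
freq_hz = 125e6        # 125 MHz clock
units = 16             # parallel processing units
macs_per_unit = 2      # assumed
ops_per_mac = 2        # multiply + accumulate

gops = freq_hz * units * macs_per_unit * ops_per_mac / 1e9
print(gops)            # 8.0 GOPS, matching the abstract

# Peak AXI bandwidth: 128-bit (16-byte) bus, one beat per cycle (assumed).
axi_gbps = 16 * freq_hz / 1e9
print(axi_gbps)        # 2.0 GB/s
```

This also illustrates why the high-data-reuse design matters: at INT8 precision, 8 GOPS of compute could consume operands far faster than 2 GB/s of external bandwidth can deliver them unless data is reused on-chip.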
