Journal of Signal Processing Systems for Signal, Image, and Video Technology
MAHASIM: Machine-Learning Hardware Acceleration Using a Software-Defined Intelligent Memory System

Abstract

As the computation in machine-learning applications grows alongside dataset sizes, the energy and performance costs of data movement come to dominate those of compute. This issue is more pronounced in embedded systems with limited resources and energy. Although near-data processing (NDP) has been pursued as an architectural solution, comparatively little attention has been paid to scaling NDP for larger embedded machine-learning applications (e.g., speech and motion processing). We propose machine-learning hardware acceleration using a software-defined intelligent memory system (Mahasim). Mahasim is a scalable NDP-based memory system in which application performance scales with the size of the data. The building blocks of Mahasim are programmable memory slices, supported by data partitioning, compute-aware memory allocation, and an independent in-memory execution model. For recurrent neural networks, Mahasim achieves up to 537.95 GFLOPS/W energy efficiency and a 3.9x speedup as the system grows from 2 to 256 memory slices, indicating that Mahasim favors larger problems.
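The row-wise partitioning behind the abstract's "data partitioning" and "independent in-memory execution model" can be illustrated with a minimal sketch. All names here (`partition_rows`, `slice_compute`, `in_memory_matvec`) are illustrative assumptions, not APIs from the paper: each simulated memory slice holds a row block of an RNN weight matrix and computes its partial output independently, so no inter-slice communication is needed for the matrix-vector product itself.

```python
import numpy as np

def partition_rows(W, n_slices):
    """Split weight matrix W into row blocks, one per memory slice."""
    return np.array_split(W, n_slices, axis=0)

def slice_compute(W_block, x):
    """Work done inside one slice: a purely local matrix-vector product."""
    return W_block @ x

def in_memory_matvec(W, x, n_slices):
    """Concatenate the independently computed per-slice partial outputs."""
    blocks = partition_rows(W, n_slices)
    return np.concatenate([slice_compute(b, x) for b in blocks])

# The partitioned result matches the monolithic product W @ x.
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 4))
x = rng.standard_normal(4)
assert np.allclose(in_memory_matvec(W, x, n_slices=2), W @ x)
```

Because each slice's output depends only on its own rows of `W` and the shared input `x`, adding slices shrinks per-slice work without adding coordination, which is one intuition for why performance would scale with slice count.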
