2019 Spring Simulation Conference

Systolic Sparse Matrix Vector Multiply in the Age of TPUs and Accelerators



Abstract

Tensor Processing Units have brought systolic arrays back as a computational alternative for high performance computing. Google recently presented a Tensor Processing Unit that handles matrix multiplication using systolic arrays. This unit is designed for dense matrices only; as the authors state, sparse architectural support was omitted for the time being, but sparsity will be a focus of future designs. We propose a systolic array that computes the sparse matrix-vector product in T2(n) ≈ ⌈nnz/2⌉ + 2n + 2 cycles using 2n + 2 processing elements. The proposed systolic array also uses accumulators to collect the partial results of the output vector and supports adaptive tiling.
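To make the quantities in the abstract concrete, the following is a minimal sketch (not the paper's hardware design): a plain software CSR sparse matrix-vector product as a functional reference, together with the cycle-count estimate T2(n) ≈ ⌈nnz/2⌉ + 2n + 2 quoted above. The function names and the CSR layout are illustrative assumptions.

```python
# Hypothetical reference sketch, assuming a standard CSR layout.
# The systolic dataflow itself is not modeled here; only the
# mathematical result y = A @ x and the abstract's cycle estimate.
from math import ceil

def spmv_csr(values, col_idx, row_ptr, x):
    """Reference CSR sparse matrix-vector product: returns y = A @ x."""
    n = len(row_ptr) - 1
    y = [0.0] * n
    for i in range(n):
        # Accumulate the nonzeros of row i, mirroring the role of the
        # per-row accumulators described in the abstract.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y

def systolic_cycle_estimate(nnz, n):
    """Cycle count claimed in the abstract: ceil(nnz/2) + 2n + 2,
    for an array of 2n + 2 processing elements."""
    return ceil(nnz / 2) + 2 * n + 2

# 3x3 example: A = [[1, 0, 2], [0, 3, 0], [4, 0, 5]], x = [1, 1, 1]
values  = [1.0, 2.0, 3.0, 4.0, 5.0]
col_idx = [0, 2, 1, 0, 2]
row_ptr = [0, 2, 3, 5]
y = spmv_csr(values, col_idx, row_ptr, [1.0, 1.0, 1.0])
# y == [3.0, 3.0, 9.0]; estimate: ceil(5/2) + 2*3 + 2 = 11 cycles
```

For this 3×3 matrix with nnz = 5, the estimate evaluates to 11 cycles, illustrating that for matrices where nnz grows faster than n the ⌈nnz/2⌉ term dominates the latency.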


