Published in: Proceedings of the 2013 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2013)

SMAT: An Input Adaptive Auto-Tuner for Sparse Matrix-Vector Multiplication


Abstract

Sparse matrix-vector multiplication (SpMV) is an important kernel in both traditional high-performance computing and emerging data-intensive applications. To date, SpMV libraries have been optimized with either application-specific or architecture-specific approaches, making them too complicated to be used widely in real applications. In this work we develop a Sparse Matrix-vector multiplication Auto-Tuning system (SMAT) to bridge the gap between specific optimizations and general-purpose usage. SMAT provides users with a unified programming interface in compressed sparse row (CSR) format and automatically determines the optimal storage format and implementation for any input sparse matrix at runtime. To this end, SMAT leverages a learning model, generated in an off-line stage by a machine-learning method from a training set of more than 2000 matrices in the UF sparse matrix collection, to quickly predict the best combination of matrix feature parameters. Our experiments show that SMAT achieves performance of up to 51 GFLOPS in single precision and 37 GFLOPS in double precision on mainstream x86 multi-core processors, both more than 3 times faster than the Intel MKL library. We also demonstrate its adaptability in an algebraic multigrid solver from the Hypre library, with more than 20% performance improvement reported.
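The CSR format that SMAT exposes as its unified interface stores a sparse matrix as three arrays: the nonzero values, their column indices, and per-row offsets into those arrays. A minimal sketch of a CSR SpMV kernel (illustrative only; the names `spmv_csr`, `values`, `col_idx`, and `row_ptr` are our own, not from the paper or any SMAT API):

```python
import numpy as np

def spmv_csr(values, col_idx, row_ptr, x):
    """Compute y = A @ x for a matrix A stored in CSR form.

    values  : nonzero entries of A, row by row
    col_idx : column index of each entry in `values`
    row_ptr : row i's entries occupy values[row_ptr[i]:row_ptr[i+1]]
    """
    n = len(row_ptr) - 1
    y = np.zeros(n)
    for i in range(n):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y

# 3x3 example matrix [[4, 0, 1], [0, 2, 0], [3, 0, 5]]
values  = np.array([4.0, 1.0, 2.0, 3.0, 5.0])
col_idx = np.array([0, 2, 1, 0, 2])
row_ptr = np.array([0, 2, 3, 5])
x = np.array([1.0, 2.0, 3.0])
print(spmv_csr(values, col_idx, row_ptr, x))  # [ 7.  4. 18.]
```

An auto-tuner like SMAT takes input in this one format and, based on features of the matrix (e.g., nonzero distribution across rows), may internally convert it to another storage format whose SpMV implementation runs faster on the target processor.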
