Cache Blocking for Linear Algebra Algorithms

Abstract

We briefly describe Cache Blocking for Dense Linear Algebra Algorithms on computer architectures since about 1985. Before that, one had uniform memory architectures; the Cray I machine was the last holdout. We cover the where, when, what, how and why of Cache Blocking. Almost all computer manufacturers have recently (about seven years ago) dramatically changed their computer architectures to produce Multicore (MC) processors. It will be seen that the arrangement in memory of the submatrices A_(ij) of A is a critical factor for obtaining high performance. From a practical point of view, this work is very important as it will allow existing codes using LAPACK and ScaLAPACK to remain usable by new versions of LAPACK and ScaLAPACK.
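
To illustrate the abstract's remark that the arrangement in memory of the submatrices A_(ij) matters, here is a minimal sketch in Python. It is not taken from the paper. The function names (`to_blocks`, `from_blocks`, `blocked_matmul`), the restriction to square matrices, and the requirement that the block size `nb` divides the order `n` are all assumptions made for brevity. Each nb x nb submatrix is stored as one contiguous run, so the inner kernel touches only three small contiguous blocks at a time.

```python
def to_blocks(a, n, nb):
    """Copy a row-major n x n matrix (flat list) into block-contiguous storage.

    Block (bi, bj) holds the nb x nb submatrix A_(bi,bj) as one contiguous
    run of nb * nb entries. Assumes nb divides n (illustrative restriction).
    """
    m = n // nb  # number of blocks per dimension
    out = []
    for bi in range(m):
        for bj in range(m):
            for i in range(bi * nb, (bi + 1) * nb):
                row_start = i * n + bj * nb
                out.extend(a[row_start:row_start + nb])
    return out


def from_blocks(x_blk, n, nb):
    """Inverse of to_blocks: rebuild the row-major flat list."""
    m = n // nb
    out = [0.0] * (n * n)
    for bi in range(m):
        for bj in range(m):
            base = (bi * m + bj) * nb * nb
            for i in range(nb):
                for j in range(nb):
                    out[(bi * nb + i) * n + bj * nb + j] = x_blk[base + i * nb + j]
    return out


def blocked_matmul(a_blk, b_blk, n, nb):
    """Compute C = A * B with A, B and C all in block-contiguous storage."""
    m = n // nb
    size = nb * nb
    c_blk = [0.0] * (n * n)
    for bi in range(m):
        for bj in range(m):
            c0 = (bi * m + bj) * size      # start of block C_(bi,bj)
            for bk in range(m):
                a0 = (bi * m + bk) * size  # start of block A_(bi,bk)
                b0 = (bk * m + bj) * size  # start of block B_(bk,bj)
                # Kernel on three nb x nb blocks, each contiguous in memory.
                for i in range(nb):
                    for k in range(nb):
                        aik = a_blk[a0 + i * nb + k]
                        for j in range(nb):
                            c_blk[c0 + i * nb + j] += aik * b_blk[b0 + k * nb + j]
    return c_blk
```

Python is used here only for readability. Any performance benefit from this layout would show up in a compiled language, with `nb` chosen so that the three blocks fit in cache together.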

Bibliographic details

Source: International Conference on Parallel Processing and Applied Mathematics, 2012, 11 pages
Author: Fred G. Gustavson
Original format: PDF
Chinese Library Classification: TP3-53

Similar literature

1. Recursion leads to automatic variable blocking for dense linear-algebra algorithms [J]. IBM Journal of Research and Development. 1997, No. 6
2. Extension of the Thomas Algorithm to a Class of Algebraic Linear Equation Systems Involving Quasi-Block-Tridiagonal Matrices With Isolated Block-Pentadiagonal Rows, Assuming Variable Block Dimensions [J]. L. K. Bieniasz. Computing. 2001, No. 4
3. Integer linear programming model for allocation and migration of data blocks in the STT-RAM-based hybrid caches [J]. Khajekarimi Elyas, Jamshidi Kamal, Vafaei Abbas. Computers & Digital Techniques, IET. 2020, No. 3
4. Cache Blocking for Linear Algebra Algorithms [C]. Fred G. Gustavson. International Conference on Parallel Processing and Applied Mathematics. 2012
5. I-structure software caches: Exploiting global data locality in non-blocking multithreaded architectures [D]. Lin, Wen-Yen. 2000
6. Intelligently deciphering unintelligible designs: algorithmic algebraic model checking in systems biology [O]. Bud Mishra. 2009
7. The linear algebra of block quasi-Newton algorithms [O]. O'Leary Dianne P., Yeremin A. 1994
8. Algorithms for solving scalar and block-cyclic tridiagonal systems of linear algebraic equations [R]. I. M. Navon. 1977
