Guided installation of basic linear algebra routines in a cluster with manycore components

J. Cuenca; L. P. García; D. Giménez; F. J. Herrera

Abstract

Computational systems are nowadays composed of basic computational components that share multiprocessors and coprocessors of different types, typically several graphics processing units (GPUs) or many integrated cores (MICs), and those computational components are combined in heterogeneous clusters of nodes with different characteristics, including coprocessors of different types, with varying numbers of nodes at different speeds. The software previously developed and optimized for simpler systems needs to be redesigned and reoptimized for these new, more complex systems. The adaptation to hybrid multicore + multiGPU and multicore + multiMIC of autotuning techniques for basic linear algebra routines is analyzed. The matrix-matrix multiplication kernel, which is optimized for different computational system components through guided experimentation, is studied. The routine is installed for each node in the cluster, and the information generated from individual installations may be used for a hierarchical installation in a cluster. The basic matrix-matrix multiplication may, in turn, be used inside higher level routines, which delegate their efficient execution to the optimization of the lower level routine. Experimental results are satisfactory in different multicore + multiGPU and multicore + multiMIC systems. Thus, the guided search of execution configurations for satisfactory execution times proves to be a useful tool for heterogeneous systems, where the complexity of the system makes the correct use of highly efficient routines and libraries difficult.
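
As a rough illustration of what a guided search over execution configurations might look like, the sketch below distributes the rows of a matrix-matrix multiplication among one CPU and two GPUs. It starts from a split proportional to each unit's nominal speed and then moves blocks of rows between units while the modeled time improves, rather than testing every possible split. Everything here is an assumption made for illustration: the cost model, the unit parameters (`gflops`, `overhead`), and the function names are hypothetical, and this record does not describe the search procedure the paper actually uses, which relies on measured executions rather than a model.

```python
# Illustrative sketch only: the cost model, the parameters and all names
# below are hypothetical and are not taken from the paper.

def simulated_time(rows, n, gflops, overhead):
    """Modeled time (s) for one unit to multiply a rows x n block by an n x n matrix."""
    if rows == 0:
        return 0.0
    return overhead + 2.0 * rows * n * n / (gflops * 1e9)


def makespan(split, n, units):
    """Time of the slowest unit, assuming all units work concurrently."""
    return max(simulated_time(r, n, u["gflops"], u["overhead"])
               for r, u in zip(split, units))


def guided_search(n, units, step=64):
    """Hill-climb from a speed-proportional split, moving `step` rows at a time."""
    total = sum(u["gflops"] for u in units)
    split = [int(n * u["gflops"] / total) for u in units]
    split[0] += n - sum(split)  # rounding leftovers go to the first unit
    best = makespan(split, n, units)
    improved = True
    while improved:
        improved = False
        for src in range(len(units)):
            for dst in range(len(units)):
                if src == dst or split[src] < step:
                    continue
                cand = list(split)
                cand[src] -= step
                cand[dst] += step
                t = makespan(cand, n, units)
                if t < best:  # accept only strict improvements
                    split, best, improved = cand, t, True
    return split, best


# Hypothetical node: one multicore CPU and two GPUs with a fixed transfer overhead.
units = [
    {"name": "cpu", "gflops": 200.0, "overhead": 0.0},
    {"name": "gpu0", "gflops": 1000.0, "overhead": 0.05},
    {"name": "gpu1", "gflops": 1000.0, "overhead": 0.05},
]
split, t = guided_search(8000, units)
for u, r in zip(units, split):
    print(u["name"], r)
print("modeled time (s):", t)
```

In a real installation the call to `simulated_time` would be replaced by timing an actual kernel execution on each unit, and the per-node results could then feed a second search of the same kind at cluster level.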

Bibliographic record

Source
Concurrency and Computation, 2017, No. 15, pp. 1-14 (14 pages)
Authors
J. Cuenca; L. P. García; D. Giménez; F. J. Herrera
Affiliations

Department of Engineering and Technology of Computers, University of Murcia, Murcia, Spain;

Service of Support to Technological Research, Technical University of Cartagena, Murcia, Spain;

Department of Computing and Systems, University of Murcia, Murcia, Spain;

Department of Computing and Systems, University of Murcia, Murcia, Spain;

Original format: PDF
Text language: English
Keywords
autotuning; heterogeneous computing; hybrid programming; parallel linear algebra; manycore;
