We address some key issues in designing dense linear algebra (DLA) algorithms that are common to both multi/many-core and special-purpose architectures (in particular GPUs). We present them in the context of an LU factorization algorithm in which randomization techniques are used as an alternative to pivoting. This approach yields an algorithm based entirely on a collection of small Level 3 BLAS-type computational tasks, which has emerged as a common goal in designing DLA algorithms for new architectures. Other common trends, also considered here, are block asynchronous task execution and "block" layouts for the data associated with the separate tasks. We present numerical results and other specific experiments with DLA algorithms on NVIDIA GPUs using CUDA. The GPU results are also of interest in themselves, as we show a performance of up to 160 Gflop/s on a single Quadro FX 5600 card.
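To illustrate the structure the abstract refers to, the following is a minimal sketch of a blocked, right-looking LU factorization without pivoting: once pivoting is removed (e.g., after the input has been preconditioned by a randomization step, which is omitted here), the bulk of the work becomes the trailing-matrix update, a Level 3 BLAS (GEMM-like) block task. This is an illustrative NumPy sketch, not the paper's implementation; the function name and block size are our own choices, and the test uses a diagonally dominant matrix so that no pivoting is needed.

```python
import numpy as np

def lu_nopivot_blocked(A, nb=64):
    """Blocked right-looking LU without pivoting (illustrative sketch).

    The trailing update in step 3 is a matrix-matrix (Level 3 BLAS)
    operation on independent blocks -- the kind of small GEMM task the
    abstract describes as the target of the algorithm design.
    """
    A = A.astype(float).copy()
    n = A.shape[0]
    for k in range(0, n, nb):
        e = min(k + nb, n)
        # 1. Unblocked LU of the panel A[k:n, k:e], no row interchanges.
        for j in range(k, e):
            A[j + 1:, j] /= A[j, j]
            A[j + 1:, j + 1:e] -= np.outer(A[j + 1:, j], A[j, j + 1:e])
        # 2. Triangular solve for the U block row: L11 \ A[k:e, e:n].
        L11 = np.tril(A[k:e, k:e], -1) + np.eye(e - k)
        A[k:e, e:] = np.linalg.solve(L11, A[k:e, e:])
        # 3. Level 3 BLAS trailing update (GEMM): the dominant cost.
        A[e:, e:] -= A[e:, k:e] @ A[k:e, e:]
    L = np.tril(A, -1) + np.eye(n)
    U = np.triu(A)
    return L, U
```

In the task-based formulation sketched by the abstract, step 3 would be split into independent tile-sized GEMM tasks (each operating on its own "block"-laid-out data) that can be scheduled asynchronously on GPU or multicore resources.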