机译:FLAME方法:从密集的线性代数算法到高性能的多加速器实现
Departamento de Ingenieria y Ciencia de Computadores, Universidad Jaume I, Campus Riu Sec, 12.071, Castellon, Spain;
Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, United States;
Departamento de Ingenieria y Ciencia de Computadores, Universidad Jaume I, Campus Riu Sec, 12.071, Castellon, Spain;
Departamento de Ingenieria y Ciencia de Computadores, Universidad Jaume I, Campus Riu Sec, 12.071, Castellon, Spain;
Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, United States;
Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, United States;
dense linear algebra libraries; graphics processors; runtime systems; high performance computing;
机译:混合LU和QR分解算法以设计高性能的密集线性代数求解器
机译:使用算法预取来提高线性矩阵的稠密矩阵算法的性能
机译:算法979:稠密线性代数的递归算法-ReLAPACK集合
机译:在连接机CM-5 / CM-5E上有效并行实现密集线性代数算法的选定技术
机译:一种设计和分析线性代数算法的系统方法。
机译:VLSI实现高性能非线性图像缩放算法
机译:FLamE方法:从密集线性代数算法到高性能多加速器实现
机译:在CRaY X-mp-4上使用多任务执行密集线性代数算法(或接近Gigaflop)