Accelerating the Execution of Matrix Languages on the Cell Broadband Engine Architecture

Khoury Raymes; Burgstaller Bernd; Scholz Bernhard

Abstract

Matrix languages, including MATLAB and Octave, are established standards for applications in science and engineering. They provide interactive programming environments that are easy to use due to their script languages with matrix data types. Current implementations of matrix languages do not fully utilize high-performance, special-purpose chip architectures, such as the IBM PowerXCell processor (Cell). We present a new framework that extends Octave to harvest the computational power of the Cell. With this framework, the programmer is alleviated of the burden of introducing explicit notions of parallelism. Instead, the programmer uses a new matrix data type to execute matrix operations in parallel on the synergistic processing elements (SPEs) of the Cell. We employ lazy evaluation semantics for our new matrix data type to obtain execution traces of matrix operations. Traces are converted to data dependence graphs; operations in the data dependence graph are lowered (split into submatrices), scheduled and executed on the SPEs. Thereby, we exploit 1) data parallelism, 2) instruction level parallelism, 3) pipeline parallelism, and 4) task parallelism of matrix language programs. We conducted extensive experiments to show the validity of our approach. Our Cell-based implementation achieves speedups of up to a factor of 12 over code run on recent Intel Core2 Quad processors.
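The trace, dependence-graph, and lowering steps described in the abstract can be illustrated with a small sketch. This is not the authors' Octave/Cell implementation: `LazyMatrix`, `trace`, `lower`, `run_block`, and `evaluate` are hypothetical names, only elementwise operations are supported, and the row blocks run sequentially in one Python process where the paper would dispatch them to SPEs.

```python
class LazyMatrix:
    """Hypothetical lazy matrix type: operators record work instead of doing it."""

    def __init__(self, rows=None, op="const", deps=()):
        self.op = op        # "const", "add", or "mul"
        self.deps = deps    # operands this operation depends on
        self.rows = rows    # list of row lists; None until evaluated

    def __add__(self, other):
        return LazyMatrix(op="add", deps=(self, other))

    def __mul__(self, other):  # elementwise product, to keep the sketch short
        return LazyMatrix(op="mul", deps=(self, other))


def trace(root):
    """Return recorded operations in dependence order (post-order DFS)."""
    order, seen = [], set()

    def visit(node):
        if id(node) in seen:
            return
        seen.add(id(node))
        for dep in node.deps:
            visit(dep)
        order.append(node)

    visit(root)
    return order


def lower(node, block_rows):
    """Split one elementwise operation into independent row-block tasks."""
    n = len(node.deps[0].rows)
    return [(node.op, start, min(start + block_rows, n))
            for start in range(0, n, block_rows)]


def run_block(op, a, b, start, stop):
    """The work one worker would do on a single submatrix (row block)."""
    f = (lambda x, y: x + y) if op == "add" else (lambda x, y: x * y)
    return [[f(x, y) for x, y in zip(ra, rb)]
            for ra, rb in zip(a[start:stop], b[start:stop])]


def evaluate(root, block_rows=2):
    """Force evaluation: walk the trace, lower each operation, run its blocks."""
    tasks_run = 0
    for node in trace(root):
        if node.rows is not None:  # constant, or already computed
            continue
        a, b = (dep.rows for dep in node.deps)
        node.rows = []
        for op, start, stop in lower(node, block_rows):
            node.rows.extend(run_block(op, a, b, start, stop))
            tasks_run += 1
    return root.rows, tasks_run


A = LazyMatrix([[1, 2], [3, 4], [5, 6]])
B = LazyMatrix([[10, 20], [30, 40], [50, 60]])
C = (A + B) * A                   # only records the operations
result, tasks = evaluate(C)       # evaluation happens here
print(result, tasks)
```

Because the blocks produced by `lower` touch disjoint rows, they are independent of each other, which is the property a scheduler would need in order to run them in parallel.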

Bibliographic details

Source: Parallel and Distributed Systems, IEEE Transactions on | 2011, Issue 1 | pp. 7-21 | 15 pages
Authors: Khoury Raymes; Burgstaller Bernd; Scholz Bernhard
Original format: PDF
Language: English
Keywords: Cell Broadband Engine architecture; Programming languages; data partitioning; lazy evaluation; math script languages; scheduling

Similar literature

1. Accelerating 3D nonrigid registration using the Cell Broadband Engine processor [J]. IBM Journal of Research and Development. 2009, no. 5
2. Searching for New Convolutional Codes using the Cell Broadband Engine Architecture [J]. Johnsson Daniel, Bjarkeson Fredrik, Hell Martin. Communications Letters, IEEE. 2011, no. 5
3. Assessment of the Cell Broadband Engine Architecture as a platform to solve closed-loop optimal control problems [J]. Andrzej Karbowski, Maciej Remiszewski. Parallel Computing. 2010, no. 4
4. Adaptation of Double-Precision Matrix Multiplication to the Cell Broadband Engine Architecture [C]. Krzysztof Rojek, Lukasz Szustak. International conference on parallel processing and applied mathematics; PPAM 2010. 2010
5. Extracellular Matrix Architecture and Biomechanics of 3D Engineered Microtissues [D]. Bose, Prasenjit. 2018
6. Perfusion Decellularization of Extrahepatic Bile Duct Allows Tissue-Engineered Scaffold Generation by Preserving Matrix Architecture and Cytocompatibility [O]. Yolik Ramírez-Marín, David Eduardo Abad-Contreras, Martha Ustarroz-Cano. 2021
7. Accelerating the Execution of Matrix Languages on the Cell Broadband Engine Architecture [O]. Khoury, Raymes; Burgstaller, Bernd; Scholz, Bernhard. 2009
