Effective utilization of symmetric shared-memory multiprocessors (SMPs) is predicated on the development of efficient parallel code. Unfortunately, efficient parallelism is not always easy for the programmer to identify. Worse, exploiting such parallelism may directly conflict with optimizations affecting per-processor utilization (e.g., loop reordering to improve data locality). Here, we present our experience with a loop-level parallel compiler optimization for SMPs proposed by McKinley [6]. The algorithm uses dependence analysis and a simple model of the target machine to transform nested loops. The goal of the approach is to promote efficient execution of parallel loops by exposing sources of large-grain parallel work while maintaining per-processor locality. We implement the optimization within the Scale compiler framework, and analyze the performance of the multiprocessor code produced for three microbenchmarks.