An Automatic Parallel-Stage Decoupled Software Pipelining Parallelization Algorithm Based on OpenMP

机译：基于OpenMP的自动并行解耦软件流水线并行化算法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

While multicore processors increase throughput for multi-programmed and multithreaded codes, many important applications are single threaded and thus do not benefit. Automatic parallelization techniques play an important role in migrating singe threaded applications to multicore platforms. Unfortunately, the prevalence of control flow, recursive data structures, and general pointer accesses in ordinary programs renders the traditional automatic parallelization techniques unsuitable. Parallel-Stage Decoupled Software Pipelining (PS-DSWP) is proposed to exploit fine-grained pipeline parallelism lurking in ordinary programs with the existence of all kinds of dependences, including arbitrary control dependences, at the instruction level. But it requires knowledge of architectural properties and hardware support of a communication channel and two special instructions. We propose an improved PS-DSWP algorithm based on OpenMP in this paper. It is implemented without relying on CPU architectures by using a high level intermediate representation. Moreover, the Program Dependence Graph (PDG) used in the algorithm is built based on the basic blocks, which exploits coarser-grained parallelism than the original PS-DSWP transformation with PDG based on instructions. OpenMP is employed in our algorithm to assign task and implement synchronization among threads while avoiding dependence on hardware support. We evaluate the loops with complex memory patterns and control flow, which cannot be dealt with by traditional techniques, on multicore platform. As a result, they can be parallelized and gain significant performance improvement with our algorithm. We obtain a maximum speedup as high as 2.07x and on average 1.39x with 5 threads.

机译：尽管多核处理器提高了多程序和多线程代码的吞吐量，但是许多重要的应用程序都是单线程的，因此没有好处。自动并行化技术在将单线程应用程序迁移到多核平台中起着重要作用。不幸的是，普通程序中普遍存在控制流，递归数据结构和通用指针访问，这使得传统的自动并行化技术不合适。提出了并行级解耦软件流水线（PS-DSWP），以利用普通程序中潜伏的细粒度流水线并行性，在指令级别上存在各种依赖关系，包括任意控制依赖关系。但是，它需要有关通信通道的体系结构属性和硬件支持的知识以及两个特殊说明。本文提出了一种基于OpenMP的改进PS-DSWP算法。通过使用高级中间表示，可以在不依赖CPU架构的情况下实现它。此外，该算法中使用的程序依赖图（PDG）是基于基本块构建的，与基于指令的带有PDG的原始PS-DSWP转换相比，它利用了更粗糙的并行性。在我们的算法中使用OpenMP来分配任务并实现线程之间的同步，同时避免依赖硬件支持。我们在多核平台上用复杂的内存模式和控制流评估循环，而这是传统技术无法处理的。结果，它们可以并行化，并通过我们的算法获得显着的性能改进。我们通过5个线程获得了高达2.07倍的平均加速比和平均1.39倍的加速比。

著录项

来源
《2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications》|2013年|1825-1831|共7页
会议地点 Melbourne(AU)
作者
Liu Xiaoxian; Zhao Rongcai; Han Lin; Liu Peng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
OpenMP; automatic parallelization; parallel-stage decoupled software pipelining;

机译：OpenMP;自动并行化;并行解耦软件流水线;;

相似文献

外文文献
中文文献
专利

1. Automatic Parallelization of Simulation Code for Equation-based Models with Software Pipelining and Measurements on Three Platforms [J] . Hakan Lundvall, Kristian Stavaker, Peter Fritzson, Computer architecture news . 2008,第5期

机译：在三个平台上通过软件流水线和测量对基于方程的模型的仿真代码进行自动并行化
2. Extending decoupled software pipeline to parallelize Java programs [J] . Andre Loureiro, Joao Paulo Porto, Guido Araujo Software . 2013,第5期

机译：扩展解耦的软件管道以并行化Java程序
3. Parallelization of Needleman-Wunsch Algorithm Based on Software Pipelining [J] . Hanwen Hu, Zhenzhou Ji International Journal of Engineering and Manufacturing(IJEM) . 2011,第4期

机译：基于软件流水线的Needleman-Wunsch算法并行化
4. An Automatic Parallel-Stage Decoupled Software Pipelining Parallelization Algorithm Based on OpenMP [C] . Liu Xiaoxian, Zhao Rongcai, Han Lin, IEEE International Conference on Trust, Security and Privacy in Computing and Communications . 2013

机译：基于OpenMP的自动并行级解耦软件流水线平行化算法
5. MICROCOMPUTER BASED AUTOMATIC TRUCK DISPATCHING - SYSTEM MODELING AND SIMULATION (MINING, SOFTWARE, ALGORITHM, OPEN-PIT) [D] . KOLB, WILLIAM EDWARD 1986

机译：基于微计算机的自动卡车调度-系统建模和仿真（采矿，软件，算法，露天开采）
6. Impacts of right ventricular trabeculae and papillary muscles on volumes and function assessed by cardiovascular magnetic resonance using a novel software: semi-automatic threshold-based segmentation algorithm [O] . Akio Inage, Naokazu Mizuno 2015

机译：使用新型软件：半自动基于阈值的分割算法通过心血管磁共振评估右室小梁和乳头肌对容量和功能的影响
7. Advances in Parallel-Stage Decoupled Software Pipelining Leveraging Loop Distribution, Stream-Computing and the SSA Form [O] . Li Feng, Pop Antoniu, Cohen Albert 2011

机译：利用循环分配，流计算和SSA形式的并行解耦软件流水线技术的进步

An Automatic Parallel-Stage Decoupled Software Pipelining Parallelization Algorithm Based on OpenMP

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅