Data-flow analysis and optimization for data coherence in heterogeneous architectures

Sousa Rafael; Pereira Marcio; Quintao Pereira Fernando Magno; Araujo Guido

首页> 外文期刊>Journal of Parallel and Distributed Computing >Data-flow analysis and optimization for data coherence in heterogeneous architectures

【24h】

Data-flow analysis and optimization for data coherence in heterogeneous architectures

机译：异构架构中数据一致性的数据流分析与优化

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Although heterogeneous computing has enabled developers to achieve impressive program speed-ups, the cost of moving and keeping data coherent between host and device may easily eliminate any performance gains achieved by acceleration. To deal with this problem, this paper introduces DCA: a pair of two data-flow analyses that determine how variables are used by host/device at each program point. It also introduces DCO, a code optimization technique that uses DCA information to: (a) allocate OpenCL shared buffers between host and devices; and (b) insert appropriate OpenCL function calls into program points so as to minimize the number of data coherence operations. We have used the AClang compiler to measure the impact of DCA and DCO when generating code from Parboil, Polybench and Rodinia benchmarks for a set of discrete/integrated CPUs. The experimental results showed speed-ups of up to 5.25x (average of 1.39x) on an ARM Mali-T880 and up to 8.87x (average of 1.66x) on an NVIDIA GPU Pascal Titan X. (C) 2019 Elsevier Inc. All rights reserved.

机译：尽管异构计算使开发人员能够实现令人印象深刻的程序加速，但是在主机和设备之间移动和保持数据相干的成本可能很容易消除通过加速度实现的任何性能增益。要处理此问题，本文介绍了DCA：一对数据流分析，确定每个程序点的主机/设备如何使用变量。它还介绍了DCO，一种代码优化技术，它使用DCA信息：（a）在主机和设备之间分配OpenCL共享缓冲区; （b）将适当的OpenCL函数调用插入程序点，以最小化数据相干操作的数量。我们使用了ACLANG编译器来测量DCA和DCO的影响，当一组离散/集成CPU的帕押，PolyBench和Rodinia基准测试代码时。实验结果表明，在NVIDIA GPU Pascal Titan X.（C）2019年Elsevier Inc.的速度下，速度高达5.25倍（平均1.39倍），高达8.87倍（平均为1.66倍）版权所有。

著录项

来源
《Journal of Parallel and Distributed Computing》 |2019年第8期|126-139|共14页
作者
Sousa Rafael; Pereira Marcio; Quintao Pereira Fernando Magno; Araujo Guido;
展开▼
作者单位

Univ Estadual Campinas UNICAMP Inst Comp Campinas SP Brazil;

Univ Estadual Campinas UNICAMP Inst Comp Campinas SP Brazil;

Univ Minas Gerais Dept Comp Sci UFMG Belo Horizonte MG Brazil;

Univ Estadual Campinas UNICAMP Inst Comp Campinas SP Brazil;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Compilers; Data coherence; Heterogeneous architectures;

机译：编译器;数据一致性;异构架构;

相似文献

外文文献
中文文献
专利

1. Data-flow analysis and optimization for data coherence in heterogeneous architectures [J] . Sousa Rafael, Pereira Marcio, Quintao Pereira Fernando Magno, Journal of Parallel and Distributed Computing . 2019,第AUGa期

机译：异构架构中数据流的分析和数据一致性优化
2. Compiler analysis for cache coherence: interprocedural array data-flow analysis and its impact on cache performance [J] . Choi L., Pen-Chung Yew IEEE Transactions on Parallel and Distributed Systems . 2000,第9期

机译：缓存一致性的编译器分析：过程间数组数据流分析及其对缓存性能的影响
3. Efficient Telescopic Search Motion-Estimation Architecture Based on Data-Flow Optimization [J] . Wujian Zhang, Runde Zhou, Tsunehachi Ishitani IEICE Transactions on Electronics . 2001,第3期

机译：基于数据流优化的高效伸缩搜索运动估计架构
4. XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures [C] . Gautier Thierry, Lima Joao V.F., Maillard Nicolas, IEEE International Parallel Distributed Processing Symposium . 2013

机译：XKaapi：用于异构体系结构上的数据流任务编程的运行时系统
5. Leveraging Data-Flow Information for Efficient Scheduling of Task-Parallel Programs on Heterogeneous Systems [D] . Simsek, Osman Seckin. 2020

机译：利用数据流信息，以便有效调度异构系统上的任务并行程序
6. Anomaly Detection Based Latency-Aware Energy Consumption Optimization For IoT Data-Flow Services [O] . Yuansheng Luo, Wenjia Li, Shi Qiu 2020

机译：基于异常检测的物联网数据流服务的延迟感知能耗优化
7. XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures [O] . Thierry Gautier, João V. F. Lima, Nicolas Maillard, 2013

机译：XKaapi：用于异构架构上的数据流任务编程的运行时系统
8. Multi-level data-flow architecture for signal and data processing applications. Final report. [R] . Gaudiot, J. L. 1993

机译：用于信号和数据处理应用的多级数据流架构。总结报告。

Data-flow analysis and optimization for data coherence in heterogeneous architectures

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅