Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with IntelⓇ Xeon Phi™

Jeremias Gomes; Alba C. M. A. de Melo; Jun Kong; Tahsin Kurc; Joel H. Saltz; George Teodoro

首页> 外文期刊>Concurrency, practice and experience >Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with IntelⓇ Xeon Phi™

【24h】

Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with IntelⓇ Xeon Phi™

机译：在具有Intel®Xeon Phi™的混合计算机上，不规则波前传播模式的协作和核心外执行

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The Irregular Wavefront Propagation Pattern (IWPP) is a core computing structure in severalrnimage analysis operations. Efficient implementation of IWPP on the Intel Xeon Phi is difficultrnbecause of the irregular data access and computation characteristics. The traditional IWPPrnalgorithm relies on atomic instructions, which are not available in the SIMD set of the Intel Phi.rnTo overcome this limitation, we have proposed a new IWPP algorithm that can take advantagernof non-atomic SIMD instructions supported on the Intel Xeon Phi. We have also developed andrnevaluated methods to useCPUand IntelPhi cooperatively for parallel execution of theIWPPalgorithms.rnOur new cooperative IWPP version is also able to handle large out-of-core images thatrnwould not fit into the memory of the accelerator. The new IWPP algorithm is used to implementrnthe Morphological Reconstruction and Fill Holes operations, which are operations commonlyrnfound in image analysis applications. The vectorization implemented with the new IWPP hasrnattained improvements of up to about 5×on top of the original IWPPand significant gains as comparedrnto state-of-the-art the CPU and GPU versions. The new version running on an Intel Phi isrn6.21× and 3.14× faster than running on a 16-core CPU and on a GPU, respectively. Finally, therncooperative execution using two Intel Phi devices and a multi-coreCPUhas reached performancerngains of 2.14× as compared to the execution using a single Intel Xeon Phi.

机译：不规则波前传播模式（IWPP）是几种图像分析操作中的核心计算结构。由于不规则的数据访问和计算特性，很难在Intel Xeon Phi上高效实施IWPP。传统的IWPPrn算法依赖于Intel Phi的SIMD集中没有的原子指令。为克服此限制，我们提出了一种新的IWPP算法，该算法可以利用Intel Xeon Phi支持的非原子SIMD指令。我们还开发并重新评估了将CPU和Intel Phi协同使用以并行执行IWPP算法的方法。我们的新的IWPP协同版本也能够处理加速器内存中无法容纳的大型核外图像。新的IWPP算法用于实现形态重建和填充孔操作，这些操作是图像分析应用程序中常见的操作。与最新的CPU和GPU版本相比，使用新的IWPP实现的矢量化已在原始IWPP的基础上进行了多达5倍的改进，并获得了可观的收益。在Intel Phi上运行的新版本分别比在16核CPU和GPU上运行的速度快6.21倍和3.14倍。最终，与使用单个Intel Xeon Phi的执行相比，使用两个Intel Phi设备和多核CPU的合作执行的性能收益达到了2.14倍。

著录项

来源
《Concurrency, practice and experience》 |2018年第14期|e4425.1-e4425.20|共20页
作者
Jeremias Gomes; Alba C. M. A. de Melo; Jun Kong; Tahsin Kurc; Joel H. Saltz; George Teodoro;
展开▼
作者单位

Department of Computer Science, Universityof Brasília, Brasília-DF, Brazil;

Department of Computer Science, Universityof Brasília, Brasília-DF, Brazil;

Department of Biomedical Informatics, EmoryUniversity, Atlanta, GA, USA;

Department of Biomedical Informatics, StonyBrook University, Stony Brook, NY, USA;

Department of Biomedical Informatics, StonyBrook University, Stony Brook, NY, USA;

Department of Computer Science, Universityof Brasília, Brasília-DF, Brazil Department of Biomedical Informatics, StonyBrook University, Stony Brook, NY, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Fill Holes, IntelⓇ Xeon Phi™, Irregular Algorithm Propagation Pattern, Morphological Reconstruction;

机译：填充孔;英特尔®至强融核™;不规则算法传播模式;形态重建;

相似文献

外文文献
中文文献
专利

1. High-level Support for Hybrid Parallel Execution of C++ Applications Targeting Intel? Xeon Phi? Coprocessors [J] . Jiri Dokulil, Enes Bajrovic, Siegfried Benkner, Procedia Computer Science . 2013,第1期

机译：针对Intel的C ++应用程序的混合并行执行的高级支持？至强皮协处理器
2. Benchmarking Performance of a Hybrid Intel Xeon/Xeon Phi System for Parallel Computation of Similarity Measures Between Large Vectors [J] . Pawel Czarnul International journal of parallel programming . 2017,第5期

机译：大向量之间相似性度量的并行计算的混合英特尔至强/至强融核系统的基准性能
3. Asynchronous and synchronous models of executions on Intel~® Xeon Phi~(TM) coprocessor systems for high performance of long wave radiation calculations in atmosphere models [J] . Amlesh Kashyap, Sathish S. Vadhiyar, Ravi S. Nanjundiah, Journal of Parallel and Distributed Computing . 2017,第Apra期

机译：Intel〜Xeon Phi〜（TM）协处理器系统的异步和同步模型，用于大气模型的长波辐射计算高性能
4. Efficient Irregular Wavefront Propagation Algorithms on Intel(R) Xeon Phi(TM) [C] . Jeremias M. Gomes, George Teodoro, Alba De Melo, IEEE International Symposium on Computer Architecture and High Performance Computing . 2015

机译：英特尔®至强融核™的高效不规则波前传播算法
5. An Analysis of Variation Between Cores for Intel Xeon Phi Knights Corner and Xeon Phi Knights Landing. [D] . Robinson, Jamar. 2017

机译：英特尔至强披披骑士角和至强披披骑士登陆的内核之间的差异分析。
6. Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with IntelⓇ Xeon Phi™ [O] . Jeremias Gomes, Alba C. M. A. de Melo, Jun Kong, -1

机译：在具有Intel®Xeon Phi™的混合计算机上不规则波前传播模式的协作和核心外执行
7. Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with Intel®Xeon Phi™ [O] . Jeremias Gomes, Alba C. M. A. de Melo, Jun Kong, 2018

机译：用英特尔®XEONPHI™对混合机器上的不规则波前传播模式的合作和核心执行

Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with IntelⓇ Xeon Phi™

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅