KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs

Rościszewski Paweł; Czarnul Paweł; Lewandowski Rafał; Schally‐Kacprzak Marcel

首页> 外文期刊>Concurrency and computation: practice and experience >KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs

【24h】

KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs

机译：KernelHive：基于工作流的新框架，用于使用具有CPU和GPU的集群和工作站进行多层高性能计算

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper presents a new open-source framework called KernelHive for multilevel parallelization of computations among various clusters, cluster nodes, and finally, among both CPUs and GPUs for a particular application. An application is modeled as an acyclic directed graph with a possibility to run nodes in parallel and automatic expansion of nodes (called node unrolling) depending on the number of computation units available. A methodology is proposed for parallelization and mapping of an application to the environment that includes selection of devices using a chosen optimizer, selection of best grid configurations for compute devices, optimization of data partitioning and the execution. One of possibly many scheduling algorithms can be selected considering execution time, power consumption, and so on. An easy-to-use GUI is provided for modeling and monitoring with a repository of ready-to-use constructs and computational kernels. The methodology, execution times, and scalability have been demonstrated for a distributed and parallel password-breaking example run in a heterogeneous environment with a cluster and servers with different numbers of nodes and both CPUs and GPUs. Additionally, performance of the framework has been compared with an MPI + OpenCL implementation using a parallel geospatial interpolation application employing up to 40 cluster nodes and 320 cores. Copyright © 2015 John Wiley & Sons, Ltd.

机译：本文提出了一个称为KernelHive的新开源框架，用于在各种集群，集群节点之间以及最终在特定应用的CPU和GPU之间对计算进行多级并行化。应用程序被建模为非循环有向图，并可能根据可用的计算单元数量并行运行节点并自动扩展节点（称为节点展开）。提出了一种用于将应用程序并行化和映射到环境的方法，该方法包括使用选定的优化器选择设备，选择计算设备的最佳网格配置，数据分区和执行的优化。考虑执行时间，功耗等，可以选择许多调度算法之一。提供了一个易于使用的GUI，用于使用现成的结构和计算内核的存储库进行建模和监视。已针对在异构环境中运行的分布式并行密码破解示例演示了方法，执行时间和可伸缩性，该异构环境具有群集和具有不同数量节点以及CPU和GPU的服务器。此外，该框架的性能已与使用并行地理空间插值应用程序的MPI + OpenCL实施进行了比较，该应用程序采用了多达40个群集节点和320个核心。版权所有©2015 John Wiley＆Sons，Ltd.

著录项

来源
《Concurrency and computation: practice and experience》 |2016年第9期|2586-2607|共22页
作者
Rościszewski Paweł; Czarnul Paweł; Lewandowski Rafał; Schally‐Kacprzak Marcel;
展开▼
作者单位

Gdansk University of Technology Department of Computer Architecture Faculty of Electronics Telecommunications and Informatics Gdansk Poland;

Gdansk University of Technology Department of Computer Architecture Faculty of Electronics Telecommunications and Informatics Gdansk Poland;

Gdansk University of Technology Department of Computer Architecture Faculty of Electronics Telecommunications and Informatics Gdansk Poland;

Gdansk University of Technology Department of Computer Architecture Faculty of Electronics Telecommunications and Informatics Gdansk Poland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
high performance computing; GPGPU; cluster computing; multilevel parallelization; heterogeneous system; high level framework;

机译：高性能计算;GPGPU;集群计算;多层并行化;异构系统;高层框架;

相似文献

外文文献
中文文献
专利

1. Accelerated hyperspectral image recursive hierarchical segmentation using GPUs, multicore CPUs, and hybrid CPU/GPU cluster [J] . Hossam M. A., Ebied H. M., Abdel-Aziz M. H., Journal of Real-Time Image Processing . 2018,第2期

机译：使用GPU，多核CPU和混合CPU / GPU集群的加速高光谱图像递归分层分段
2. CPU/GPU computing for long-wave radiation physics on large GPU clusters [J] . Fengshun Lu, Junqiang Song, Xiaoqun Cao, Computers & geosciences . 2012,第期

机译：大型GPU群集上用于长波辐射物理的CPU / GPU计算
3. Performance analysis of SSE and AYX instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems [J] . Jorge Frances, Sergio Bleda, Andres Marquez, Journal of supercomputing . 2014,第2期

机译：多核CPU中SSE和AYX指令的性能分析以及基于FDTD方案的GPU计算的固体和流体振动问题
4. Lit: A high performance massive data computing framework based on CPU/GPU cluster [C] . Zhai Yanlong, Mbarushimana Emmanuel, Li Wei, IEEE International Conference on Cluster Computing . 2013

机译：点亮：基于CPU / GPU集群的高性能海量数据计算框架
5. Hierarchical scheduling and uniform access programming frameworks for heterogeneous CPU-GPU computing clusters [D] . Sajjapongse, Kittisak. 2015

机译：异构CPU-GPU计算集群的分层调度和统一访问编程框架
6. Application Performance Analysis and Efficient Execution on Systems with multi-core CPUs GPUs and MICs: A Case Study with Microscopy Image Analysis [O] . George Teodoro, Tahsin Kurc, Guilherme Andrade, -1

机译：具有多核CPUGPU和MIC的系统上的应用程序性能分析和高效执行：以显微镜图像分析为例
7. Heterogeneous Gpu&Cpu Cluster For High Performance Computing In Cryptography [O] . Michał Marks, Jaroslaw Jantura, Ewa Niewiadomska-Szynkiewicz, 2012

机译：用于密码学中高性能计算的异构Gpu和Cpu集群

KernelHive: a new workflow-based framework for multilevel high performance computing using clusters and workstations with CPUs and GPUs

摘要

著录项

相似文献

相关主题

期刊订阅