首页> 外文会议>International Symposium on Computer Architecture and High Performance Computing >Optimizing a 3D-FWT Code in a Heterogeneous Cluster of Multicore CPUs and Manycore GPUs
【24h】

Optimizing a 3D-FWT Code in a Heterogeneous Cluster of Multicore CPUs and Manycore GPUs

机译:在多核CPU和Manycore GPU的异构集群中优化3D-FWT代码

获取原文

摘要

Clusters of nodes composed of many core GPUs and multicore CPUs are used to solve scientific problems with high computational requirements. The development and optimization of parallel-heterogeneous codes for these systems is a complex task which requires a deep knowledge of the different components of the hybrid, heterogeneous and hierarchical computational system, and also of the scientific problem to be solved and the different programing paradigms to be used for its efficient solution. Techniques for efficient development and optimization of scientific codes for these systems are needed. This paper presents an analysis of the development and optimization of the 3D-Fast Wavelet Transform (3D-FWT) for a heterogeneous cluster of multicores+GPUs. Different parallel programming paradigms (message passing, shared memory and SIMD GPU) are combined to fully exploit the computing capacity of the different computational elements of the cluster, so resulting in an efficient combination of basic codes developed previously for individual components (individual nodes, multicore or GPU) and an important reduction of the compression time of long video sequences.
机译:由许多核心GPU和多核CPU组成的节点群集用于解决对计算量有较高要求的科学问题。这些系统的并行异构代码的开发和优化是一项复杂的任务,需要对混合,异构和分层计算系统的不同组件以及要解决的科学问题和不同的编程范例有深入的了解。用于其有效的解决方案。需要有效开发和优化这些系统科学代码的技术。本文介绍了针对多核+ GPU的异构集群的3D快速小波变换(3D-FWT)的开发和优化分析。组合了不同的并行编程范例(消息传递,共享内存和SIMD GPU),以充分利用集群中不同计算元素的计算能力,因此可以有效地组合先前为各个组件(单个节点,多核)开发的基本代码或GPU),并大大减少了长视频序列的压缩时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号