IEEE High Performance Extreme Computing Conference

Exploiting GPU Direct Access to Non-Volatile Memory to Accelerate Big Data Processing



Abstract

The amount of data collected for analysis is growing at an exponential rate, and with this growth comes an increasing need for computation and storage. Researchers are addressing these needs by building heterogeneous clusters that pair CPUs with computational accelerators such as GPUs and equip them with high-I/O-bandwidth storage devices. One of the main bottlenecks of such heterogeneous systems is the data transfer bandwidth to GPUs when running I/O-intensive applications. The traditional approach moves data from storage to host memory and then transfers it to the GPU, which can limit data throughput and processing and thus degrade end-to-end performance. In this paper, we propose a new framework that addresses this issue by exploiting Peer-to-Peer Direct Memory Access to let the GPU access the storage device directly, thereby improving the performance of parallel data processing applications on a heterogeneous big-data platform. Our heterogeneous cluster provides CPUs and GPUs as computing resources and Non-Volatile Memory Express (NVMe) drives as storage resources. We deploy an Apache Spark platform to execute representative data processing workloads on this cluster and then adopt Peer-to-Peer Direct Memory Access to connect GPUs directly to non-volatile storage, optimizing GPU data access. Experimental results reveal that this heterogeneous Spark platform successfully bypasses host memory and enables GPUs to communicate directly with the NVMe drive, achieving higher data transfer throughput and improving both data communication time and end-to-end performance by 20%.
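The direct NVMe-to-GPU data path the abstract describes can be illustrated with NVIDIA's cuFile API (GPUDirect Storage), which implements the same peer-to-peer idea: the DMA engine moves file data straight into GPU memory without a host bounce buffer. This is a minimal sketch for illustration only; the paper's own framework may use a different P2P DMA mechanism, and the file path is a placeholder. Error handling is abbreviated.

```cuda
// Sketch: read from an NVMe-backed file directly into GPU memory,
// bypassing host memory, via the cuFile (GPUDirect Storage) API.
#include <fcntl.h>
#include <unistd.h>
#include <cuda_runtime.h>
#include <cufile.h>

int main(void) {
    const size_t size = 1 << 20;                 /* 1 MiB read */
    int fd = open("/path/to/data.bin", O_RDONLY | O_DIRECT);

    cuFileDriverOpen();                          /* initialize GDS driver */

    CUfileDescr_t descr = {0};
    descr.handle.fd = fd;
    descr.type = CU_FILE_HANDLE_TYPE_OPAQUE_FD;
    CUfileHandle_t fh;
    cuFileHandleRegister(&fh, &descr);           /* register the file */

    void *devPtr;
    cudaMalloc(&devPtr, size);
    cuFileBufRegister(devPtr, size, 0);          /* pin GPU buffer for DMA */

    /* DMA straight from the NVMe drive into GPU memory --
       the host-memory bounce buffer of the traditional path is skipped */
    ssize_t n = cuFileRead(fh, devPtr, size,
                           /*file_offset=*/0, /*devPtr_offset=*/0);

    cuFileBufDeregister(devPtr);
    cuFileHandleDeregister(fh);
    cudaFree(devPtr);
    cuFileDriverClose();
    close(fd);
    return n == (ssize_t)size ? 0 : 1;
}
```

With the traditional path, the same read would require a `pread` into pinned host memory followed by a `cudaMemcpy` to the device; eliminating that intermediate copy is what yields the throughput and end-to-end gains reported above.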
