Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints

机译：通过基于库的MPI端点实现实现高效的多线程MPI通信

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Modern high-speed interconnection networks are designed with capabilities to support communication from multiple processor cores. The MPI endpoints extension has been proposed to ease process and thread count tradeoffs by enabling multithreaded MPI applications to efficiently drive independent network communication. In this work, we present the first implementation of the MPI endpoints interface and demonstrate the first applications running on this new interface. We use a novel library-based design that can be layered on top of any existing, production MPI implementation. Our approach uses proxy processes to isolate threads in an MPI job, eliminating threading overheads within the MPI library and allowing threads to achieve process-like communication performance. We evaluate the performance advantages of our implementation through several benchmarks and kernels. Performance results for the Lattice QCD Dslash kernel indicate that endpoints provides up to 2.9× improvement in communication performance and 1.87× overall performance improvement over a highly optimized hybrid MPI+OpenMP baseline on 128 processors.

机译：现代高速互连网络具有支持来自多个处理器内核的通信的功能。已经提出了MPI端点扩展，以通过使多线程MPI应用程序有效地驱动独立的网络通信来减轻进程和线程数的折衷。在这项工作中，我们展示了MPI端点接口的第一个实现，并演示了在此新接口上运行的第一个应用程序。我们使用一种新颖的基于库的设计，该设计可以分层放置在任何现有的生产MPI实施之上。我们的方法使用代理进程来隔离MPI作业中的线程，从而消除了MPI库中的线程开销，并允许线程实现类似于进程的通信性能。我们通过几个基准和内核评估了实现的性能优势。 Lattice QCD Dslash内核的性能结果表明，与128个处理器上的高度优化的混合MPI + OpenMP基准相比，端点可将通信性能提高2.9倍，将整体性能提高1.87倍。

著录项

来源
《International Conference for High Performance Computing, Networking, Storage and Analysis》|2014年|487-498|共12页
会议地点
作者
Sridharan Sridha; Dinan James; Kalamkar Dhiraj D.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
application program interfaces; message passing; multi-threading; multiprocessing systems; multiprocessor interconnection networks; software libraries; MPI endpoints extension; MPI endpoints interface; MPI job; MPI library; MPI+OpenMP baseline; high-speed interconnection network; independent network communication; lattice QCD Dslash kernel; library-based design; library-based implementation; multiple processor core; multithreaded MPI application; multithreaded MPI communication; performance evaluation; process-like communication performance; production MPI implementation; thread count tradeoff; threading overhead; Arrays; Context; Kernel; Libraries; Message systems; Parallel programming; Semantics; Endpoints; Hybrid Parallel Programming; MPI;

机译：应用程序接口;消息传递;多线程;多处理系统;多处理器互连网络;软件库; MPI端点扩展; MPI端点接口; MPI作业; MPI库; MPI + OpenMP基线;高速互连网络;独立网络通信;点阵QCD Dslash内核;基于库的设计;基于库的实现;多处理器内核;多线程MPI应用程序;多线程MPI通信;性能评估;类过程通信性能;生产MPI实现;线程计数权衡;线程开销;线程;数组;上下文;内核;库;消息系统;并行编程;语义;端点;混合并行编程; MPI;

相似文献

外文文献
中文文献
专利

1. Enabling efficient multithreaded MPI communication through a library-based implementation of MPI endpoints [J] . Khaled Hamidouche Computing reviews . 2015,第6期

机译：通过基于库的MPI端点实现启用高效的多线程MPI通信
2. Enabling communication concurrency through flexible MPI endpoints [J] . James Dinan, Ryan E Grant, Pavan Balaji, Experimental Mechanics . 2014,第4期

机译：通过灵活的MPI端点启用通信并发
3. Open MPI: A High Performance, Flexible Implementation of MPI Point-to-Point Communications [J] . Richard L. Graham, Brian W. Barrett, Galen M. Shipman, Parallel Processing Letters . 2007,第1期

机译：开放式MPI：MPI点对点通信的高性能，灵活实现
4. Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems [C] . Gabor Dozsa, Sameer Kumar, Pavan Balaji, Recent advances in the message passing interface . 2010

机译：在多核Petascale系统上启用并发多线程MPI通信
5. Enabling Efficient Use of MPI and PGAS Programming Models on Heterogeneous Clusters with High Performance Interconnects. [D] . Potluri, Sreeram. 2014

机译：在具有高性能互连的异构集群上有效使用MPI和PGAS编程模型。
6. Enabling Efficient Communications with Resource Constrained Information Endpoints in Smart Homes [O] . Diego Sánchez-de-Rivera, Borja Bordel, Ramón Alcarria, 2019

机译：在资源有限的智能家居中通过资源受限的信息端点实现高效通信
7. Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems [O] . Gábor Dózsa, Sameer Kumar, Pavan Balaji, 2011

机译：在多核Petascale系统上启用并发多线程MPI通信

Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints

摘要

著录项

相似文献

相关主题

期刊订阅