Affinity-aware optimization of multithreaded two-phase I/O for high throughput collective I/O

机译：用于高吞吐量集体I / O的多线程两相I / O的亲和感知优化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Collective MPI-IO for non-contiguous accesses has been playing a big role in not only direct MPI-IO API calls but also scientific applications using parallel I/O libraries such as HDF5, which utilizes MPI-IO APIs underneath its parallel I/O APIs. We have been focusing on performance improvements in such a collective MPI-IO by using a representative MPI-IO library named ROMIO. Inside ROMIO, an optimization scheme named two-phase I/O achieves higher performance even if we have non-contiguous accesses. We have developed multithreaded ROMIO using Pthreads for further performance improvement. In this paper, we present a better performance optimization in collective write operations by using a newly implemented functionality to manage CPU core bindings for invoked I/O threads in addition to a multiple I/O request queueing scheme. We achieved performance gains up to 29% with the CPU core bindings compared to I/O throughput without CPU core bindings. Furthermore, we noted that a multiple number of I/O request slots in queues mitigated the internal unbalanced data-exchange phase times among MPI processes.

机译：用于非连续访问的集合MPI-IO在不仅可以直接MPI-IA API调用中播放了一个大作用，而且还在使用并行I / O库（如HDF5）的科学应用程序，该应用程序在其并行I / O下面使用MPI-IO API蜜蜂。我们一直专注于通过使用名为Romio的代表性MPI-IO库来实现这些集体MPI-IO的性能改进。在Romio内，即使我们具有非连续访问，指定了两阶段I / O的优化方案也可以实现更高的性能。我们使用Pthreads开发了多线程Romio，以进一步实现性能改进。在本文中，除了多个I / O请求排队方案之外，我们还通过使用新实现的功能来管理用于管理I / O线程的CPU核心绑定的集体写操作更好的性能优化。与没有CPU核心绑定的I / O吞吐量相比，我们通过CPU核心绑定实现了高达29％的性能提升。此外，我们注意到，队列中的多个I / O请求时隙减少了MPI进程之间的内部不平衡数据交换阶段时间。

著录项

来源
《International Conference on High Performance Computing Simulation》|2014年||共8页
会议地点
作者
Tsujita Yuichi; Hori Atsushi; Ishikawa Yutaka;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类一般性问题;
关键词
Benchmark testing; Instruction sets; Libraries; Message systems; Multicore processing; Optimization; Throughput; CPU core bindings; MPI-IO; Pthreads; multiple I/O request queueing; multithreaded two-phase I/O;

机译：基准测试;指令集;库;消息系统;多核处理;优化;吞吐量;CPU 核心绑定;MPI- IO;P线程;多个I / O 请求排队;多线程两相 I / O;

相似文献

外文文献
中文文献
专利

1. OPTIMIZATION ALGORITHM COIL DESIGN KILNS COLLECTIVELY HYDRODYNAMICS TWO-PHASE FLOW AND STRENGTH [J] . Kadantsev M.N., Bayazitov M.I., Filippova A.G., Neftegazovoe Delo . 2014,第5期

机译：优化算法线圈设计相结合的水动力两相流和强度
2. A Two-Phase Dynamic Throughput Optimization Model for Big Data Transfers [J] . Nine S. Q. Zulkar, Kosar Tevfik IEEE Transactions on Parallel and Distributed Systems . 2021,第2期

机译：大数据传输的两相动态吞吐量优化模型
3. High throughput screening techniques in downstream processing: Preparation, characterization and optimization of aqueous two-phase systems [J] . Bensch M, Selbach B, Hubbuch J Chemical Engineering Science . 2007,第7期

机译：下游工艺中的高通量筛选技术：水性两相系统的制备，表征和优化
4. Affinity-aware optimization of multithreaded two-phase I/O for high throughput collective I/O [C] . Tsujita Yuichi, Hori Atsushi, Ishikawa Yutaka International Conference on High Performance Computing Simulation . 2014

机译：多线程两阶段I / O的亲和力感知优化，以实现高吞吐量的集体I / O
5. High Throughput Non-Parametric Probability Density Estimation via Novel Multithreaded Stitching Method [D] . Merino, Zach D. 2019

机译：基于新型多线程拼接方法的高吞吐量非参数概率密度估计
6. Memory and Energy Optimization Strategies for Multithreaded Operating System on the Resource-Constrained Wireless Sensor Node [O] . Xing Liu, Kun Mean Hou, Christophe de Vaulx, 2015

机译：资源受限的无线传感器节点上多线程操作系统的内存和能量优化策略
7. Optimizing Thread Throughput for Multithreaded Workloads on Memory Constrained CMPs [O] . Major Bhadauria, Sally A. Mckee 2008

机译：在内存约束Cmp上优化多线程工作负载的线程吞吐量
8. New Multithreaded Code for Calculating Longitudinal Collective Instabilities Using Computers with Multiprocessors [R] . Tan, C. Y. 2001

机译：使用多处理器计算机计算纵向集体不稳定性的新多线程代码

Affinity-aware optimization of multithreaded two-phase I/O for high throughput collective I/O

摘要

著录项

相似文献

相关主题

期刊订阅