Conference: International Conference on High Performance Computing Simulation

Affinity-aware optimization of multithreaded two-phase I/O for high throughput collective I/O


Abstract

Collective MPI-IO for non-contiguous accesses plays an important role not only in direct MPI-IO API calls but also in scientific applications that use parallel I/O libraries such as HDF5, which builds its parallel I/O APIs on top of MPI-IO. We have been working on improving the performance of such collective MPI-IO using ROMIO, a representative MPI-IO library. Inside ROMIO, an optimization scheme called two-phase I/O achieves higher performance even for non-contiguous accesses. We have developed a multithreaded ROMIO using Pthreads for further performance improvement. In this paper, we present a further performance optimization of collective write operations that uses newly implemented functionality to manage CPU core bindings for the invoked I/O threads, in addition to a multiple I/O request queueing scheme. We achieved I/O throughput gains of up to 29% with CPU core bindings compared to runs without them. Furthermore, we observed that providing multiple I/O request slots in the queues mitigated the imbalance in data-exchange phase times among MPI processes.
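The two ideas named in the abstract, binding an I/O thread to a CPU core and feeding it through a queue with several request slots, can be illustrated with a minimal sketch. This is not ROMIO's implementation: Python threads stand in for Pthreads, an in-memory buffer stands in for the file, the data-exchange phase is not modeled, and every name here (`pin_current_thread`, `io_worker`, `collective_write`, `slots`, `core_index`) is hypothetical. In a Pthreads program on Linux, the analogous binding call would likely be `pthread_setaffinity_np`.

```python
import os
import queue
import threading


def pin_current_thread(core_index):
    """Try to bind the calling thread to one allowed CPU core.

    Returns True on success, False where affinity control is unavailable.
    """
    if not hasattr(os, "sched_setaffinity"):
        return False  # e.g. macOS or Windows: no affinity API in the os module
    try:
        allowed = sorted(os.sched_getaffinity(0))
        core = allowed[core_index % len(allowed)]
        # On Linux the underlying system call applies to the calling thread.
        os.sched_setaffinity(0, {core})
        return True
    except OSError:
        return False


def io_worker(requests, sink, core_index):
    """Drain (offset, data) write requests until a None sentinel arrives."""
    pin_current_thread(core_index)
    while True:
        item = requests.get()
        if item is None:
            break
        offset, data = item
        # Assumption: a slice assignment stands in for the real file write.
        sink[offset:offset + len(data)] = data


def collective_write(chunks, total_size, slots=2, core_index=0):
    """Hand chunks to a pinned I/O thread through a bounded request queue."""
    sink = bytearray(total_size)
    requests = queue.Queue(maxsize=slots)  # number of request slots
    worker = threading.Thread(target=io_worker,
                              args=(requests, sink, core_index))
    worker.start()
    for chunk in chunks:
        requests.put(chunk)  # blocks only when every slot is occupied
    requests.put(None)       # sentinel: no more requests
    worker.join()
    return bytes(sink)
```

With `slots` greater than one, the producer can enqueue the next request while the I/O thread is still writing the previous one, which is the kind of overlap a multiple-slot queue is meant to allow. Whether pinning helps in practice depends on the machine's core and memory topology.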
