首页> 外文会议>IEEE International Conference on Cluster Computing >K MapReduce: A scalable tool for data-processing and search/ensemble applications on large-scale supercomputers
【24h】

K MapReduce: A scalable tool for data-processing and search/ensemble applications on large-scale supercomputers

机译:K MapReduce:可扩展的工具,用于大型超级计算机上的数据处理和搜索/集成应用程序

获取原文

摘要

K MapReduce (KMR) is a high-performance MapReduce system in the MPI environment, targeting large-scale supercomputers such as the K computer. Its objectives are to ease programming for data-processing and to achieve efficiency by utilizing the large amount of memory available in large-scale supercomputers. In KMR, shuffling operation exchanges key-value pairs in a scalable way by collective communication algorithms utilizing the K's interconnect. Mapping and reducing operations are multi-threaded to achieve even greater efficiency in modern multi-core machines. Sorting is optimized using fixed-length packed keys instead of variable-length raw keys, which is extensively used inside of shuffling and reducing operations. Besides the MapReduce operations, KMR provides routines for collective file reading for affinity-aware optimizations. This paper presents the results of experimental performance studies of KMR on the K computer. Affinity-aware file loading improves the performance by about 42% over a non-optimized implementation. We also show how KMR can be used to program real-world scientific applications such as meta-genome search and replica-exchange molecular dynamics.
机译:K MapReduce(KMR)是MPI环境中的高性能MapReduce系统,目标是大型超级计算机(例如K计算机)。其目的是通过利用大型超级计算机中可用的大量内存来简化数据处理的编程并提高效率。在KMR中,改组操作通过使用K互连的集体通信算法以可伸缩的方式交换键值对。映射和归约操作是多线程的,可在现代多核计算机中实现更高的效率。使用固定长度的压缩键而不是可变长度的原始键来优化排序,而可变长度的原始键在改组和精简操作内部广泛使用。除了MapReduce操作之外,KMR还提供了用于集体文件读取的例程,以进行亲和力感知优化。本文介绍了在K计算机上进行KMR实验性能研究的结果。与未优化的实现相比,可感知亲和力的文件加载将性能提高了约42%。我们还将展示如何使用KMR来编程现实世界中的科学应用,例如元基因组搜索和副本交换分子动力学。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号