首页> 外文OA文献 >Parallel Copying Tools for Distributed File Systems
【2h】

Parallel Copying Tools for Distributed File Systems

机译:分布式文件系统的并行复制工具

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Parallel distributed files systems are increasingly being used on clusters to allow greater throughput of data to the many compute nodes. They are also an effective way to store massive amounts of data. However, using the standard core utility cp does not make good use of the potential parallelism of the file systems. Using multiple cp commands has inherent problems too.Two utilities were created to help recursively copy directories containing large amounts of data on parallel distributed file systems. One of the test data sets contains very many files, and the other contains large files. One utility is a C program that submits a single job on a user specified number of nodes. The work of copying the files is dynamically distributed among those nodes using MPI communications. Multiple threads are used to traverse the directories. Speedups of 9.57 and 7.36 were attained for the many files set and the large files set, respectively. A second utility is written in Java. It also uses multiple threads to traverse the directories, but it performs the copying by creating Bash scripts and submitting them to the job scheduler. The work is balanced among those scripts and the number of jobs is specified by the user. It reached speedups of 3.67 and 7.32 for the same two data sets. Both utilities can also be used to track the progress of the jobs they have submitted.
机译:群集上越来越多地使用并行分布式文件系统,以允许更大的数据吞吐量到达许多计算节点。它们也是存储大量数据的有效方法。但是,使用标准核心实用程序cp不能充分利用文件系统的潜在并行性。使用多个cp命令也存在固有的问题。创建了两个实用程序,以帮助在并行分布式文件系统上递归复制包含大量数据的目录。其中一个测试数据集包含很多文件,另一个包含大文件。一个实用程序是C程序,它在用户指定数量的节点上提交单个作业。使用MPI通信在这些节点之间动态分发文件的复制工作。使用多个线程来遍历目录。对于许多文件集和大文件集,分别达到了9.57和7.36的加速比。第二个实用程序是用Java编写的。它还使用多个线程来遍历目录,但是它通过创建Bash脚本并将其提交到作业调度程序来执行复制。这些脚本之间的工作是平衡的,并且作业的数量由用户指定。对于相同的两个数据集,其加速比分别为3.67和7.32。这两个实用程序还可用于跟踪其提交的作业的进度。

著录项

  • 作者

    Nuss Kevin Matthew;

  • 作者单位
  • 年度 2013
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号