On distributed file tree walk of parallel file systems

机译：在并行文件系统的分布式文件树遍历中

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Supercomputers generate vast amounts of data, typically organized into large directory hierarchies on parallel file systems. While the supercomputing applications are parallel, the tools used to process them requiring complete directory traversais, are typically serial. We present an algorithm framework and three fully distributed algorithms for traversing large parallel file systems, and performing file operations in parallel. The first algorithm introduces a randomized work-stealing scheduler; the second improves the first with proximity-awareness; and the third improves upon the second by using a hybrid approach. We have tested our implementation on Cielo, a 1.37 petaflop supercomputer at the Los Alamos National Laboratory and its 7 petabyte file system. Test results show that our algorithms execute orders of magnitude faster than state-of-the-art algorithms while achieving ideal load balancing and low communication cost. We present performance insights from the use of our algorithms in production systems at LANL, performing daily file system operations.

机译：超级计算机生成大量数据，通常在并行文件系统上将其组织成大型目录层次结构。尽管超级计算应用程序是并行的，但用于处理它们的工具（需要完整的目录traversais）通常是串行的。我们提出了一种算法框架和三种完全分布式的算法，用于遍历大型并行文件系统并并行执行文件操作。第一种算法引入了随机的工作窃取调度器；第二个改进了第一个具有邻近感知的功能；第三种通过使用混合方法对第二种进行了改进。我们已经在洛斯阿拉莫斯国家实验室的1.37 petaflop超级计算机Cielo及其7 PB文件系统上测试了我们的实现。测试结果表明，我们的算法执行速度比最新算法快几个数量级，同时实现了理想的负载平衡和较低的通信成本。我们通过在LANL的生产系统中使用我们的算法，执行日常文件系统操作，提供性能见解。

著录项

来源
《2012 International Conference for High Performance Computing, Networking, Storage and Analysis.》|2012年|p.1-11|共11页
会议地点 Salt Lake City UT(US);Salt Lake City UT(US)
作者
LaFon Jharrod; Misra Satyajayant; Bringhurst Jon;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机网络;计算机网络;
关键词
File Systems; Metadata; Parallel Algorithms;

机译：文件系统;元数据;并行算法;;

相似文献

外文文献
中文文献
专利

1. Fault tolerant file models for parallel file systems: introducing distribution patterns for every file [J] . A. Calderón, F. García-Carballeira, L. M. Sánchez, The Journal of Supercomputing . 2009,第3期

机译：并行文件系统的容错文件模型：为每个文件引入分发模式
2. The Design of VIP-FS: A Virtual, Parallel File System for High Performance Parallel and Distributed Computing [J] . Michael Harry, Juan Miguel del Rosario, Alok Choudhary Operating systems review . 1995,第3期

机译：VIP-FS的设计：用于高性能并行和分布式计算的虚拟并行文件系统
3. Dynamic File Prefetching Scheme Based on File Access Patterns in Via-Based Parallel File System [J] . Yoon-Young Lee, Chei-Yol Kim, Dae-Wha Seo IEICE Transactions on Information and Systems . 2002,第4期

机译：基于Via的并行文件系统中基于文件访问模式的动态文件预取方案
4. On Distributed File Tree Walk of Parallel File Systems [C] . Jharrod LaFon, Satyajayant Misra, Jon Bringhurst SC 2012 . 2012

机译：在并行文件系统的分布式文件树步行
5. Distributed access to parallel file systems. [D] . Hildebrand, Dean. 2007

机译：对并行文件系统的分布式访问。
6. Optimized distributed systems achieve significant performance improvement on sorted merging of massive VCF files [O] . Xiaobo Sun, Jingjing Gao, Peng Jin, 2018

机译：经过优化的分布式系统在大量VCF文件的排序合并中实现了显着的性能提升
7. On distributed file tree walk of parallel file systems [O] . Jharrod LaFon, Satyajayant Misra, Jon Bringhurst 2012

机译：在并行文件系统的分布式文件树步行

On distributed file tree walk of parallel file systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅