Overcoming data locality: An in-memory runtime file system with symmetrical data distribution
Abstract
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communicate via intermediate files; application performance strongly depends on the file system in use. The state of the art uses runtime systems providing in-memory file storage that is designed for data locality: files are placed on those nodes that write or read them. With data locality, however, task distribution conflicts with data distribution, leading to application slowdown, and worse, to prohibitive storage imbalance. To overcome these limitations, we present MemFS, a fully symmetrical, in-memory runtime file system that stripes files across all compute nodes, based on a distributed hash function. Our cluster experiments with Montage and BLAST workflows, using up to 512 cores, show that MemFS has both better performance and better scalability than the state-of-the-art, locality-based file system, AMFS. Furthermore, our evaluation on a public commercial cloud validates our cluster results. On this platform MemFS shows excellent scalability up to 1024 cores and is able to saturate the 10G Ethernet bandwidth when running BLAST and Montage.
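The abstract says only that MemFS stripes files across all compute nodes "based on a distributed hash function". As a rough illustration of that idea, not the authors' implementation, the sketch below splits a file into fixed-size stripes and hashes each stripe's key to pick a node. The stripe size, the use of SHA-1, the `path#index` key format, the modulo placement, and all function names are assumptions made for this example.

```python
import hashlib
from collections import Counter
from typing import List

# Assumed stripe size in bytes; the abstract does not state one.
STRIPE_SIZE = 512 * 1024


def node_for_stripe(path: str, stripe_index: int, num_nodes: int) -> int:
    """Map one stripe of a file to a node by hashing its (path, index) key.

    Hypothetical placement rule: SHA-1 of "path#index", reduced modulo the
    number of nodes. Every node can compute this without a central directory.
    """
    key = f"{path}#{stripe_index}".encode("utf-8")
    digest = hashlib.sha1(key).digest()
    return int.from_bytes(digest[:8], "big") % num_nodes


def place_file(path: str, size_bytes: int, num_nodes: int) -> List[int]:
    """Return the node index chosen for each stripe of a file."""
    # Ceiling division; an empty file is treated as one (empty) stripe.
    num_stripes = max(1, -(-size_bytes // STRIPE_SIZE))
    return [node_for_stripe(path, i, num_nodes) for i in range(num_stripes)]


def stripes_per_node(placement: List[int], num_nodes: int) -> List[int]:
    """Count how many stripes landed on each node."""
    counts = Counter(placement)
    return [counts.get(n, 0) for n in range(num_nodes)]


if __name__ == "__main__":
    # A hypothetical 2 GiB intermediate file spread over 8 nodes.
    placement = place_file("/memfs/tmp/mosaic.fits", 2 * 1024**3, 8)
    print(stripes_per_node(placement, 8))
```

Because placement depends only on the stripe key and the node count, it is independent of which node runs the writing or reading task. That independence is the contrast with locality-based placement that the abstract draws: storage load follows the hash distribution instead of the task schedule.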