Listed under: Theoretical and Experimental Plant Physiology

Balancing push and pull in Confuga, an active storage cluster file system for scientific workflows

Abstract

Most big-data analysis systems require users to adopt restricted abstractions to achieve scaling and system stability. While highly effective at establishing data locality and eliminating interdependencies, this approach is not easily incorporated into scientific workflows that are often complex and irregular graphs of sequential programs with multiple dependencies. To address this, we have developed an active storage cluster file system named Confuga which harnesses the file information already available in the workflow to enable efficient and controlled distribution of dependencies across active storage nodes. Confuga is built upon the idea of leveraging a job's namespace to eliminate unknown transfers and to plan the replication of all job dependencies. Replication is carried out through two opposing transfer methodologies: centrally managed push transfers and distributed pulls. We evaluate the effectiveness of the two transfer mechanisms using workflows that stress the ability of the cluster to replicate dependencies. Ultimately, we show that a balance of the two approaches achieves optimal file distribution. This is shown in two bioinformatics workflows where a careful balance of the two mechanisms leads to 48% and 77% improvements over only push or pull. Copyright (C) 2016 John Wiley & Sons, Ltd.
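
The idea of using a job's namespace to plan replication, and of splitting transfers between centrally managed pushes and node-initiated pulls, can be illustrated with a toy sketch. This is not Confuga's implementation: the `Job`, `Cluster`, and `plan_transfers` names, the replica map, and the size-based `push_threshold` policy are all hypothetical choices made here for illustration only.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Job:
    """A job whose dependencies are fully declared in its namespace."""
    name: str
    namespace: dict[str, str]  # path inside the job sandbox -> file id


@dataclass
class Cluster:
    """Hypothetical replica catalog: file id -> storage nodes holding a copy."""
    replicas: dict[str, set[str]] = field(default_factory=dict)

    def missing(self, job: Job, node: str) -> list[str]:
        """File ids the job needs that are not yet replicated on `node`."""
        needed = set(job.namespace.values())
        return sorted(f for f in needed if node not in self.replicas.get(f, set()))


def plan_transfers(job: Job, node: str, cluster: Cluster,
                   sizes: dict[str, int], push_threshold: int):
    """Split the missing dependencies into (push, pull) lists.

    Assumed policy (not taken from the paper): large files are pushed by the
    central scheduler, which can throttle them; small files are pulled by the
    storage node itself when the job is dispatched.
    """
    push, pull = [], []
    for file_id in cluster.missing(job, node):
        (push if sizes[file_id] >= push_threshold else pull).append(file_id)
    return push, pull


# Usage: every dependency is known up front, so no transfer is a surprise.
cluster = Cluster(replicas={
    "genome.fa": {"node1"},
    "params.cfg": {"node1"},
    "reads.fq": {"node2"},
})
job = Job("align", {
    "./db/genome.fa": "genome.fa",
    "./etc/params.cfg": "params.cfg",
    "./in/reads.fq": "reads.fq",
})
sizes = {"genome.fa": 3 << 30, "params.cfg": 2 << 10, "reads.fq": 1 << 30}
push, pull = plan_transfers(job, "node2", cluster, sizes, push_threshold=1 << 30)
print("push:", push, "pull:", pull)
```

Because the namespace lists every dependency, the planner can compute the full set of required transfers before the job starts. Which files go to push and which to pull is the tunable balance the abstract refers to.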