首页> 外文会议>Cluster Computing and the Grid, 2009. CCGRID '09 >Hierarchical Caches for Grid Workflows
【24h】

Hierarchical Caches for Grid Workflows

机译:网格工作流的分层缓存

获取原文

摘要

From personal software to advanced systems, caching mechanisms have steadfastly been a ubiquitous means for reducing workloads. It is no surprise, then, that under the grid and cluster paradigms, middlewares and other large-scale applications often seek caching solutions. Among these distributed applications, scientific workflow management systems have gained ground towards mitigating the often painstaking process of composing sequences of scientific data sets and services to derive virtual data. In the past, workflow managers have relied on low-level system cache for reuse support. But in distributed query intensive environments, where high volumes of intermediate virtual data can potentially be stored anywhere on the grid, a novel cache structure is needed to efficiently facilitate workflow planning. In this paper, we describe an approach to combat the challenges of maintaining large, fast virtual data caches for workflow composition. A hierarchical structure is proposed for indexing scientific data with spatiotemporal annotations across grid nodes. Our experimental results show that our hierarchical index is scalable and outperforms a centralized indexing scheme by an exponential factor in query intensive environments.
机译:从个人软件到高级系统,缓存机制一直是减少工作量的普遍手段。因此,不足为奇的是,在网格和集群范例下,中间件和其他大型应用程序经常寻求缓存解决方案。在这些分布式应用程序中,科学工作流管理系统已在减轻通常由科学数据集和服务的序列构成虚拟数据组成的繁琐过程中取得了进展。过去,工作流管理器依赖于低级系统缓存来提供重用支持。但是在分布式查询密集型环境中,大量的中间虚拟数据可能会存储在网格上的任何位置,因此需要一种新颖的缓存结构来有效地促进工作流规划。在本文中,我们描述了一种方法,该方法可应对为工作流组合维护大型快速虚拟数据缓存的挑战。提出了一种层次结构,用于跨网格节点对具有时空注释的科学数据建立索引。我们的实验结果表明,在查询密集型环境中,我们的层次结构索引具有可伸缩性,并且在性能上优于指数索引方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号