首页> 外文期刊>Tsinghua Science and Technology >Concurrent and storage-aware data streaming for data processing workflows in grid environments
【24h】

Concurrent and storage-aware data streaming for data processing workflows in grid environments

机译:并发和存储感知的数据流,用于网格环境中的数据处理工作流

获取原文
获取原文并翻译 | 示例
           

摘要

Data streaming applications, usually composed of sequential/parallel data processing tasks organized as a workflow, bring new challenges to workflow scheduling and resource allocation in grid environments. Due to the high volumes of data and relatively limited storage capability, resource allocation and data streaming have to be storage aware. Also to improve system performance, the data streaming and processing have to be concurrent. This study used a genetic algorithm (GA) for workflow scheduling, using on-line measurements and predictions with gray model (GM). On-demand data streaming is used to avoid data overflow through repertory strategies. Tests show that tasks with on-demand data streaming must be balanced to improve overall performance, to avoid system bottlenecks and backlogs of intermediate data, and to increase data throughput for the data processing workflows as a whole.
机译:数据流应用程序通常由组织为工作流的顺序/并行数据处理任务组成,给网格环境中的工作流调度和资源分配带来了新的挑战。由于海量数据和相对有限的存储能力,资源分配和数据流必须了解存储。同样为了提高系统性能,数据流和处理必须同时进行。这项研究使用遗传算法(GA)进行工作流调度,并使用带有灰色模型(GM)的在线测量和预测。按需数据流用于通过库策略避免数据溢出。测试表明,必须平衡具有按需数据流的任务,以提高整体性能,避免系统瓶颈和中间数据积压,并提高整个数据处理工作流的数据吞吐量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号