Concurrent and storage-aware data streaming for data processing workflows in grid environments

Zhang Wen; Cao Junwei; Zhong Yisheng; Liu Lianchen; Wu Cheng

首页> 外文期刊>Tsinghua Science and Technology >Concurrent and storage-aware data streaming for data processing workflows in grid environments

【24h】

Concurrent and storage-aware data streaming for data processing workflows in grid environments

机译：并发和存储感知的数据流，用于网格环境中的数据处理工作流

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data streaming applications, usually composed of sequential/parallel data processing tasks organized as a workflow, bring new challenges to workflow scheduling and resource allocation in grid environments. Due to the high volumes of data and relatively limited storage capability, resource allocation and data streaming have to be storage aware. Also to improve system performance, the data streaming and processing have to be concurrent. This study used a genetic algorithm (GA) for workflow scheduling, using on-line measurements and predictions with gray model (GM). On-demand data streaming is used to avoid data overflow through repertory strategies. Tests show that tasks with on-demand data streaming must be balanced to improve overall performance, to avoid system bottlenecks and backlogs of intermediate data, and to increase data throughput for the data processing workflows as a whole.

机译：数据流应用程序通常由组织为工作流的顺序/并行数据处理任务组成，给网格环境中的工作流调度和资源分配带来了新的挑战。由于海量数据和相对有限的存储能力，资源分配和数据流必须了解存储。同样为了提高系统性能，数据流和处理必须同时进行。这项研究使用遗传算法（GA）进行工作流调度，并使用带有灰色模型（GM）的在线测量和预测。按需数据流用于通过库策略避免数据溢出。测试表明，必须平衡具有按需数据流的任务，以提高整体性能，避免系统瓶颈和中间数据积压，并提高整个数据处理工作流的数据吞吐量。

著录项

来源
《Tsinghua Science and Technology》 |2010年第3期|p.335-346|共12页
作者
Zhang Wen; Cao Junwei; Zhong Yisheng; Liu Lianchen; Wu Cheng;
展开▼
作者单位

Department of Automation, Tsinghua University, Beijing 100084, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
concurrent; data streaming; grid; storage-aware; workflow;

机译：并发;数据流;网格;存储感知;工作流;

相似文献

外文文献
中文文献
专利

1. Transformation-Based Streaming Workflow Allocation on Geo-Distributed Datacenters for Streaming Big Data Processing [J] . Chen Wuhui, Paik Incheon, Hung Patrick C. K. Services Computing, IEEE Transactions on . 2019,第4期

机译：地理分布数据中心上基于转换的流工作流分配，用于流式处理大数据
2. Optimization of data-intensive workflows in stream-based data processing models [J] . Ahmad Saima Gulzar, Liew Chee Sun, Rafique M. Mustafa, Journal of supercomputing . 2017,第9期

机译：在基于流的数据处理模型中优化数据密集型工作流
3. Developing a streaming data processing workflow for querying space-time activities from geotagged tweets [J] . Wachowicz Monica, Arteaga M. Dolores, Cha Sangwhan, Computers，environment and urban systems . 2016,第sepa期

机译：开发流数据处理工作流以从地理标记的推文中查询时空活动
4. Block-Based Concurrent and Storage-Aware Data Streaming for Grid Applications with Lots of Small Files [C] . Wen Zhang, Junwei Cao, Yisheng Zhong, Cluster Computing and the Grid, 2009. CCGRID '09 . 2009

机译：具有大量小文件的网格应用程序的基于块的并发和存储感知数据流
5. Autonomic management of data streaming and in-transit processing for data intensive scientific workflows. [D] . Bhat, Viraj. 2008

机译：数据流的自主管理和数据密集型科学工作流的在途处理。
6. Workflows for microarray data processing in the Kepler environment [O] . Thomas Stropp, Timothy McPhillips, Bertram Ludäscher, 2012

机译：开普勒环境中微阵列数据处理的工作流程
7. Concurrent and storage-aware data streaming for data processing workflows in grid environments [O] . Wen Zhang, Junwei Cao, Yisheng Zhong, 2010

机译：在网格环境中的数据处理工作流程的并发和存储感知数据流

Concurrent and storage-aware data streaming for data processing workflows in grid environments

摘要

著录项

相似文献

相关主题

期刊订阅