Journal of Supercomputing

Use case-based evaluation of workflow optimization strategy in real-time computation system

Abstract

With the start of the big data era, data stream computing has emerged as a well-known approach to optimizing data-intensive workflows. Apache STORM is an open-source real-time distributed computation system for processing data streams and has been adopted by well-known organizations such as Twitter, Yahoo, Alibaba, Baidu, and Groupon. Workflows are implemented as topologies in STORM. The main factor that controls the execution performance of a workflow in STORM is the strategy used to schedule the topology components (spouts and bolts). In this paper, we evaluate and analyze the performance of our Partition-based Data-intensive Workflow optimization Algorithm (PDWA) in Apache STORM using a use case workflow, EURExpressII. This is a real-world application workflow that builds a transcriptome-wide atlas of gene expression for the developing mouse embryo, established by ribonucleic acid (RNA) in situ hybridization. Our proposed algorithm, PDWA, partitions the application task graph so that data movement between partitions is minimized. Each partition is then mapped onto one machine for the execution of that partition's tasks, which yields the minimum execution time for that partition. Partial task duplication is also part of the algorithm and further enhances performance. A STORM-based computing cluster deployed in an OpenStack cloud is used as the computing environment. The performance of the PDWA-based optimizer is evaluated with data sets of different sizes. The results show that PDWA improves average execution time by 21% across different data set sizes and varying numbers of execution nodes. In addition, the comparative results show that, on average, the efficiency of PDWA is 20.4% higher than that of the STORM default scheduler (SDS).
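The abstract only outlines the core idea of PDWA: partition the application task graph so that inter-partition data movement is minimal, then map each partition onto one machine. The sketch below is a minimal illustration of that idea, not the paper's PDWA (it omits, for example, partial task duplication); the task names, data volumes, and the simple greedy edge-contraction heuristic are assumptions made purely for illustration.

```python
# Illustrative sketch (not the authors' PDWA): greedily contract the heaviest
# edges of a weighted task graph until k partitions remain, so that edges
# carrying large data volumes stay inside a partition, then map each
# partition to one machine. Task names and data volumes are invented.

# Workflow task graph: (producer, consumer) -> data volume moved (MB).
edges = {
    ("read_images", "preprocess"): 500,
    ("preprocess", "extract_features"): 400,
    ("extract_features", "classify"): 50,
    ("classify", "aggregate"): 5,
}
tasks = {t for e in edges for t in e}

def partition(tasks, edges, k):
    """Greedy partitioning: merge the two partitions joined by the
    heaviest remaining edge until only k partitions are left."""
    part_of = {t: {t} for t in tasks}            # task -> its partition (a set)
    for (u, v), _w in sorted(edges.items(), key=lambda kv: -kv[1]):
        if len({id(p) for p in part_of.values()}) <= k:
            break
        if part_of[u] is not part_of[v]:         # merge the two partitions
            merged = part_of[u] | part_of[v]
            for t in merged:
                part_of[t] = merged
    # Deduplicate the shared partition sets.
    return list({id(p): p for p in part_of.values()}.values())

def cut_cost(parts, edges):
    """Total data moved across partition boundaries."""
    where = {t: i for i, p in enumerate(parts) for t in p}
    return sum(w for (u, v), w in edges.items() if where[u] != where[v])

if __name__ == "__main__":
    parts = partition(tasks, edges, k=2)
    for i, p in enumerate(parts):
        print(f"partition {i} -> machine {i}: {sorted(p)}")
    print("inter-partition data movement (MB):", cut_cost(parts, edges))
```

With two target machines, the heavy image-processing edges end up inside one partition and only the lightweight classify-to-aggregate edge crosses the cut, which is the kind of placement the paper's partitioning step aims for.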
