首页> 外文会议>IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing >Experimental Study on the Performance and Resource Utilization of Data Streaming Frameworks
【24h】

Experimental Study on the Performance and Resource Utilization of Data Streaming Frameworks

机译:数据流框架的性能和资源利用的实验研究

获取原文
获取外文期刊封面目录资料

摘要

With the advent of the Internet of Things (IoT), data stream processing have gained increased attention due to the ever-increasing need to process heterogeneous and voluminous data streams. This work addresses the problem of selecting a correct stream processing framework for a given application to be executed within a specific physical infrastructure. For this purpose, we focus on a thorough comparative analysis of three data stream processing platforms - Apache Flink, Apache Storm, and Twitter Heron (the enhanced version of Apache Storm), that are chosen based on their potential to process both streams and batches in real-time. The goal of the work is to enlighten the cloud-clients and the cloud-providers with the knowledge of the choice of the resource-efficient and requirement-adaptive streaming platform for a given application so that they can plan during allocation or assignment of Virtual Machines for application execution. For the comparative performance analysis of the chosen platforms, we have experimented using 8-node clusters on Grid5000 experimentation testbed and have selected a wide variety of applications ranging from a conventional benchmark to sensor-based IoT application and statistical batch processing application. In addition to the various performance metrics related to the elasticity and resource usage of the platforms, this work presents a comparative study of the “green-ness” of the streaming platforms by analyzing their power consumption - one of the first attempts of its kind. The obtained results are thoroughly analyzed to illustrate the functional behavior of these platforms under different computing scenarios.
机译:随着物联网(IoT)的出现,由于对异类和大量数据流的需求不断增长,数据流处理已引起越来越多的关注。这项工作解决了为要在特定物理基础结构中执行的给定应用程序选择正确的流处理框架的问题。为此,我们专注于对三种数据流处理平台(Apache Flink,Apache Storm和Twitter Heron(Apache Storm的增强版本))进行全面的比较分析,这些平台是根据它们在处理流和批处理中的潜力而选择的。即时的。该工作的目标是通过了解给定应用程序的资源效率高和适应需求的流媒体平台的知识来启发云客户端和云提供商,以便他们可以在虚拟机的分配或分配期间进行计划用于应用程序执行。为了对所选平台进行比较性能分析,我们在Grid5000实验测试台上使用8节点集群进行了实验,并选择了从常规基准测试到基于传感器的IoT应用程序和统计批处理应用程序等各种应用程序。除了与平台的弹性和资源使用相关的各种性能指标外,这项工作还通过分析流媒体平台的功耗来提供对流媒体平台“绿色”的比较研究,这是同类平台的首次尝试之一。对获得的结果进行了彻底的分析,以说明这些平台在不同计算方案下的功能行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号