Nephele streaming: stream processing under QoS constraints at scale

Bj?rn Lohrmann; DanielWarneke; Odej Kao

首页> 外文期刊>Cluster computing >Nephele streaming: stream processing under QoS constraints at scale

【24h】

Nephele streaming: stream processing under QoS constraints at scale

机译：Nephele流：大规模处理QoS约束下的流处理

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The ability to process large numbers of continuous data streams in a near-real-time fashion has become a crucial prerequisite for many scientific and industrial use cases in recent years. While the individual data streams are usually trivial to process, their aggregated data volumes easily exceed the scalability of traditional stream processing systems. At the same time, massively-parallel data processing systems like MapReduce or Dryad currently enjoy a tremendous popularity for data-intensive applications and have proven to scale to large numbers of nodes. Many of these systems also provide streaming capabilities. However, unlike traditional stream processors, these systems have disregarded QoS requirements of prospective stream processing applications so far. In this paper we address this gap. First, we analyze common design principles of today’s parallel data processing frameworks and identify those principles that provide degrees of freedom in trading off the QoS goals latency and throughput. Second, we propose a highly distributed scheme which allows these frameworks to detect violations of userdefined QoS constraints and optimize the job execution without manual interaction. As a proof of concept, we implemented our approach for our massively-parallel data processing framework Nephele and evaluated its effectiveness through a comparison with Hadoop Online. For an example streaming application from the multimedia domain running on a cluster of 200 nodes, our approach improves the processing latency by a factor of at least 13 while preserving high data throughput when needed.

机译：近年来，以近实时方式处理大量连续数据流的能力已成为许多科学和工业用例的关键先决条件。尽管单个数据流通常不容易处理，但它们的聚合数据量很容易超过传统流处理系统的可伸缩性。同时，大规模并行数据处理系统（如MapReduce或Dryad）目前在数据密集型应用程序中享有很高的知名度，并已证明可以扩展到大量节点。其中许多系统还提供流功能。但是，与传统的流处理器不同，到目前为止，这些系统都忽略了预期的流处理应用程序的QoS要求。在本文中，我们解决了这一差距。首先，我们分析当今并行数据处理框架的通用设计原则，并确定在权衡QoS目标时延和吞吐量方面可以提供自由度的那些原则。其次，我们提出了一种高度分布式的方案，该方案允许这些框架检测到违反用户定义的QoS约束并优化作业执行的情况，而无需手动交互。作为概念验证，我们为大规模并行数据处理框架Nephele实施了我们的方法，并通过与Hadoop Online的比较评估了其有效性。对于在200个节点的群集上运行的来自多媒体域的流应用程序示例，我们的方法将处理延迟提高了至少13倍，同时在需要时保留了高数据吞吐量。

著录项

来源
《Cluster computing》 |2014年第1期|共18页
作者
Bj?rn Lohrmann; DanielWarneke; Odej Kao;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类分子生物学;
关键词
Massively-parallel; Stream processing; Distributed systems; Latency; QoS;

机译：大规模并行;流处理;分布式系统;延迟;QoS;

相似文献

外文文献
中文文献
专利

1. Nephele streaming: stream processing under QoS constraints at scale [J] . Bj?rn Lohrmann, DanielWarneke, Odej Kao Cluster computing . 2014,第1期

机译：Nephele流：大规模处理QoS约束下的流处理
2. Using a process-based catchment-scale modelfor enhancing field-based stream assessments and predicting stream fish assemblages [J] . ADAM KAUTZA, S. MAZEIKA P. SULLIVAN Aquatic Conservation: Marine and Freshwater Ecosystems . 2012,第4期

机译：使用基于过程的流域规模模型来增强基于野外的溪流评估并预测溪流鱼群
3. QoS-Awareness in Transaction Models for Stream Processing [J] . Shinji Kikuchi, Subhash Bhalla 電子情報通信学会技術研究報告. サービスコンピューティング. Services computing . 2016,第76期

机译：流处理的事务处理模型中的QoS意识
4. Massively-Parallel Stream Processing under QoS Constraints with Nephele [C] . Bjoern Lohrmann, Daniel Warneke, Odej Kao 21st ACM symposium on high-performance parallel distributed computing . 2012

机译：Nephele在QoS约束下的大规模并行流处理
5. QoS management for real-time data services and data stream query processing. [D] . Wei, Yuan. 2006

机译：QoS管理，用于实时数据服务和数据流查询处理。
6. Streaming MASSIF: Cascading Reasoning for Efficient Processing of IoT Data Streams [O] . Pieter Bonte, Riccardo Tommasini, Emanuele Della Valle, 2018

机译：流式MASSIF：物联网数据流高效处理的级联推理
7. Nephele Streaming: Stream Processing Under QoS Constraints At Scale [O] . Lohrmann, Björn, Warneke, Daniel, Kao, Odej 2013

机译：Nephele streaming：规模Qos约束下的流处理
8. Discretized Streams: A Fault-Tolerant Model for Scalable Stream Processing. [R] . Zaharia, M., Das, T., Li, H., 2012

机译：离散流：可扩展流处理的容错模型。

Nephele streaming: stream processing under QoS constraints at scale

摘要

著录项

相似文献

相关主题

期刊订阅