Apache Flink: Stream Analytics at Scale

Abstract

Summary form only given. Apache Flink is an open-source system for expressive, declarative, fast, and efficient data analysis on both historical (batch) and real-time (streaming) data. Flink combines the scalability and programming flexibility of distributed MapReduce-like platforms with the efficiency, out-of-core execution, and query optimization capabilities found in parallel databases. At its core, Flink builds on a distributed dataflow runtime that unifies batch and incremental computations over a true-streaming pipelined execution. Its programming model allows for stateful, fault-tolerant computations, flexible user-defined windowing semantics for streaming, and unique support for iterations. Flink is converging into a use-case complete system for parallel data processing, with a wide range of top-level libraries ranging from machine learning to graph processing. Apache Flink originates from the Stratosphere project led by TU Berlin and has led to various scientific papers (e.g., in VLDBJ, SIGMOD, (P)VLDB, ICDE, and HPDC). In this half-day tutorial we will introduce Apache Flink and give a tutorial on its streaming capabilities using concrete examples of application scenarios, focusing on concepts such as stream windowing and stateful operators.

Bibliographic details

Source: 2016 IEEE International Conference on Cloud Engineering Workshop, 2016, p. 193 (1 page)
Conference location: Berlin (DE)
Authors: Asterios Katsifodimos; Sebastian Schelter
Affiliations: Tech. Univ. Berlin, Berlin, Germany (both authors)
Original format: PDF
Language: English
Keywords: Electronic mail; Programming; Query processing; Distributed databases; Terrestrial atmosphere; Tutorials; Data analysis
Indexed: 2022-08-26 13:53:20

Related literature

1. A comparison on scalability for batch big data processing on Apache Spark and Apache Flink [J]. Diego García-Gil, Sergio Ramírez-Gallego, Salvador García, Big Data Analytics. 2017, No. 1
2. Real-time incremental recommendation for streaming data based on Apache Flink [J]. Tang Zhuo, Liu Zeyu, Li Kenli, Intelligent Data Analysis. 2019, No. 6
3. Framework for Error Detection & its Localization in Sensor Data Stream for reliable big sensor data analytics using Apache Spark Streaming [J]. Govind P. Gupta, Jahanvi Khedwal, Procedia Computer Science. 2020, No. 5
4. Apache Flink: Stream Analytics at Scale [C]. Asterios Katsifodimos, Sebastian Schelter, IEEE International Conference on Cloud Engineering Workshop. 2016
5. Declarative Frameworks and Optimization Techniques for Developing Scalable Advanced Analytics over Databases and Data Streams [D]. Das, Ariyam. 2019
6. Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming [O]. Sadegh Ilbeigipour, Amir Albadvi, Elham Akhondzadeh Noughabi. 2021
7. A comparison on scalability for batch big data processing on Apache Spark and Apache Flink [O]. 2017
