首页> 外文会议>Data Engineering Workshops (ICDEW), 2010 >Partitioning real-time ETL workflows
【24h】

Partitioning real-time ETL workflows

机译:分割实时ETL工作流程

获取原文

摘要

Many organizations are aiming to move away from traditional batch processing ETL to real-time ETL (RT-ETL). This move is motivated by a need to analyze and take decisions on as fresh a data as possible. The RT-ETL engines operate on the abstraction of data flow executed on parallel architectures. For high throughput and low response times, there is a need for partitioning the data over the large number of nodes in the engine. In this paper, we consider the problem of partitioning realtime ETL flows and we propose a high level architecture for that.
机译:许多组织的目标是从传统的批处理ETL转向实时ETL(RT-ETL)。此举是出于对尽可能多的最新数据进行分析并做出决策的需要。 RT-ETL引擎对在并行体系结构上执行的数据流的抽象进行操作。对于高吞吐量和低响应时间,需要在引擎中的大量节点上对数据进行分区。在本文中,我们考虑了对实时ETL流进行分区的问题,并为此提出了一个高级架构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号