Task Clustering on ETL Systems: A Pattern-Oriented Approach

Published in: International Conference on Data Management Technologies and Applications


Abstract

Data warehouse populating processes are usually data-oriented workflows composed of dozens of granular tasks that are responsible for integrating data coming from different data sources. Specific subsets of these tasks can be grouped into a collection, together with their relationships, to form higher-level constructs. Raising the level of task granularity in this way allows processes to be generalized, simplifying their views and providing a means of carrying expertise over to new applications. Well-proven practices can be used to describe general solutions based on basic skeletons that are configured and instantiated according to a set of specific integration requirements. Patterns can be applied to ETL processes not only to simplify a possible conceptual representation but also to reduce the gap that often exists between two design perspectives. In this paper, we demonstrate the feasibility and effectiveness of a pattern-based ETL approach using task clustering, analyzing a real-world ETL scenario through the definitions of two commonly used clusters of tasks: a data lookup cluster and a data conciliation and integration cluster.
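As an illustration of grouping a subset of tasks, together with their relationships, into a higher-level construct, the following is a minimal sketch and not the authors' implementation. It assumes a workflow can be modelled as a set of task names plus directed edges; the names `Workflow`, `Cluster`, `cluster_tasks`, and every task name in the example are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Workflow:
    """A data-oriented workflow: task names plus directed edges (source, target)."""
    tasks: set[str] = field(default_factory=set)
    edges: set[tuple[str, str]] = field(default_factory=set)


@dataclass
class Cluster:
    """A higher-level construct: a subset of tasks and their internal relationships."""
    name: str
    tasks: set[str]
    edges: set[tuple[str, str]]


def cluster_tasks(wf: Workflow, name: str, members: set[str]) -> tuple[Workflow, Cluster]:
    """Collapse `members` into a single node `name`.

    Returns the coarser workflow view and the cluster that keeps the detail.
    """
    unknown = members - wf.tasks
    if unknown:
        raise ValueError(f"unknown tasks: {sorted(unknown)}")
    if name in wf.tasks - members:
        raise ValueError(f"cluster name clashes with an existing task: {name}")

    # Relationships between members stay inside the cluster.
    internal = {(a, b) for a, b in wf.edges if a in members and b in members}
    cluster = Cluster(name, set(members), internal)

    def rename(task: str) -> str:
        return name if task in members else task

    # Every other edge is kept, with member endpoints redirected to the cluster node.
    coarse_edges = {(rename(a), rename(b)) for a, b in wf.edges if (a, b) not in internal}
    coarse = Workflow((wf.tasks - members) | {name}, coarse_edges)
    return coarse, cluster


# Hypothetical example: a surrogate-key lookup with a fallback for missing keys.
wf = Workflow(
    tasks={"extract_sales", "lookup_customer_key", "handle_missing_key", "load_fact"},
    edges={
        ("extract_sales", "lookup_customer_key"),
        ("lookup_customer_key", "handle_missing_key"),
        ("lookup_customer_key", "load_fact"),
        ("handle_missing_key", "load_fact"),
    },
)
coarse, lookup = cluster_tasks(wf, "data_lookup", {"lookup_customer_key", "handle_missing_key"})
print(sorted(coarse.edges))
# [('data_lookup', 'load_fact'), ('extract_sales', 'data_lookup')]
```

The coarse view shows three nodes instead of four, while the cluster retains the two lookup tasks and the edge between them. This sketch does not check that the collapsed view remains acyclic, which a real design tool would likely need to do.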
