首页> 外文会议>IEEE International Advance Computing Conference >A Comparative Review of Data Warehousing ETL Tools with New Trends and Industry Insight
【24h】

A Comparative Review of Data Warehousing ETL Tools with New Trends and Industry Insight

机译:具有新趋势和行业洞察力的数据仓储ETL工具的比较综述

获取原文

摘要

Data Warehouse is a repository of strategic data from many sources gathered over a long period of time. Traditional DW operations mainly comprise of extracting data from multiple sources, transforming these data into a compatible form and finally loading them to DW schema for further analysis. The extract-transform-load (ETL) functions need to be incorporated into appropriate tools so that organisations can utilise these tools efficiently as required. There is a wide variety of such tools available in market. In this paper, we have compared different aspects of some popular ETL tools (Informatica, Datastage, Ab Initio, Oracle Data Integrator, SSIS) and have analysed their advantages and disadvantages. We have also highlighted some salient features (performance optimisation, data lineage, real time data analysis, cost, language binding etc.) of these tools and represented them with a comparative study. Apart from the review of the ETL tools, the paper also provides feedback from data science industry which narrates the market value and relevance of the tools in practical scenario. However, the traditional DW concept is expanding rapidly with the advent of big data, cloud computing, real time data analysis and the growing need of parsing information from structured and unstructured data sources. In this paper, we have also identified these factors which are transforming the definition of data warehousing.
机译:数据仓库是许多来源的战略数据存储库,在很长一段时间内聚集。传统的DW操作主要包括从多个源中提取数据,将这些数据转换为兼容的形式,并最终将它们加载到DW模式以进行进一步分析。提取 - 变换负载(ETL)功能需要合并到适当的工具中,以便组织根据需要有效地利用这些工具。市场上有各种各样的工具。在本文中,我们比较了一些流行的ETL工具(Informatica,DataStage,AB Initio,Oracle Data Integator,SSIS)的不同方面,并分析了它们的优缺点。我们还强调了这些工具的一些突出特征(性能优化,数据谱,实时数据分析,成本,语言绑定等),并以比较研究代表它们。除了审查ETL工具外,本文还提供了数据科学行业的反馈,叙述了实际情况下工具的市场价值和相关性。然而,传统的DW概念正在随着大数据,云计算,实时数据分析的出现以及从结构化和非结构化数据源解析信息的越来越多的需求而迅速扩展。在本文中,我们还确定了转换数据仓库定义的这些因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号