Journal: WSEAS Transactions on Information Science and Applications

XML based Framework for ETL Processes For Relational Databases


Abstract

In data warehousing, Extraction-Transformation-Loading (ETL) processes are the key tasks responsible for extracting data from several sources, cleansing and customizing it, and inserting it into the data warehouse [10]. More specifically, ETL tools are a category of specialized tools that deal with data warehouse cleaning and loading problems. These tasks are critical in every data warehouse environment. ETL and data cleaning tools are estimated to account for at least one third of the effort and expenses in a data warehouse budget [1,11], and other evidence suggests that the ETL process accounts for 55% of the total cost of the data warehouse [1,12]. In this paper, we focus on the problem of defining ETL processes using XML, in order to make the framework more generic and capable of dealing with heterogeneous source systems. We describe a framework that extracts data from various heterogeneous source systems and carries it in XML files; data cleaning is then performed using a few predefined XML templates and predefined functions, and finally the data is loaded into the data warehouse according to the warehouse schema.
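
The extract, clean, and load stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the template format or the list of predefined functions, so the `<template>` layout, the `CLEANERS` table, the table and column names, and the use of SQLite for both source and warehouse are all assumptions made for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical cleaning functions; the paper's predefined functions are not
# listed in the abstract, so these are stand-ins.
CLEANERS = {
    "strip": lambda s: s.strip(),
    "upper": lambda s: s.upper(),
    "title": lambda s: s.strip().title(),
}

# Hypothetical cleaning template: which functions to apply to which field,
# in the order given by the comma-separated "apply" attribute.
TEMPLATE_XML = """
<template>
  <field name="name" apply="title"/>
  <field name="country" apply="strip,upper"/>
</template>
"""


def extract_to_xml(conn, table):
    """Read every row of `table` and carry it as an XML tree.

    Assumes column names are valid XML element names.
    """
    cur = conn.execute(f"SELECT * FROM {table}")
    columns = [d[0] for d in cur.description]
    root = ET.Element("rows", {"source": table})
    for record in cur:
        row = ET.SubElement(root, "row")
        for col, value in zip(columns, record):
            ET.SubElement(row, col).text = "" if value is None else str(value)
    return root


def clean(root, template_xml):
    """Apply the functions named in the template to the matching fields."""
    template = ET.fromstring(template_xml)
    for field in template.findall("field"):
        funcs = [CLEANERS[name] for name in field.get("apply").split(",")]
        for cell in root.iter(field.get("name")):
            for fn in funcs:
                cell.text = fn(cell.text or "")
    return root


def load(root, conn, table, columns):
    """Insert the cleaned XML rows into the warehouse table."""
    placeholders = ", ".join("?" for _ in columns)
    rows = [
        tuple(row.findtext(c, default="") for c in columns)
        for row in root.findall("row")
    ]
    conn.executemany(
        f"INSERT INTO {table} ({', '.join(columns)}) VALUES ({placeholders})",
        rows,
    )
    conn.commit()
    return len(rows)


def run_demo():
    # In-memory databases stand in for a source system and the warehouse.
    source = sqlite3.connect(":memory:")
    source.execute("CREATE TABLE customers (id INTEGER, name TEXT, country TEXT)")
    source.executemany(
        "INSERT INTO customers VALUES (?, ?, ?)",
        [(1, "  alice smith ", " pk "), (2, "BOB JONES", "us")],
    )
    warehouse = sqlite3.connect(":memory:")
    warehouse.execute("CREATE TABLE dim_customer (id TEXT, name TEXT, country TEXT)")

    xml_rows = clean(extract_to_xml(source, "customers"), TEMPLATE_XML)
    load(xml_rows, warehouse, "dim_customer", ["id", "name", "country"])
    return warehouse.execute(
        "SELECT id, name, country FROM dim_customer ORDER BY id"
    ).fetchall()


if __name__ == "__main__":
    print(run_demo())
```

Keeping the cleaning rules in an XML template rather than in code is what lets the same pipeline run against different sources: only the template and the target column list change, which appears to be the kind of generality the abstract is aiming for.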
