首页> 外文期刊>Engineering Technology and Applied Science Research >Optimization of ETL Process in Data Warehouse Through a Combination of Parallelization and Shared Cache Memory
【24h】

Optimization of ETL Process in Data Warehouse Through a Combination of Parallelization and Shared Cache Memory

机译:通过并行化和共享高速缓存的组合优化数据仓库中的ETL过程

获取原文
       

摘要

Extraction, Transformation and Loading (ETL) is introduced as one of the notable subjects in optimization, management, improvement and acceleration of processes and operations in data bases and data warehouses. The creation of ETL processes is potentially one of the greatest tasks of data warehouses and so its production is a time-consuming and complicated procedure. Without optimization of these processes, the implementation of projects in data warehouses area is costly, complicated and time-consuming. The present paper used the combination of parallelization methods and shared cache memory in systems distributed on the basis of data warehouse. According to the conducted assessment, the proposed method exhibited 7.1% speed improvement to kattle optimization instrument and 7.9% to talend instrument in terms of implementation time of the ETL process. Therefore, parallelization could notably improve the ETL process. It eventually caused the management and integration processes of big data to be implemented in a simple way and with acceptable speed.
机译:引入提取,转换和加载(ETL)作为数据库,数据仓库中流程和操作的优化,管理,改进和加速的显着主题之一。 ETL流程的创建可能是数据仓库的最大任务之一,因此其生产是一项耗时且复杂的过程。如果不对这些过程进行优化,则在数据仓库区域中实施项目会变得昂贵,复杂且耗时。本文在基于数据仓库的分布式系统中,结合了并行化方法和共享缓存的使用。根据进行的评估,从ETL流程的实施时间来看,该方法相对于水壶优化仪器而言,速度提高了7.1%,对较弱仪器而言,速度提高了7.9%。因此,并行化可以显着改善ETL过程。最终导致以简单的方式并以可接受的速度实施大数据的管理和集成过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号