...
首页> 外文期刊>International journal of computational intelligence systems >Materialized View Selection Based on Adaptive Genetic Algorithm and Its Implementation with Apache Hive
【24h】

Materialized View Selection Based on Adaptive Genetic Algorithm and Its Implementation with Apache Hive

机译:基于自适应遗传算法的物化视图选择及其Apache Hive实现

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Frequently accessed views in data warehouses are usually materialized in order to accelerate the speed of querying big data. However, the view materialization itself incurs huge costs. Moreover, some latest products of non-traditional data warehouse software, such as Apache Hive, still lack the support of ma- terialized views. In order to select the appropriate views to be materialized with the possible minimized cost, we propose a novel approach to the materialized view selection problem based on an adaptive ge- netic algorithm. We establish a cost model that integrates the query, maintenance and storage costs to evaluate the performance of approaches and measure the fitness of an individual in the genetic algorithm. In addition, we introduce the adjustable factors for crossover probability and mutation probability, allow- ing the genetic algorithm to run quickly and avoid premature convergence. We also conduct extensive experiments for its implementation with Apache Hive, which query and manage large datasets residing in distributed storage. Both the simulation results and experiments on Apache Hive show that the approx- imately optimal solution for selecting materialized views can be obtained effectively using the approach presented.
机译:通常会在数据仓库中实现经常访问的视图,以加快查询大数据的速度。但是,视图实现本身会带来巨大的成本。而且,一些非传统数据仓库软件的最新产品,例如Apache Hive,仍然缺乏对虚拟化视图的支持。为了选择可能的最小化成本来实现合适的视图,我们提出了一种基于自适应遗传算法的物化视图选择问题的新方法。我们建立了一个成本模型,该模型整合了查询,维护和存储成本,以评估方法的性能并衡量个体在遗传算法中的适应性。此外,我们介绍了交叉概率和变异概率的可调整因子,从而使遗传算法能够快速运行并避免过早收敛。我们还使用Apache Hive对其进行了广泛的实验,该查询和管理驻留在分布式存储中的大型数据集。仿真结果和在Apache Hive上进行的实验均表明,使用所提出的方法可以有效地获得选择物化视图的最佳方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号