首页> 外文OA文献 >Materialized View Selection Based on Adaptive Genetic Algorithm and Its Implementation with Apache Hive
【2h】

Materialized View Selection Based on Adaptive Genetic Algorithm and Its Implementation with Apache Hive

机译:基于自适应遗传算法的物质化视图选择及其与Apache Hive实现

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Frequently accessed views in data warehouses are usually materialized in order to accelerate the speed of querying big data. However, the view materialization itself incurs huge costs. Moreover, some latest products of non-traditional data warehouse software, such as Apache Hive, still lack the support of ma- terialized views. In order to select the appropriate views to be materialized with the possible minimized cost, we propose a novel approach to the materialized view selection problem based on an adaptive ge- netic algorithm. We establish a cost model that integrates the query, maintenance and storage costs to evaluate the performance of approaches and measure the fitness of an individual in the genetic algorithm. In addition, we introduce the adjustable factors for crossover probability and mutation probability, allow- ing the genetic algorithm to run quickly and avoid premature convergence. We also conduct extensive experiments for its implementation with Apache Hive, which query and manage large datasets residing in distributed storage. Both the simulation results and experiments on Apache Hive show that the approx- imately optimal solution for selecting materialized views can be obtained effectively using the approach presented.
机译:通常会化数据仓库中的频繁访问视图,以便加速查询大数据的速度。然而,视野实现本身会引发巨额成本。此外,某些最新产品的非传统数据仓库软件(如Apache Hive)仍然缺乏对MA-Terialized视图的支持。为了选择具有可能最小化成本的适当视图,我们提出了一种基于自适应地GE - 算法的物流化视图选择问题的新方法。我们建立了一项成本模型,集成了查询,维护和储存成本,以评估方法的性能并测量遗传算法中个体的ï。此外,我们介绍了交叉概率和突变概率的可调因素,允许遗传算法快速运行并避免过早收敛。我们还对Apache Hive实现了广泛的实验,该实验与Apache Hive进行了查询和管理驻留在分布式存储中的大型数据集。 Apache Hive上的仿真结果和实验表明,可以使用所呈现的方法有效地获得用于选择实施物化视图的大约最佳解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号