Referential Horizontal Partitioning Selection Problem in Data Warehouses: Hardness Study and Selection Algorithms

Bellatreche Ladjel; Boukhalfa Kamel; Richard Pascal; Woameno Komla Yamavo


Abstract

Horizontal partitioning has been widely adopted by the database community and plays a significant part in the physical design process. Most commercial database management systems (DBMS) support it through native Data Definition Language constructs for decomposing tables and materialized views using various partitioning modes. In traditional databases, horizontal partitioning has been studied extensively, and several fragmentation algorithms have been proposed to partition tables in isolation. In a relational data warehouse, horizontal partitioning consists of decomposing the whole warehouse schema into sub-schemas, each containing fragments of the dimension tables and of the fact table. Dimension tables are fragmented using the primary partitioning mode, whereas the fact table is fragmented using the referential mode. In this article, the authors first review the evolution of horizontal partitioning in commercial DBMS, motivated by decision support applications. Second, they formalize the referential fragmentation schema selection problem in the data warehouse and study the hardness of selecting an optimal solution. Because of the problem's high complexity, they develop two algorithms, hill climbing and simulated annealing, with several variants, to select a near-optimal partitioning schema. Finally, extensive experiments are conducted on the APB1 benchmark data set to compare the quality of the proposed algorithms using a mathematical cost model. Based on these experiments, recommendations are given to help database administrators use horizontal partitioning effectively.
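The abstract does not give the details of the algorithms or of the cost model, so the following is only a minimal sketch of the general idea of hill climbing over partitioning schemas, not the authors' method. Everything in it is an assumption made for illustration: the toy domains and queries, the representation of a schema as groups of attribute values, the uniform-data cost function, and the names `hill_climb`, `neighbours`, `cost`, and `max_fragments` (a hypothetical bound on the number of fact fragments). A simulated annealing variant would differ mainly in occasionally accepting a worse neighbour.

```python
from itertools import combinations
from math import prod

# Hypothetical toy inputs: dimension attribute domains and query predicates.
DOMAINS = {
    "season": ("winter", "spring", "summer", "autumn"),
    "gender": ("F", "M"),
}
QUERIES = [
    {"season": {"winter"}},
    {"season": {"summer"}, "gender": {"F"}},
    {"gender": {"M"}},
]

def initial_scheme(domains):
    """Start unpartitioned: one group per attribute holding the whole domain."""
    return {a: (frozenset(vals),) for a, vals in domains.items()}

def n_fragments(scheme):
    """Fact fragments = product of the number of sub-domains per attribute."""
    return prod(len(groups) for groups in scheme.values())

def cost(scheme, queries, domains):
    """Toy cost: summed fraction of fact rows scanned, assuming uniform data."""
    total = 0.0
    for q in queries:
        frac = 1.0
        for attr, wanted in q.items():
            # A sub-domain must be scanned if it overlaps the predicate values.
            touched = [g for g in scheme[attr] if g & wanted]
            frac *= sum(len(g) for g in touched) / len(domains[attr])
        total += frac
    return total

def neighbours(scheme):
    """Yield schemes one move away: isolate one value, or merge two groups."""
    for attr, groups in scheme.items():
        for i, g in enumerate(groups):
            if len(g) > 1:
                rest = groups[:i] + groups[i + 1:]
                for v in g:
                    yield {**scheme, attr: rest + (g - {v}, frozenset({v}))}
        for i, j in combinations(range(len(groups)), 2):
            rest = tuple(g for k, g in enumerate(groups) if k not in (i, j))
            yield {**scheme, attr: rest + (groups[i] | groups[j],)}

def hill_climb(domains, queries, max_fragments):
    """Greedily move to the cheapest feasible neighbour until none improves."""
    current = initial_scheme(domains)
    current_cost = cost(current, queries, domains)
    while True:
        feasible = [s for s in neighbours(current)
                    if n_fragments(s) <= max_fragments]
        if not feasible:
            return current, current_cost
        best = min(feasible, key=lambda s: cost(s, queries, domains))
        best_cost = cost(best, queries, domains)
        if best_cost >= current_cost:
            return current, current_cost
        current, current_cost = best, best_cost
```

On this toy input, `hill_climb(DOMAINS, QUERIES, max_fragments=6)` isolates `winter` and `summer` in their own season groups, splits `gender` into two groups, and lowers the toy cost from 3.0 to 0.875 while staying at six fact fragments.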


Bibliographic details

Source: International Journal of Data Warehousing and Mining, 2009, Issue 4, pp. 1-23 (23 pages)
Authors: Bellatreche Ladjel; Boukhalfa Kamel; Richard Pascal; Woameno Komla Yamavo
Author affiliation: University of Poitiers, France (listed for all four authors)
Original format: PDF
Language: English
Keywords: Data Definition Language; Fragmentation Schema Selection Algorithm; partitioning; Primary Star Join Query Optimization; Referential Horizontal Partitioning


