DMBVA - A Compression-Based Distributed Data Warehouse Management In Parallel Environment

Abu Sayed Md. Latiful Hoque; Fazlul Hasan Siddiqui

首页> 外文期刊>Malaysian Journal of Computer Science >DMBVA - A Compression-Based Distributed Data Warehouse Management In Parallel Environment

【24h】

DMBVA - A Compression-Based Distributed Data Warehouse Management In Parallel Environment

机译：DMBVA-并行环境中基于压缩的分布式数据仓库管理

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Parallel and distributed data warehouse architectures have been evolved to support online queries on massive data in a short time. Unfortunately, the emergence of e-application has been creating extremely high volume of data that reaches to terabyte threshold. The conventional data warehouse management system is costlier in terms of storage space and processing speed and sometimes it is unable to handle such huge amount of data. As a result, there is a crucial need for the new algorithms and techniques to store and manipulate these data. In this paper, we have presented a compression-based distributed data warehouse architecture – ‘DMBVA’ for storage of warehouse data, and support online queries efficiently. We have achieved a factor of 25-30 compression compared to SQL server data warehouse. The main computational component of data warehouse is the generation and querying on the data cube. Our algorithm – ‘PCVDC’ generates data cube directly from the compressed form of data in parallel. The reduction in the size of data cube is a factor of 30-45 compared to existing methods. The response time has also been significantly improved. These improvements are achieved by eliminating the suffix and prefix redundancy, virtual nature of the data cube, direct addressability of compressed form of data and parallel computation. Experimental evaluation shows the improved performance over the existing systems.

机译：并行和分布式数据仓库体系结构已经得到发展，可以在短时间内支持对大量数据的在线查询。不幸的是，电子应用程序的出现一直在创建大量数据，达到TB阈值。传统的数据仓库管理系统在存储空间和处理速度方面较为昂贵，有时无法处理如此大量的数据。结果，迫切需要用于存储和操纵这些数据的新算法和技术。在本文中，我们介绍了一种基于压缩的分布式数据仓库体系结构-“ DMBVA”，用于存储仓库数据，并有效地支持在线查询。与SQL Server数据仓库相比，我们实现了25-30的压缩率。数据仓库的主要计算组件是对数据多维数据集的生成和查询。我们的算法“ PCVDC”直接从压缩数据的并行形式直接生成数据立方体。与现有方法相比，数据立方体的大小减少了30-45倍。响应时间也得到了明显改善。这些改进是通过消除后缀和前缀冗余，数据多维数据集的虚拟特性，数据压缩形式的直接寻址能力以及并行计算来实现的。实验评估表明，与现有系统相比，性能有所提高。

著录项

来源
《Malaysian Journal of Computer Science》 |2007年第1期|共页
作者
Abu Sayed Md. Latiful Hoque; Fazlul Hasan Siddiqui;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类情报学、情报工作;
关键词

相似文献

外文文献
中文文献
专利

1. Data Warehouse using Parallel Processing on a Distributed Environment [J] . WALDEMAR RUGGIERO JUNIOR, LIRIA MATSUMOTO SATO WSEAS Transactions on Computers . 2007,第5期

机译：在分布式环境中使用并行处理的数据仓库
2. Special Issue on Managing, Evolving and Distributing Data Warehouses and OLAP Data Cubes in Novel Environments [J] . Alfredo Cuzzocrea, Il-Yeol Song International Journal of Data Warehousing and Mining . 2013,第3期

机译：在新型环境中管理，发展和分发数据仓库和OLAP数据立方体的特刊
3. A Survey of Parallel and Distributed Data Warehouses [J] . Furtado Pedro International Journal of Data Warehousing and Mining . 2009,第2期

机译：并行和分布式数据仓库概览
4. A Global Paradigm for Designing Parallel Relational Data Warehouses in Distributed Environments [C] . Soumia Benkrid, Ladjel Bellatreche, Alfredo Cuzzocrea East European conference on advances in databases and information systems . 2014

机译：分布式环境中设计并行关系数据仓库的全局范例
5. Distributed agent management in a parallel simulation and analysis environment. [D] . Wasous, Cherie Lee. 2014

机译：并行仿真和分析环境中的分布式代理管理。
6. A late-binding distributed NoSQL warehouse for integrating patient data from clinical trials [O] . Eric Yang, Jeremy D Scheff, Shih C Shen, 2019

机译：后期绑定的分布式NoSQL仓库用于集成临床试验中的患者数据
7. DMBVA- A COMPRESSION-BASED DISTRIBUTED DATA WAREHOUSE MANAGEMENT IN PARALLEL ENVIRONMENT [O] . Fazlul Hasan Siddiqui, Abu Sayed, Md. Latiful Hoque 2015

机译：DmBVa-基于压缩的并行环境中的分布式数据仓库管理

DMBVA - A Compression-Based Distributed Data Warehouse Management In Parallel Environment

摘要

著录项

相似文献

相关主题

期刊订阅