COMPARISON OF TABLE JOIN EXECUTION TIME FOR PARALLEL DBMS AND MAPREDUCE

Abstract

An analysis of existing research indicates that parallel DBMSs are preferred for executing queries over structured data, while MapReduce (MR) is perceived as supplementary to DBMS technology. Using the join task as an example, we examine how a parallel row-store DBMS and the MR system Hadoop behave as we vary parameters that other authors' experiments either hold fixed or set differently from ours. This article presents detailed process models for table joins in a parallel row-store DBMS and in an MR system, together with the results of detailed computational experiments performed on these models. The models were set up for different scalability schemes, for MR (number of nodes) and for the DBMS (data volume per node), with the joined tables fragmented by primary key. The following parameters were varied: selectivity of the queried data, number of sorted result records, and cardinality of the grouping attribute. The modeling results show that, as the stored data volume grows, the parallel DBMS starts to lose to the MR system at certain thresholds.

Bibliographic details

Source
Proceedings of the IASTED international conferences on informatics, 2014, pp. 162-169 (8 pages)
Conference location: Innsbruck (AT)
Authors
Aleksey V. Burdakov; Uriy A. Grigorev; Andrey D. Ploutenko
Author affiliations

Bauman Moscow State Technical University, 2ya Baumanskaya 5, Moscow, Russia 105005; Amur State University, 21 Ignatievskoe sh., Blagoveschensk, Amurskaya obl., Russia 675000

Original format: PDF
Language: English
Keywords
DBMS; SQL; MapReduce technology; table join request; query execution time estimate; execution time comparison;

Date indexed: 2022-08-26 13:51:32

Similar literature

1. Comparison of Checkpointed Aided Parallel Execution Against Mapreduce [J]. Nisha Rani N., Shiju Sathyadevan, Eric Renault. International Journal of Applied Engineering Research, 2015, No. 11.
2. A comparison of parallel large-scale knowledge acquisition using rough set theory on different MapReduce runtime systems [J]. Junbo Zhang, Jian-Syuan Wong, Tianrui Li. Acoustic bulletin, 2014, No. 3.
3. Structured Parallel Efficient Execution Database Management System Over Enormous Dataset with MapReduce using Matlab [J]. Uma Mahesh Kumar Gandham, P. Suresh Varma. Indian Journal of Science and Technology, 2017, No. 20.
4. COMPARISON OF TABLE JOIN EXECUTION TIME FOR PARALLEL DBMS AND MAPREDUCE [C]. Aleksey V. Burdakov, Uriy A. Grigorev, Andrey D. Ploutenko. IASTED International Conference on Parallel and Distributed Computing and Networks, 2014.
5. Parallel Gene Upstream Comparison via Multi-level Hash Tables on GPU [D]. Todd, Andrew. 2016.
6. Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies [O]. Ahmed Abdulhakim Al-Absi, Najeeb Abbas Al-Sammarraie, Wael Mohamed Shaher Yafooz.
7. Comparison Study between MapReduce (MR) and Parallel Data Management Systems (DBMs) in Large Scale Data Analysis [O]. Mchome Miriam Lawrence. 2011.
