Design and Evaluation of Network-Levitated Merge for Hadoop Acceleration

Weikuan Yu; Yandong Wang; Xinyu Que

首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Design and Evaluation of Network-Levitated Merge for Hadoop Acceleration

【24h】

Design and Evaluation of Network-Levitated Merge for Hadoop Acceleration

机译：Hadoop加速的网络悬浮合并的设计和评估

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Hadoop is a popular open source implementation of the MapReduce programming model for cloud computing. However, it faces a number of issues to achieve the best performance from the underlying systems. These include a serialization barrier that delays the reduce phase, repetitive merges, and disk accesses, and the lack of portability to different interconnects. To keep up with the increasing volume of data sets, Hadoop also requires efficient I/O capability from the underlying computer systems to process and analyze data. We describe Hadoop-A, an acceleration framework that optimizes Hadoop with plug-in components for fast data movement, overcoming the existing limitations. A novel network-levitated merge algorithm is introduced to merge data without repetition and disk access. In addition, a full pipeline is designed to overlap the shuffle, merge, and reduce phases. Our experimental results show that Hadoop-A significantly speeds up data movement in MapReduce and doubles the throughput of Hadoop. In addition, Hadoop-A significantly reduces disk accesses caused by intermediate data.

机译：Hadoop是用于云计算的MapReduce编程模型的流行开源实现。但是，要从基础系统中获得最佳性能，将面临许多问题。这些包括延迟延迟阶段，重复合并和磁盘访问的序列化障碍，以及对不同互连的可移植性不足。为了跟上不断增长的数据集数量，Hadoop还需要底层计算机系统的有效I / O功能来处理和分析数据。我们描述了Hadoop-A，这是一个加速框架，它使用插件组件优化Hadoop以实现快速数据移动，克服了现有限制。引入了一种新颖的网络悬浮合并算法，无需重复和磁盘访问即可合并数据。另外，完整的流水线被设计为与改组，合并和减少阶段重叠。我们的实验结果表明，Hadoop-A显着加快了MapReduce中的数据移动，并使Hadoop的吞吐量增加了一倍。此外，Hadoop-A大大减少了由中间数据引起的磁盘访问。

著录项

来源
《IEEE Transactions on Parallel and Distributed Systems 》 |2014年第3期| 602-611| 共10页
作者
Weikuan Yu; Yandong Wang; Xinyu Que;
展开▼
作者单位

Dept. of Comput. Sci. & Software Eng., Auburn Univ., Auburn, AL, USA|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Hadoop; Hadoop acceleration; MapReduce; cloud computing; network-levitated merge;

机译：Hadoop;Hadoop加速;MapReduce;云计算;网络悬浮合并;

相似文献

外文文献
中文文献
专利

1. Design, implementation and evaluation of a medical English-Chinese parallel corpus based on hadoop and XMPP [J] . Yang M., Bao S. B., Meng F. Q. Basic & clinical pharmacology & toxicology. . 2019 ,第S7期

机译：基于Hadoop和XMPP的医学英汉并联语料库的设计，实施和评估
2. Design of Student Capability Evaluation System Merging Blockchain Technology [J] . Wenshuang Zhao, Kun Liu, Kun Ma Journal of Physics: Conference Series . 2019 ,第3期

机译：融合区块链技术的学生能力评估系统设计
3. Design and performance evaluation of inerter-based tuned mass dampers for a ground acceleration excited structure [J] . Javidialesaadi Abdollah, Wierschem Nicholas E. Soil Dynamics and Earthquake Engineering . 2021 ,第Jana期

机译：基于终端调谐质量阻尼器的地面加速励磁结构的设计与性能评估
4. Hadoop acceleration through network levitated merge [C] . Wang Yandong, Que Xinyu, Yu Weikuan, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis . 2011

机译：通过网络悬浮合并加速Hadoop
5. Freeway merging behaviour and safety of acceleration lanes: Field study. [D] . Ahammed, Mohammad Alauddin. 2005

机译：高速公路合并行为与加速车道的安全性：实地研究。
6. Merging pathology with biomechanics using CHIMERA (Closed-Head Impact Model of Engineered Rotational Acceleration): a novel surgery-free model of traumatic brain injury [O] . Dhananjay R Namjoshi, Wai Hang Cheng, Kurt A McInnes, 2014

机译：使用CHIMERA（工程旋转加速度的闭合头冲击模型）将病理学与生物力学相结合：创伤性脑损伤的新型免手术模型
7. A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop [O] . Ravi Kumar A Et. al. 2021

机译：使用Hadoop进行生物信息学数据性能评估模型的设计与开发综述

Design and Evaluation of Network-Levitated Merge for Hadoop Acceleration

摘要

著录项

相似文献

相关主题

期刊订阅