BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications

机译：BlobSeer：在高并发性下为Hadoop Map-Reduce应用程序带来高吞吐量

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Hadoop is a software framework supporting the Map-Reduce programming model. It relies on the Hadoop Distributed File System (HDFS) as its primary storage system. The efficiency of HDFS is crucial for the performance of Map-Reduce applications. We substitute the original HDFS layer of Hadoop with a new, concurrency-optimized data storage layer based on the BlobSeer data management service. Thereby, the efficiency of Hadoop is significantly improved for data-intensive Map-Reduce applications, which naturally exhibit a high degree of data access concurrency. Moreover, BlobSeer's features (built-in versioning, its support for concurrent append operations) open the possibility for Hadoop to further extend its functionalities. We report on extensive experiments conducted on the Grid'5000 testbed. The results illustrate the benefits of our approach over the original HDFS-based implementation of Hadoop.

机译：Hadoop是一个支持Map-Reduce编程模型的软件框架。它依靠Hadoop分布式文件系统（HDFS）作为其主要存储系统。 HDFS的效率对于Map-Reduce应用程序的性能至关重要。我们将基于BlobSeer数据管理服务的新的并发优化数据存储层替换为Hadoop的原始HDFS层。因此，对于数据密集型Map-Reduce应用程序，Hadoop的效率得到了显着提高，这些应用程序自然展现出高度的数据访问并发性。此外，BlobSeer的功能（内置版本控制，对并发追加操作的支持）为Hadoop进一步扩展其功能提供了可能性。我们报告了在Grid'5000测试床上进行的广泛实验。结果说明了我们的方法相对于基于HDFS的原始Hadoop实现的好处。

著录项

来源
《2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS)》|2010年|P.1-11|共11页
会议地点 Atlanta GA(US);Atlanta GA(US)
作者
Nicolae Bogdan; Moise Diana; Antoniu Gabriel; Bouge Luc; Dorier Matthieu;
展开▼
作者单位

University of Rennes 1, IRISA, Rennes, France;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.133;
关键词
BlobSeer; Data-intensive; Distributed file systems; Hadoop; Heavy access concurrency; High Throughput; Large-scale distributed computing; Map-Reduce-based application;

机译：BlobSeer;数据密集型;分布式文件系统; Hadoop;大量访问并发;高吞吐量;大型分布式计算;基于Map-Reduce的应用程序;

相似文献

外文文献
中文文献
专利

1. Parallel K-Means Implementation for Data Clustering Using Hadoop Map-Reduce [J] . Maithri C, Chandramouli E Journal of computational and theoretical nanoscience . 2018,第11a12期

机译：使用Hadoop地图 - 减少的数据群集并行K-Meanse实现
2. Visualization of Big Data with the Map-Reduce program execution platform: Hadoop [J] . Sara Riahi, Azzeddine Riahi International Journal of Engineering Trends and Technology . 2017,第2期

机译：使用Map-Reduce程序执行平台可视化大数据：Hadoop
3. Investigating Hadoop Architecture and Fault Tolerance in Map-Reduce [J] . Armin Kashkouli, Behzad Soleimani, mina rahbari International journal of computer science and network security . 2017,第6期

机译：在Map-Reduce中研究Hadoop架构和容错
4. BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications [C] . Nicolae B., Moise D., Antoniu G., 2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) . 2010

机译：BlobSeer：在高并发性下为Hadoop Map-Reduce应用程序带来高吞吐量
5. Accelerating Hadoop Map-Reduce for small/intermediate data sizes using the Comet coordination framework [D] . Chaudhari, Shivangi 2009

机译：使用Comet协调框架为小型/中型数据加速Hadoop Map-Reduce
6. A Fast and Scalable Workflow for SNPs Detection in Genome Sequences Using Hadoop Map-Reduce [O] . Muhammad Tahir, Muhammad Sardaraz 2020

机译：使用Hadoop Map-Reduce的基因组序列中SNP检测的快速可扩展工作流
7. BlobSeer: Bringing High Throughput under Heavy Concurrency to Hadoop Map-Reduce Applications [O] . Bogdan Nicolae, Diana Moise, Gabriel Antoniu, 2014

机译：Blobseer：在重度并发下为Hadoop map-Reduce应用程序带来高吞吐量

BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications

摘要

著录项

相似文献

相关主题

期刊订阅