BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications

机译：Blobseer：将繁重的吞吐量带到Hadoop地图 - 减少应用程序下

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Hadoop is a software framework supporting the Map-Reduce programming model. It relies on the Hadoop Distributed File System (HDFS) as its primary storage system. The efficiency of HDFS is crucial for the performance of Map-Reduce applications. We substitute the original HDFS layer of Hadoop with a new, concurrency-optimized data storage layer based on the BlobSeer data management service. Thereby, the efficiency of Hadoop is significantly improved for data-intensive Map-Reduce applications, which naturally exhibit a high degree of data access concurrency. Moreover, BlobSeer's features (built-in versioning, its support for concurrent append operations) open the possibility for Hadoop to further extend its functionalities. We report on extensive experiments conducted on the Grid'5000 testbed. The results illustrate the benefits of our approach over the original HDFS-based implementation of Hadoop.

机译：Hadoop是一种支持地图减少编程模型的软件框架。它依赖于Hadoop分布式文件系统（HDFS）作为其主存储系统。 HDFS的效率对于Map-Deally应用程序的性能至关重要。我们使用新的并发优化的数据存储层替换Hadoop的原始HDFS层，基于Blobse数据管理服务。因此，对于数据密集型地图减少应用，Hadoop的效率显着提高，这自然地表现出高度的数据访问并发性。此外，Blobseer的功能（内置版本控制，它对并发追加操作的支持）打开Hadoop的可能性，以进一步扩展其功能。我们报告了在网格5000试验台上进行的广泛实验。结果说明了我们对Hadoop的原始HDFS实施的方法的好处。

著录项

来源
《IEEE International Symposium on Parallel Distributed Processing》|2010年||共11页
会议地点
作者
Nicolae B.; Moise D.; Antoniu G.; Bouge L.; Dorier M.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.138-53;
关键词
BlobSeer; Data-intensive; Distributed file systems; Hadoop; Heavy access concurrency; High Throughput; Large-scale distributed computing; Map-Reduce-based application;

机译：Blobseer;数据密集型;分布式文件系统;Hadoop;沉重的访问并发;高吞吐量;大规模分布式计算;地图 - 基于地图的应用程序;

相似文献

外文文献
中文文献
专利

1. Parallel K-Means Implementation for Data Clustering Using Hadoop Map-Reduce [J] . Maithri C, Chandramouli E Journal of computational and theoretical nanoscience . 2018,第11a12期

机译：使用Hadoop地图 - 减少的数据群集并行K-Meanse实现
2. Visualization of Big Data with the Map-Reduce program execution platform: Hadoop [J] . Sara Riahi, Azzeddine Riahi International Journal of Engineering Trends and Technology . 2017,第2期

机译：使用Map-Reduce程序执行平台可视化大数据：Hadoop
3. Investigating Hadoop Architecture and Fault Tolerance in Map-Reduce [J] . Armin Kashkouli, Behzad Soleimani, mina rahbari International journal of computer science and network security . 2017,第6期

机译：在Map-Reduce中研究Hadoop架构和容错
4. BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications [C] . Nicolae Bogdan, Moise Diana, Antoniu Gabriel, 2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) . 2010

机译：BlobSeer：在高并发性下为Hadoop Map-Reduce应用程序带来高吞吐量
5. Accelerating Hadoop Map-Reduce for small/intermediate data sizes using the Comet coordination framework [D] . Chaudhari, Shivangi 2009

机译：使用Comet协调框架为小型/中型数据加速Hadoop Map-Reduce
6. A Fast and Scalable Workflow for SNPs Detection in Genome Sequences Using Hadoop Map-Reduce [O] . Muhammad Tahir, Muhammad Sardaraz 2020

机译：使用Hadoop Map-Reduce的基因组序列中SNP检测的快速可扩展工作流
7. BlobSeer: Bringing High Throughput under Heavy Concurrency to Hadoop Map-Reduce Applications [O] . Bogdan Nicolae, Diana Moise, Gabriel Antoniu, 2014

机译：Blobseer：在重度并发下为Hadoop map-Reduce应用程序带来高吞吐量

BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications

摘要

著录项

相似文献

相关主题

期刊订阅