Keynote talk: Experiences with MapReduce, an abstraction for large-scale computation

机译：主题演讲：MapReduce的经验，MapReduce是大规模计算的抽象

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a Map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a Reduce function that merges all intermediate values associated with the same intermediate key. Many real world tasks are expressible in this model. Programs written in this functional style are automatically parallelized and executed on a large cluster of commodity machines. The MapReduce run-time system takes care of the details of partitioning the input data, scheduling the program's execution across a set of machines, handling machine failures, and managing the required inter-machine communication. This allows programmers without any experience with parallel and distributed systems to easily utilize the resources of a large distributed system. Our implementation of MapReduce runs on a large cluster of commodity machines and is highly scalable: a typical MapReduce computation processes many terabytes of data on thousands of machines. Programmers find the system easy to use: thousands of MapReduce programs have been implemented and several thousand thousand MapReduce jobs are executed on Google's clusters every day. In this talk I'll describe the basic programming model, discuss our experience using it in a variety of domains, and talk about the implications of programming models like MapReduce as one paradigm to simplify development of parallel software for multi-core microprocessors.

机译：MapReduce是用于处理和生成大型数据集的编程模型和相关的实现。用户指定一个Map函数处理一个键/值对以生成一组中间键/值对，以及一个Reduce函数，该函数合并与同一中间键关联的所有中间值。在此模型中，许多现实世界的任务都是可以表达的。以这种功能风格编写的程序会自动并行化，并在大型商用机器集群上执行。 MapReduce运行时系统负责划分输入数据，安排程序在一组计算机上的执行，处理计算机故障以及管理所需的计算机间通信的细节。这使没有并行和分布式系统经验的程序员可以轻松利用大型分布式系统的资源。我们对MapReduce的实现可在大型商用机器集群上运行，并且具有高度可扩展性：典型的MapReduce计算可在数千台机器上处理数TB的数据。程序员发现该系统易于使用：每天执行数千个MapReduce程序，每天在Google的集群上执行数千个MapReduce作业。在本次演讲中，我将描述基本的编程模型，讨论我们在各种领域中使用它的经验，并讨论诸如MapReduce这样的编程模型作为简化多核微处理器并行软件开发的一种范例的含义。

著录项

来源
《International conference on Parallel architectures and compilation techniques》|2006年|1-1|共1页
会议地点 Seattle(US)
作者
Jeffrey Dean;
展开▼
作者单位

Google Inc. Mountain View CA USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Google; Programming; Computational modeling; Biological system modeling; Object oriented modeling; Microprocessors; Data models;

机译：谷歌;编程；计算建模；生物系统建模；面向对象的建模；微处理器；资料模型;
入库时间 2022-08-26 14:37:39

相似文献

外文文献
中文文献
专利

1. The MapReduce-based approach to improve the shortest path computation in large-scale road networks: the case of A* algorithm [J] . Wilfried Yves Hamilton Adoni, Tarik Nahhal, Brahim Aghezzaf, Journal of Big Data . 2018,第1期

机译：基于MapReduce的方法来改善大规模道路网络中的最短路径计算：以A *算法为例
2. The Abstraction/Representation Account of Computation and Subjective Experience [J] . Szangolies Jochen Minds and Machines . 2020,第2期

机译：计算和主观体验的抽象/代表叙述
3. Enabling Enriched TV Shopping Experience via Computational and Temporal Aware View-Centric Multimedia Abstraction [J] . Fleites Fausto C., Wang Haohong, Chen Shu-Ching Multimedia, IEEE Transactions on . 2015,第7期

机译：通过计算和时间感知以视图为中心的多媒体抽象来丰富电视购物体验
4. Experiences with MapReduce, an abstraction for large-scale computation [C] . Jeffrey Dean, PJeffrey Dean International conference on Parallel architectures and compilation techniques;PACT . 2006

机译：使用MapReduce的经验，MapReduce是大规模计算的抽象
5. Improving MapReduce performance in large-scale clusters. [D] . Ahmad, Faraz. 2013

机译：改善大型集群中的MapReduce性能。
6. CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological Datasets [O] . Ruiqi Liao, Yifan Zhang, Jihong Guan, 2014

机译：CloudNMF：大规模生物数据集非负矩阵分解的MapReduce实现
7. The MapReduce-based approach to improve the shortest path computation in large-scale road networks: the case of A* algorithm [O] . Wilfried Yves Hamilton Adoni, Tarik Nahhal, Brahim Aghezzaf, 2018

机译：基于MapReduce的方法，提高大型道路网络中最短路径计算：*算法的情况

Keynote talk: Experiences with MapReduce, an abstraction for large-scale computation

摘要

著录项

相似文献

相关主题

期刊订阅