Generate-map-reduce: An extension to map-reduce to support shared data and recursive computations

Janakiram Dharanipragada; Geeta Iyer; Sriram Kailasam



Abstract

It is difficult to express the parallelism present in complex computations by using existing higher level abstractions such as MapReduce and Dryad. These computations include applications from wide variety of domains, like Artificial Intelligence, Decision Tree Algorithms, Association Rule Mining, Recommender Systems, Graph Algorithms, Clustering Algorithms, Compute Intensive Scientific Workflows, Optimization Algorithms, and so forth. Their execution graphs introduce new challenges in terms of programmer expressibility and runtime performance such as iterative and recursive computations, shared communication model, and so forth. We propose an extension to MapReduce, called Generate-Map-Reduce (GMR), targeted towards modeling these applications. GMR introduces a new Generate abstraction into the MapReduce framework that captures recursive computations. The runtime also supports iterative jobs and a distributed communication model by using shared data structures. We illustrate recursive computations with GMR by modeling complex applications such as simulated annealing, A* search, and adaptive quadrature computation that require recursive spawning of new tasks to handle variable degree of parallelism. GMR runtime supports caching of common data across iterations in memory and local disks. We illustrate how this caching helps in achieving significant speedup for iterative computations by modeling k-means clustering.
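The abstract names adaptive quadrature as one computation that needs recursive spawning of new tasks. The sketch below illustrates that general idea in plain Python: a map phase estimates each interval, a reduce phase sums the accepted estimates, and a generate phase spawns two new tasks for every interval that has not converged. It is only an illustration under stated assumptions. The names `map_task`, `generate_phase`, `reduce_phase`, and `gmr_integrate` are hypothetical and are not the GMR API, the phases run sequentially rather than on a cluster, and the adaptive Simpson rule is a choice made here, not one confirmed by the abstract.

```python
import math
from typing import Callable, List, Tuple

Task = Tuple[float, float, float]                  # (lo, hi, tolerance)
Result = Tuple[float, float, float, float, bool]   # task fields + estimate + converged flag


def simpson(f: Callable[[float], float], lo: float, hi: float) -> float:
    """Simpson's rule estimate of the integral of f over [lo, hi]."""
    mid = (lo + hi) / 2.0
    return (hi - lo) / 6.0 * (f(lo) + 4.0 * f(mid) + f(hi))


def map_task(f: Callable[[float], float], task: Task) -> Result:
    """Map: estimate one interval and flag whether the estimate is accepted."""
    lo, hi, tol = task
    mid = (lo + hi) / 2.0
    coarse = simpson(f, lo, hi)
    fine = simpson(f, lo, mid) + simpson(f, mid, hi)
    # Standard adaptive Simpson acceptance test.
    return lo, hi, tol, fine, abs(fine - coarse) <= 15.0 * tol


def generate_phase(results: List[Result]) -> List[Task]:
    """Generate (hypothetical): spawn two sub-tasks per unconverged interval."""
    new_tasks: List[Task] = []
    for lo, hi, tol, _, converged in results:
        if not converged:
            mid = (lo + hi) / 2.0
            new_tasks += [(lo, mid, tol / 2.0), (mid, hi, tol / 2.0)]
    return new_tasks


def reduce_phase(results: List[Result]) -> float:
    """Reduce: sum the estimates of the intervals that converged."""
    return sum(est for _, _, _, est, converged in results if converged)


def gmr_integrate(f: Callable[[float], float], a: float, b: float,
                  tol: float = 1e-8, max_rounds: int = 40) -> float:
    tasks: List[Task] = [(a, b, tol)]
    total = 0.0
    for _ in range(max_rounds):
        if not tasks:
            break
        results = [map_task(f, t) for t in tasks]  # would run in parallel on a cluster
        total += reduce_phase(results)
        tasks = generate_phase(results)            # number of tasks varies per round
    # Round limit reached: fall back to a coarse estimate for leftover tasks.
    total += sum(simpson(f, lo, hi) for lo, hi, _ in tasks)
    return total


if __name__ == "__main__":
    print(gmr_integrate(math.sin, 0.0, math.pi))
```

The number of tasks produced by `generate_phase` differs from round to round, which is the "variable degree of parallelism" the abstract refers to; a fixed map-reduce job would have to know the task count up front.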


Bibliographic details

Source
Concurrency and computation: practice and experience | 2014, Issue 2 | pp. 561-585 | 25 pages
Authors
Janakiram Dharanipragada; Geeta Iyer; Sriram Kailasam
Author affiliations

Dept. of CSE, IIT Madras, Chennai 600036, India;

Dept. of CSE, IIT Madras, Chennai 600036, India;

Dept. of CSE, IIT Madras, Chennai 600036, India;

Original format: PDF
Language: English
Keywords
cloud computing; MapReduce; recursive computations; A* search; adaptive quadrature; shared data structures

