Implicit Parallelism through Deep Language Embedding

Alexandrov Alexander; Katsifodimos Asterios; Krastev Georgi; Markl Volker

首页> 外文期刊>SIGMOD record >Implicit Parallelism through Deep Language Embedding

【24h】

Implicit Parallelism through Deep Language Embedding

机译：通过深度语言嵌入隐式并行

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Parallel collection processing based on second-order functions such as map and reduce has been widely adopted for scalable data analysis. Initially popularized by Google, over the past decade this programming paradigm has found its way in the core APIs of parallel dataflow engines such as Hadoop's MapReduce, Spark's RDDs, and Flink's DataSets. We review programming patterns typical of these APIs and discuss how they relate to the underlying parallel execution model. We argue that fixing the abstraction leaks exposed by these patterns will reduce the cost of data analysis due to improved programmer productivity. To achieve that, we first revisit the algebraic foundations of parallel collection processing. Based on that, we propose a simplified API that (i) provides proper support for nested collection processing and (ii) alleviates the need of certain second-order primitives through comprehensions - a declarative syntax akin to SQL. Finally, we present a metaprogramming pipeline that performs algebraic rewrites and physical optimizations which allow us to target parallel dataflow engines like Spark and Flink with competitive performance.

机译：基于可映射数据分析等二阶函数的并行收集处理已被广泛采用。在谷歌最初普及之后，在过去的十年中，这种编程范例已在并行数据流引擎（如Hadoop的MapReduce，Spark的RDD和Flink的DataSet）的核心API中找到了自己的方式。我们将回顾这些API的典型编程模式，并讨论它们与底层并行执行模型之间的关系。我们认为，由于提高了程序员的生产率，解决这些模式所暴露的抽象泄漏将降低数据分析的成本。为此，我们首先回顾并行收集处理的代数基础。在此基础上，我们提出了一种简化的API，该API（i）为嵌套集合处理提供适当的支持，并且（ii）通过理解（类似于SQL的声明性语法）减轻某些二阶基元的需要。最后，我们提供了一个元代编程管道，该管道执行代数重写和物理优化，使我们能够针对并行数据流引擎（例如Spark和Flink）提供具有竞争力的性能。

著录项

来源
《SIGMOD record》 |2016年第1期|51-58|共8页
作者
Alexandrov Alexander; Katsifodimos Asterios; Krastev Georgi; Markl Volker;
展开▼
作者单位

TU Berlin, Berlin, Germany;

TU Berlin, Berlin, Germany;

TU Berlin, Berlin, Germany;

TU Berlin, Berlin, Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Technical Perspective - Implicit Parallelism through Deep Language Embedding [J] . Ives Zachary G. SIGMOD record . 2016,第1期

机译：技术观点-通过深度语言嵌入的隐式并行
2. Combining deep and shallow embedding of domain-specific languages [J] . Svenningsson Josef, Axelsson Emil Computer Languages, Systems & Structures . 2015,第DECaPTaB期

机译：结合深度和浅层嵌入特定领域的语言
3. Folding Domain-Specific Languages: Deep and Shallow Embeddings [J] . Gibbons Jeremy, Wu Nicolas ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2014,第9期

机译：折叠领域特定的语言：深浅的嵌入
4. Exploiting implicit parallelism of logic languages with the SAM [C] . Giancarlo Succi ACM/SIGAPP symposium on Applied computing . 1992

机译：利用SAM利用逻辑语言的隐式并行性
5. Accelerating Decoupled Look-ahead to Exploit Implicit Parallelism. [D] . Parihar, Raj. 2016

机译：加速解耦前瞻以利用隐式并行性。
6. Prosodic Parallelism—Comparing Spoken and Written Language [O] . Richard Wiese -1

机译：韵律平行主义-口语和书面语比较
7. Complementing user-level coarse-grain parallelism with implicit speculative parallelism [O] . Nikolas Ioannou, Marcelo Cintra 2011

机译：用隐式推测并行性补充用户级粗粒并行性
8. Some Language Issues in High Performance Computing: Translation from Fine-grained Parallelism to Coarse-grained Parallelism [R] . Goudy, S. 2006

机译：高性能计算中的一些语言问题：从细粒度并行到粗粒度并行的转换

Implicit Parallelism through Deep Language Embedding

摘要

著录项

相似文献

相关主题

期刊订阅