Conference: International Conference on Very Large Data Bases

REEF: Retainable Evaluator Execution Framework

Abstract

In this demo proposal, we describe REEF, a framework that makes it easy to implement scalable, fault-tolerant runtime environments for a range of computational models. We will demonstrate diverse workloads, including extract-transform-load MapReduce jobs, iterative machine learning algorithms, and ad-hoc declarative query processing. At its core, REEF builds atop YARN (Apache Hadoop 2's resource manager) to provide retainable hardware resources with lifetimes that are decoupled from those of computational tasks. This allows us to build persistent (cross-job) caches and cluster-wide services, but, more importantly, supports high-performance iterative graph processing and machine learning algorithms. Unlike existing systems, REEF aims for composability of jobs across computational models, providing significant performance and usability gains, even with legacy code. REEF includes a library of interoperable data management primitives optimized for communication and data movement (which are distinct from storage locality). The library also allows REEF applications to access external services, such as user-facing relational databases. We were careful to decouple lower levels of REEF from the data models and semantics of systems built atop it. The result was two new standalone systems: Tang, a configuration manager and dependency injector, and Wake, a state-of-the-art event-driven programming and data movement framework. Both are language independent, allowing REEF to bridge the JVM and .NET.
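
The benefit of decoupling resource lifetimes from task lifetimes can be illustrated with a toy sketch. This is not the REEF API: `RetainedEvaluator`, `load`, `run_task`, `gradient_step`, and `fit` are hypothetical names chosen for illustration, and the "expensive load" is simulated with a counter. The sketch only assumes what the abstract states, namely that a retained resource outlives the individual tasks that run on it, so an iterative algorithm can reuse cached data between iterations.

```python
from typing import Callable, Dict, List


class RetainedEvaluator:
    """Hypothetical stand-in for a retained resource (not the REEF API)."""

    def __init__(self) -> None:
        self.cache: Dict[str, List[float]] = {}  # survives across tasks
        self.loads = 0  # counts simulated expensive loads

    def load(self, key: str) -> List[float]:
        # Load the dataset only if no earlier task on this evaluator cached it.
        if key not in self.cache:
            self.loads += 1
            self.cache[key] = [1.0, 2.0, 3.0, 4.0]  # toy data
        return self.cache[key]

    def run_task(self, task: Callable[["RetainedEvaluator"], float]) -> float:
        # A task is short-lived; the evaluator and its cache outlive it.
        return task(self)


def gradient_step(w: float, lr: float = 0.5) -> Callable[[RetainedEvaluator], float]:
    """Build one iteration of a toy gradient descent that estimates the data mean."""

    def task(evaluator: RetainedEvaluator) -> float:
        data = evaluator.load("training-data")
        grad = sum(w - x for x in data) / len(data)
        return w - lr * grad

    return task


def fit(evaluator: RetainedEvaluator, iterations: int = 20) -> float:
    """Run each iteration as a separate task on the same retained evaluator."""
    w = 0.0
    for _ in range(iterations):
        w = evaluator.run_task(gradient_step(w))
    return w


if __name__ == "__main__":
    ev = RetainedEvaluator()
    estimate = fit(ev)
    print(f"estimate={estimate:.4f}, expensive loads={ev.loads}")
```

Each of the 20 iterations runs as its own task, yet the data is loaded only once because the cache belongs to the evaluator and not to any task. If the resource were released after every task, each iteration would have to reload its input.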
