...
首页> 外文期刊>Operating systems review >GLADE: A Scalable Framework for Efficient Analytics
【24h】

GLADE: A Scalable Framework for Efficient Analytics

机译:GLADE:高效分析的可扩展框架

获取原文
获取原文并翻译 | 示例

摘要

In this paper we introduce GLADE, a scalable distributed framework for large scale data analytics. GLADE consists of a simple user-interface to define Generalized Linear Aggregates (GLA), the fundamental abstraction at the core of GLADE, and a distributed runtime environment that executes GLAs by using parallelism extensively. GLAs are derived from User-Defined Aggregates (UDA), a relational database extension that allows the user to add specialized aggregates to be executed inside the query processor. GLAs extend the UDA interface with methods to Serialize/Deserialize the state of the aggregate required for distributed computation. As a significant departure from UDAs which can be invoked only through SQL, GLAs give the user direct access to the state of the aggregate, thus allowing for the computation of significantly more complex aggregate functions. GLADE runtime is an execution engine optimized for the GLA computation. The runtime takes the user-defined GLA code, compiles it inside the engine, and executes it right near the data by taking advantage of parallelism both inside a single machine as well as across a cluster of computers. This results in maximum possible execution time performance (all our experimental tasks are I/O-bound) and linear scaleup.
机译:在本文中,我们介绍GLADE,这是一种用于大规模数据分析的可扩展分布式框架。 GLADE包含一个用于定义广义线性聚合(GLA)的简单用户界面,GLADE核心的基本抽象以及一个通过广泛使用并行性执行GLA的分布式运行时环境。 GLA源自用户定义的聚合(UDA),这是一个关系数据库扩展,允许用户添加要在查询处理器内部执行的专用聚合。 GLA使用方法对UDA接口进行扩展,以对分布式计算所需的聚合状态进行序列化/反序列化。与只能通过SQL调用的UDA明显不同,GLA使用户可以直接访问聚合的状态,从而允许计算复杂得多的聚合函数。 GLADE运行时是为GLA计算而优化的执行引擎。运行时采用用户定义的GLA代码,在引擎内部对其进行编译,并通过利用单个计算机内部以及跨计算机集群的并行性在数据附近执行该代码。这样可以最大程度地提高执行时间性能(我们所有的实验任务都受I / O限制)和线性扩展。

著录项

  • 来源
    《Operating systems review 》 |2012年第1期| p.12-18| 共7页
  • 作者

    Florin Rusu; Alin Dobra;

  • 作者单位

    University of California, Merced 5200 N Lake Road Merced, CA 95343;

    University of Florida PO Box 116120 Gainesville, FL 32611;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号