An Efficient Foundation for Big Data Processing on Modern Clusters.

Abstract

In recent years, the world has seen an explosion in the amount of data being generated. Google proposed the MapReduce framework to allow programmers to easily process massive amounts of data in parallel using a cluster of shared-nothing commodity machines. What started out as a tool for human efficiency subsequently began to be used as an intermediate representation for queries compiled from higher-level declarative languages. In this thesis, we present an alternative software stack for building scalable Big Data systems. We focus specifically on two parts of the stack. Hyracks is a new partitioned-parallel runtime layer that provides an efficient, generalized model for executing data-processing jobs on a cluster of commodity machines. Algebricks is a compiler framework that helps build compilers for high-level declarative languages, targeting parallel processing on top of Hyracks.

Bibliographic details

Author: Borkar, Vinayak
Affiliation: University of California, Irvine
Degree-granting institution: University of California, Irvine
Subject: Computer science
Degree: Ph.D.
Year: 2016
Pages: 119
Original format: PDF
Language: English

Similar literature

1. The application of modern process-data captures systems as an efficient tool for the control, gathering and analysis of foundry-specific data [J]. Ronald Schneider, Matthias Kuhne, Michael Coldltz. Casting Plant and Technology International, 2003, No. 3.
2. A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units [J]. Kreutzer Moritz, Hager Georg, Wellein Gerhard. SIAM Journal on Scientific Computing, 2014, No. 5.
3. Foundations of Modern Query Languages for Graph Databases [J]. Angles Renzo, Arenas Marcelo, Barcelo Pablo. ACM Computing Surveys, 2017, No. 5.
4. Towards Foundations of Processing Imprecise Data: From Traditional Statistical Techniques of Processing Crisp Data to Statistical Processing of Fuzzy Data [C]. Hung T. Nguyen, Tonghui Wang, Vladik Kreinovich. International Conference on Fuzzy Information Processing: Theories and Applications, vol. 2, March 1-4, 2003, Beijing (CN).
5. Efficient query processing for modern data management [D]. Srivastava, Utkarsh Hriday. 2006.
6. Clinical Natural Language Processing in 2014: Foundational Methods Supporting Efficient Healthcare [O]. A. Névéol, P. Zweigenbaum, P Biyani. 2015.
7. Foundations of Statistical Processing of Set-Valued Data: Towards Efficient Algorithms [O]. Nguyen Hung T., Kreinovich Vladik, Xiang Gang. 2004.
