首页> 美国卫生研究院文献>other >Optimizing Interactive Development of Data-Intensive Applications

【2h】

Optimizing Interactive Development of Data-Intensive Applications

机译：优化数据密集型应用程序的交互式开发

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Modern Data-Intensive Scalable Computing (DISC) systems are designed to process data through batch jobs that execute programs (e.g., queries) compiled from a high-level language. These programs are often developed interactively by posing ad-hoc queries over the base data until a desired result is generated. We observe that there can be significant overlap in the structure of these queries used to derive the final program. Yet, each successive execution of a slightly modified query is performed anew, which can significantly increase the development cycle. Vega is an Apache Spark framework that we have implemented for optimizing a series of similar Spark programs, likely originating from a development or exploratory data analysis session. Spark developers (e.g., data scientists) can leverage Vega to significantly reduce the amount of time it takes to re-execute a modified Spark program, reducing the overall time to market for their Big Data applications.

机译：现代数据密集型可伸缩计算（DISC）系统旨在通过批处理作业来处理数据，这些批处理作业执行从高级语言编译的程序（例如查询）。这些程序通常是通过对基础数据进行临时查询直到生成所需结果来交互式开发的。我们观察到，用于得出最终程序的这些查询的结构可能存在重大重叠。但是，重新执行每次稍加修改的查询都会重新执行，这可能会大大增加开发周期。 Vega是我们已实现的Apache Spark框架，用于优化一系列类似的Spark程序，这些程序可能源自开发或探索性数据分析会话。 Spark开发人员（例如，数据科学家）可以利用Vega来大大减少重新执行经过修改的Spark程序所需的时间，从而减少其大数据应用程序的总体上市时间。

著录项

期刊名称 other
作者
Matteo Interlandi; Sai Deep Tetali; Muhammad Ali Gulzar; Joseph Noor; Tyson Condie; Miryung Kim; Todd Millstein;
展开▼
作者单位

展开▼
年(卷),期 -1(2016),-1
年度 -1
页码 510–522
总页数 36
原文格式 PDF
正文语种
中图分类
关键词
Query Rewriting Incremental Evaluation Spark Interactive Development Big Data;

机译：查询重写;增量评估;Spark;交互开发;大数据;

相似文献

外文文献
中文文献
专利

1. Collaborative Optimization of Service Composition for Data-Intensive Applications in a Hybrid Cloud [J] . Ma Hua, Zhu Haibin, Li Keqin, IEEE Transactions on Parallel and Distributed Systems . 2019,第5期

机译：混合云中数据密集型应用程序的服务组合协同优化
2. Collaborative Optimization of Service Composition for Data-Intensive Applications in a Hybrid Cloud [J] . Ma Hua, Zhu Haibin, Li Keqin, IEEE Transactions on Parallel and Distributed Systems . 2019,第5期

机译：混合云中数据密集型应用的服务组合的协作优化
3. Scheduling Data-Intensive Work-Flow Applications Using Variable Neighborhood Particle Swarm Optimization [J] . HONGBO LIU, AJITH ABRAHAM, OKKYUNG CHOI, WSEAS Transactions on Circuits and Systems . 2006,第8期

机译：使用可变邻域粒子群优化调度数据密集型工作流应用程序
4. Transfer Scheduling Schemes for Data-Intensive, Interactive Applications [C] . Takizawa, M., Shimizu, GLOBECOM 2007, 2007 IEEE Global Telecommunications Conference . 2007

机译：数据密集型交互式应用的传输计划方案
5. An Interactive Design Framework Based on Data-Intensive Simulations: Implementation and Application to Device-Tissue Interaction Design Problems. [D] . Lin, Chi-Lun. 2015

机译：基于数据密集型仿真的交互式设计框架：对设备-组织交互设计问题的实现和应用。
6. Hybrid Clouds for Data-Intensive 5G-Enabled IoT Applications: An Overview Key Issues and Relevant Architecture [O] . Panagiotis Trakadas, Nikolaos Nomikos, Emmanouel T. Michailidis, 2019

机译：适用于数据密集型启用5G的IoT应用的混合云：概述关键问题和相关架构
7. Optimizing VM allocation and data placement for data-intensive applications in cloud using ACO metaheuristic algorithm [O] . T.P. Shabeera, S.D. Madhu Kumar, Sameera M. Salam, 2017

机译：使用aCO元启发式算法优化数据密集型云应用程序的Vm分配和数据放置

Optimizing Interactive Development of Data-Intensive Applications

摘要

著录项

相似文献

相关主题

期刊订阅