首页> 外文会议>International Conference on Applied System Innovation >Resilient distributed computing platforms for big data analysis using Spark and Hadoop

【24h】

Resilient distributed computing platforms for big data analysis using Spark and Hadoop

机译：使用Spark和Hadoop的大数据分析的弹性分布式计算平台

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper introduces the integration of three platforms using Apache Hive, Cloudera Impala and BDAS Spark SQL which enables to support SQL-like queries in big data environment. In order to fast respond to user's query for big data processing, the optimized system can automatically select the appropriate platform to best perform a query. In addition, the rapid data retrieval from the in-memory cache or in-disk cache has achieved for the repeated SQL command. The proposed approach improves the efficiency of data retrieval significantly.

机译：本文介绍了使用Apache Hive，Cloudera Impala和BDAS Spark SQL的三个平台的集成，这使得能够支持大数据环境中的SQL样查询。为了快速响应用户对大数据处理的查询，优化的系统可以自动选择适当的平台以最好执行查询。此外，对于重复的SQL命令，已经实现了来自内存中缓存或磁盘中缓存的快速数据检索。所提出的方法显着提高了数据检索的效率。

著录项

来源
《International Conference on Applied System Innovation 》|2016年|850p|共4页
会议地点
作者
Bao Rong Chang; Hsiu-Fen Tsai; Yo-Ai Wang; Chien-Feng Huang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Decision support systems; Operating systems; Conferences; Business; Facebook; Multimedia communication;

机译：决策支持系统;操作系统;会议;业务;Facebook;多媒体通信;

相似文献

外文文献
中文文献
专利

1. A security framework in G-Hadoop for big data computing across distributed Cloud data centres [J] . Jiaqi Zhao, Lizhe Wang, Jie Tao, Journal of computer and system sciences . 2014 ,第5期

机译：G-Hadoop中用于跨分布式云数据中心进行大数据计算的安全框架
2. G-Hadoop: MapReduce across distributed data centers for data-intensive computing [J] . Lizhe Wang, Jie Tao, Rajiv Ranjan, Future generation computer systems . 2013 ,第3期

机译：G-Hadoop：跨分布式数据中心的MapReduce，用于数据密集型计算
3. Typhoon quantitative rainfall prediction from big data analytics by using the apache hadoop spark parallel computing framework [J] . C- C. Wei, T.- H. Chou Oceanographic Literature Review . 2020 ,第10期

机译：台风通过使用Apache Hadoop火花并行计算框架来从大数据分析的量化降雨预测
4. Resilient distributed computing platforms for big data analysis using Spark and Hadoop [C] . Bao Rong Chang, Hsiu-Fen Tsai, Yo-Ai Wang, 2016 International Conference on Applied System Innovation . 2016

机译：弹性分布式计算平台，可使用Spark和Hadoop进行大数据分析
5. Design and implementation of distributed mobile computing platform using hadoop. [D] . Pandhe, Shraddha. 2013

机译：使用hadoop的分布式移动计算平台的设计与实现。
6. Biospark: scalable analysis of large numerical datasets from biologicalsimulations and experiments using Hadoop and Spark [O] . Max Klein, Rati Sharma, Chris H Bohrer, -1

机译：Biospark：来自生物学的大型数值数据集的可扩展分析使用Hadoop和Spark进行模拟和实验
7. Analysis and Research of Distributed network Crawler based on Cloud Computing Hadoop Platform [O] . Hongsheng Xu, Ganglong Fan, Ke Li 2018

机译：基于云计算Hadoop平台的分布式网络履带分析与研究

Resilient distributed computing platforms for big data analysis using Spark and Hadoop

摘要

著录项

相似文献

相关主题

期刊订阅