A Time Based Analysis of Data Processing on Hadoop Cluster

机译：基于时间的Hadoop集群数据处理分析

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Data when it becomes in that much amount that it cannot be managed by the traditional database management system then it is Big data. It is difficult to manage this much amount of the data. Hadoop is a technological answer to the Big Data. Data storage and retrieval of information from the data is done by the Hadoop Distributed File System and the Map Reduce Programming model. MapReduce provides effective bench marks for retrieving the information from the Big Data. In this paper we present our experimental work done on the Hadoop Cluster. We have analyzed the time required by the cluster for processing the data with increasing number of nodes into the cluster. We started with a single node and then increase the node by one each time. We have analyzed three types of time. The real time, user time, system time is analyzed.

机译：当数据量大到传统数据库管理系统无法管理的数据时，它就是大数据。管理如此大量的数据非常困难。 Hadoop是大数据的技术解决方案。数据存储和从数据中检索信息是由Hadoop分布式文件系统和Map Reduce编程模型完成的。 MapReduce为从大数据中检索信息提供了有效的基准。在本文中，我们介绍了在Hadoop集群上完成的实验工作。我们分析了集群中处理节点所需数量增加的数据所需的时间。我们从一个节点开始，然后每次增加一个节点。我们分析了三种类型的时间。实时，用户时间，系统时间进行了分析。

著录项

来源
《International Conference on Computational Intelligence and Communication Networks》|2014年|608-612|共5页
会议地点
作者
Pal Amrit; Agrawal Sanjay;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Data Node; Hadoop Distributed File System; Job Tracker; MapReduce; Name Node; Task Tracker;

机译：数据节点; Hadoop分布式文件系统;作业跟踪器; MapReduce;名称节点;任务跟踪器;

相似文献

外文文献
中文文献
专利

1. A novel model for Time-Series Data Clustering Based on piecewise SVD and BIRCH for Stock Data Analysis on Hadoop Platform [J] . Ibgtc Bowala, Mgnas Fernando Advances in Science, Technology and Engineering Systems . 2017,第3期

机译：基于分段SVD和BIRCH的时间序列数据聚类新模型，用于Hadoop平台上的股票数据分析
2. Performance Modeling and Analysis of a Hadoop Cluster for Efficient Big Data Processing [J] . Advanced Science Letters . 2016,第9期

机译：Hadoop集群的性能建模与分析高效大数据处理
3. QAOC: Novel query analysis and ontology-based clustering for data management in Hadoop [J] . D. Pradeep, C. Sundar Future generation computer systems . 2020,第Jula期

机译：QAOC：用于Hadoop中数据管理的新颖查询分析和基于本体的集群
4. A Time Based Analysis of Data Processing on Hadoop Cluster [C] . Pal Amrit, Agrawal Sanjay International Conference on Computational Intelligence and Communication Networks . 2014

机译：Hadoop集群数据处理的基于时间分析
5. Sentiment analysis of big social data with Apache Hadoop. [D] . Kang, Qiuling. 2014

机译：使用Apache Hadoop对大型社交数据进行情感分析。
6. Exploring mediators of the recovery process over time among mental health service users using a mixed model regression analysis based on cluster RCT data [O] . Elisabeth Argentzell, Martin Bäckström, Kristine Lund, 2020

机译：在心理健康服务用户中探讨恢复过程的调解器使用基于集群RCT数据的混合模型回归分析
7. A COMPARATIVE ANALYSIS OF CONVENTIONAL HADOOP WITH PROPOSED CLOUD ENABLED HADOOP FRAMEWORK FOR SPATIAL BIG DATA PROCESSING [O] . A. K. Tripathi, S. Agrawal, R. D. Gupta 2018

机译：具有拟议云的传统Hadoop的比较分析使Hadoop框架用于空间大数据处理

A Time Based Analysis of Data Processing on Hadoop Cluster

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅