首页> 外文期刊>Big Data, IEEE Transactions on >Handling Big Data Using a Data-Aware HDFS and Evolutionary Clustering Technique
【24h】

Handling Big Data Using a Data-Aware HDFS and Evolutionary Clustering Technique

机译:使用数据感知的HDFS和进化集群技术处理大数据

获取原文
获取原文并翻译 | 示例
           

摘要

The increased use of cyber-enabled systems and Internet-of-Things (IoT) led to a massive amount of data with different structures. Most big data solutions are built on top of the Hadoop eco-system or use its distributed file system (HDFS). However, studies have shown inefficiency in such systems when dealing with today's data. Some research overcame these problems for specific types of graph data, but today's data are more than one type of data. Such efficiency issues may lead to large-scale problems, including larger space requirements in data centers, and waste in resources (like power consumption), that in turn lead to environmental problems (such as more carbon emission) [1] , as per scholars. We propose a data-aware module for the Hadoop eco-system. We also propose a distributed encoding technique for genetic algorithms efficient data processing. Our framework allows Hadoop to manage the distribution of data and its placement based on cluster analysis of the data itself. We are able to handle a broad range of data types as well as optimize query time and resource usage. We performed experiments on multiple datasets generated via LUBM (Lehigh University Benchmark) and reported results along with performance analysis.
机译:越来越多地使用具有网络功能的系统和物联网(IoT),导致具有不同结构的大量数据。大多数大数据解决方案都建立在Hadoop生态系统之上或使用其分布式文件系统(HDFS)。但是,研究表明,在处理当今数据时,此类系统效率低下。一些研究克服了针对特定类型的图形数据的这些问题,但是今天的数据不止一种类型的数据。这种效率问题可能会导致大规模的问题,包括数据中心的空间需求增加以及资源浪费(例如功耗),进而导致环境问题(例如更多的碳排放)[1]。 。我们为Hadoop生态系统提出了一个数据感知模块。我们还提出了一种用于遗传算法有效数据处理的分布式编码技术。我们的框架允许Hadoop基于数据本身的集群分析来管理数据的分布及其位置。我们能够处理各种数据类型,并优化查询时间和资源使用。我们对通过LUBM(利哈伊大学基准)生成的多个数据集进行了实验,并报告了结果以及性能分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号