Source: 《信息网络安全》 (Chinese-language journal)

Design and Implementation of a Hadoop-Based System for Processing Petabyte-Scale Data

Abstract

With the spread of the internet, demand for storing and processing petabyte-scale data keeps growing, and traditional databases and storage architectures can no longer deliver fast responses at such volumes. As an open-source distributed system infrastructure, Hadoop provides a highly reliable distributed storage architecture and a high-speed method for computing over massive data, and it is regarded as an effective way to resolve the bottleneck of massive data processing. This paper builds a Hadoop cluster platform to store and process 1 PB of data, which greatly improves the system's processing performance.
