首页> 外文会议>IEEE International Conference on Cloud Computing and Big Data Analysis >Design and Implementation of Meteorological Big Data Platform Based on Hadoop and Elasticsearch
【24h】

Design and Implementation of Meteorological Big Data Platform Based on Hadoop and Elasticsearch

机译:基于Hadoop和Elasticsearch的气象大数据平台的设计与实现。

获取原文

摘要

With the launching of high resolution meteorological satellites and the development of high spatial and temporal resolution numerical models, the types and amounts of various meteorological data are increasing year by year. The existing relational databases are no longer able to meet the business requirements of real-time or non-real-time data storage, processing and retrieval. The Hadoop ecosystem, combining with the Elasticsearch cluster (ES cluster) is used to build the meteorological big data platform. The real-time data is processed by Kafka message queue, combing with the Storm DataAnly topology and finally enters the ES cluster. The non-real-time data is mainly processed by the file monitoring component. The file metadata information such as indexes is stored in the ES cluster. The files are saved in the HDFS. The implemented Big Data platform can process about 1.5 million real-time and non-real-time meteorological data per day, while the Elasticsearch cluster can provide ultrafast searching at a speed level of millisecond in a dataset of 2.0 million. Experiments show that the meteorological big data platform can meet the needs of modern meteorological business.
机译:随着高分辨率气象卫星的发射和高时空分辨率数值模型的发展,各种气象数据的类型和数量逐年增加。现有的关系数据库不再能够满足实时或非实时数据存储,处理和检索的业务需求。 Hadoop生态系统结合Elasticsearch集群(ES集群)用于构建气象大数据平台。实时数据由Kafka消息队列处理,并与Storm DataAnly拓扑结合,最后进入ES集群。非实时数据主要由文件监视组件处理。文件元数据信息(例如索引)存储在ES群集中​​。文件保存在HDFS中。已实施的大数据平台每天可处理约150万个实时和非实时气象数据,而Elasticsearch集群可在200万个数据集中以毫秒级的速度提供超快速搜索。实验表明,气象大数据平台可以满足现代气象业务的需求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号