Journal: 《计算机应用与软件》 (Computer Applications and Software)

Research on Meteorological Data Mining Based on Rough Sets and Bayesian Classification on Hadoop

         

Abstract

With the continuing development of meteorological informatization, meteorological departments have accumulated massive amounts of data, and extracting useful knowledge from these data has become a focus of attention. Meteorological data are high-dimensional and strongly interdependent, which places higher demands on meteorological data mining. Classic data mining algorithms fail to achieve satisfactory performance and accuracy when processing massive meteorological data. Building on an analysis of the MapReduce computing model, rough set theory, and Bayesian classification, this paper presents a MapReduce-based data reduction algorithm that computes equivalence classes and a naive Bayes classification algorithm. Experiments were then carried out on the Hadoop platform. The results show that the parallel data mining scheme processes massive meteorological data effectively and scales well.
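The abstract does not give the algorithm's details, so the following is only a minimal sketch of how equivalence classes from rough set theory could be computed in a map/shuffle/reduce style. It runs in a single process rather than on Hadoop, and the toy decision table, the attribute names, and every function name (`map_record`, `shuffle`, `reduce_class`, `positive_region_size`) are assumptions made for illustration, not the authors' implementation.

```python
from collections import defaultdict

# Hypothetical toy decision table: three condition attributes and a decision ("rain").
RECORDS = [
    {"id": 1, "temp": "high", "humidity": "high", "wind": "weak", "rain": "yes"},
    {"id": 2, "temp": "high", "humidity": "high", "wind": "strong", "rain": "yes"},
    {"id": 3, "temp": "low", "humidity": "low", "wind": "weak", "rain": "no"},
    {"id": 4, "temp": "low", "humidity": "high", "wind": "weak", "rain": "yes"},
    {"id": 5, "temp": "low", "humidity": "low", "wind": "strong", "rain": "no"},
]


def map_record(record, attrs):
    """Map step: key each record by its values on the chosen attributes."""
    key = tuple(record[a] for a in attrs)
    yield key, (record["id"], record["rain"])


def shuffle(pairs):
    """Stand-in for the framework's shuffle: group values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_class(values):
    """Reduce step: one equivalence class, plus whether its decisions agree."""
    ids = sorted(record_id for record_id, _ in values)
    decisions = {decision for _, decision in values}
    return ids, len(decisions) == 1


def equivalence_classes(records, attrs):
    """Records that are indiscernible on `attrs` end up in the same class."""
    pairs = (kv for r in records for kv in map_record(r, attrs))
    return {key: reduce_class(vals) for key, vals in shuffle(pairs).items()}


def positive_region_size(records, attrs):
    """Count records lying in classes whose decision value is unambiguous."""
    classes = equivalence_classes(records, attrs)
    return sum(len(ids) for ids, consistent in classes.values() if consistent)


if __name__ == "__main__":
    full = ["temp", "humidity", "wind"]
    print(positive_region_size(RECORDS, full))
    for attr in full:
        print(attr, positive_region_size(RECORDS, [attr]))
```

In this toy table, an attribute subset whose positive region is as large as that of the full attribute set is a candidate for a reduced attribute set, which is the general idea behind rough-set data reduction. On a real cluster the map and reduce functions would be run by the framework over partitions of the data rather than over an in-memory list.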
