...
首页> 外文期刊>Current Journal of Applied Science and Technology >Methodologies for Imputation of Missing Values in Rice Pest Data
【24h】

Methodologies for Imputation of Missing Values in Rice Pest Data

机译:水稻害虫数据中缺失值归咎的方法

获取原文

摘要

Data Mining is an emerging research field in the analysis of agricultural data. In fact the most important problem in extracting knowledge from the agriculture data is the missing values of the attributes in the selected data set. If such deficiencies are there in the selected data set then it needs to be cleaned during preprocessing of the data in order to obtain a functional data. The main objective of this paper is to analyse the effectiveness of the various imputation methods in producing a complete data set that can be more useful for applying data mining techniques and presented a comparative analysis of the imputation methods for handling missing values. The pest data set of rice crop collected throughout Maharashtra state under Crop Pest Surveillance and Advisory Project (CROPSAP) during 2009-2013 was used for analysis. The different methodologies like Deleting of rows, Mean & Median, Linear regression and Predictive Mean Matching were analysed for Imputation of Missing values. The comparative analysis shows that Predictive Mean Matching Methodology was better than other methods and effective for imputation of missing values in large data set.
机译:数据挖掘是农业数据分析中的新兴研究领域。实际上,从农业数据中提取知识中最重要的问题是所选数据集中的属性的缺失值。如果在所选数据集中存在这种缺陷,则需要在数据预处理期间清除它以获得功能数据。本文的主要目的是分析各种估算方法在制造完整数据集中的有效性,该方法对于应用数据挖掘技术并且呈现了用于处理缺失值的撤销方法的比较分析。在2009 - 2013年在农作物害虫监测和咨询项目(CHOPORSAP)下的Maharashtra国家收集的稻瘟病稻田害虫数据集用于分析。删除行,平均值和amp等不同的方法;分析中位数,线性回归和预测平均匹配归因于缺失值的归属。比较分析表明,预测性平均匹配方法优于其他方法,并且有效地用于大数据集中缺失值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号