...
首页> 外文期刊>Applied Engineering in Agriculture >OPTIMUM FEATURE SUBSET FOR OPTIMIZING CROP YIELD PREDICTION USING FILTER AND WRAPPER APPROACHES
【24h】

OPTIMUM FEATURE SUBSET FOR OPTIMIZING CROP YIELD PREDICTION USING FILTER AND WRAPPER APPROACHES

机译:优化滤波器和包装方法优化作物产量预测的最佳特征子集

获取原文
获取原文并翻译 | 示例

摘要

In agriculture, crop yield prediction is critical. Crop yield depends on various features which can be categorized as geographical, climatic, and biological. Geographical features consist of cultivable land in hectares, canal length to cover the cultivable land, number of tanks and tube wells available for irrigation. Climatic features consist of rainfall, temperature, and radiation. Biological features consist of seeds, minerals, and nutrients. In total, 15 features were considered for this study to understand features impact on paddy crop yield for all seasons of each year. For selecting vital features, five filter and wrapper approaches were applied. For predicting accuracy of features selection algorithm, Multiple Linear Regression (MLR) model was used. The RMSE, MAE, R, and RRMSE metrics were used to evaluate the performance of feature selection algorithms. Data used for the analysis was drawn from secondary sources of state Agriculture Department, Government of Tamil Nadu, India, for over 30 years. Seventy-five percent of data was used for training and 25% was used for testing. Low computational time was also considered for the selection of best feature subset. Outcome of all feature selection algorithms have given similar results in the RMSE, RRMSE, R, and MAE values. The adjusted R(2 )value was used to find the optimum feature subset despite all the deviations. The evaluation of the dataset used in this work shows that total area of cultivation, number of tanks and open wells used for irrigation, length of canals used for irrigation, and average maximum temperature during the season of the crop are the best features for better crop yield prediction on the study area. The MLR gives 85% of model accuracy for the selected features with low computational time.
机译:在农业中,作物产量预测至关重要。作物产量取决于各种特征,可以分类为地理,气候和生物学。地理特征包括耕地,运河长度,覆盖可耕地,坦克和管井数可供灌溉。气候特征包括降雨,温度和辐射。生物学特征包括种子,矿物质和营养素。总共考虑了这项研究的15个特征,以了解每年所有季节对稻谷作物产量的影响。为了选择重要的功能,应用了五个过滤器和包装方法。为了预测特征选择算法的准确性,使用多个线性回归(MLR)模型。 RMSE,MAE,R和RRMSE指标用于评估特征选择算法的性能。用于分析的数据是从印度泰米尔纳德邦政府的州农业部的二级来源,超过30年。 75%的数据用于培训,25%用于测试。还考虑选择最佳特征子集的低计算时间。所有特征选择算法的结果已经在RMSE,RRMSE,R和MAE值中给出了类似的结果。尽管所有偏差,所调整的R(2)值用于找到最佳特征子集。在本工作中使用的数据集的评估显示,用于灌溉的耕种总面积,罐数和开放井,用于灌溉的运河长度,并且在作物季节的平均最高温度是更好的作物的最佳功能研究区的产量预测。 MLR为具有低计算时间的所选功能提供85%的模型精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号