首页> 外文期刊>Progress in Physical Geography >Prediction of winter wheat yield at county level in China using ensemble learning
【24h】

Prediction of winter wheat yield at county level in China using ensemble learning

机译:基于集成学习的县域冬小麦产量预测

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Since increasing food demand and continuous reduction of available farmland, reliable and near-real-time wheat yield forecasts are essential to ensure regional and global food supplies. Although the crop model has been widely used in yield estimation, its applicability in large-scale yield prediction is limited due to the large amount of data required for parameterization. We took the main winter wheat growing areas in China and developed an ensemble learning framework based on seven machine learning algorithms, such as extreme gradient boosting, random forest, and support vector regression. The model used satellite vegetation index time series, climate, soil properties, and elevation data to provide county-level winter wheat yield forecasts from 2001 to 2015. The results showed that the ensemble explained 86 of the yield variability, which outperformed all base learners. By calculating the correlation between the prediction results of the base learners, we believed that the prediction performance of ensemble learning still has the potential for improvement. Soil properties and elevation data effectively improved the performance of the model because they contained information about yield prediction that could not be fully captured by vegetation index and climate data. As the growing season went on, the unique contribution of increasing climate data to yield forecasts was always more than that of vegetation index, especially in the early growing season. Furthermore, we evaluated the model's ability to perform within-season prediction, and the model achieved satisfactory prediction accuracy 2 months before harvest (R-2 = 0.85, RMSE = 480 kg/ha, MAPE = 7.52). The framework of yield forecast established in this research can be applied to other crop varieties and regions and provide stakeholders with sufficiently accurate yield predictions.
机译:由于粮食需求不断增加,可用耕地不断减少,可靠和近乎实时的小麦单产预测对于确保区域和全球粮食供应至关重要。尽管作物模型在产量估算中得到了广泛的应用,但由于参数化需要大量数据,因此其在大规模产量预测中的适用性有限。我们以中国冬小麦主产区为例,基于极端梯度提升、随机森林和支持向量回归等7种机器学习算法,开发了一个集成学习框架。该模型利用卫星植被指数时间序列、气候、土壤性质和海拔数据,提供了2001—2015年县级冬小麦单产预测。结果显示,集成解释了 86% 的产量变异性,优于所有基础学习器。通过计算基础学习器预测结果之间的相关性,我们认为集成学习的预测性能仍有改进的潜力。土壤属性和高程数据有效地提高了模型的性能,因为它们包含植被指数和气候数据无法完全捕获的产量预测信息。随着生长季节的进行,气候数据增加对单产预测的独特贡献总是大于植被指数,尤其是在生长季节的早期。此外,我们评估了该模型的季节内预测能力,该模型在收获前 2 个月达到了令人满意的预测准确性(R-2 = 0.85,RMSE = 480 kg/ha,MAPE = 7.52%)。本研究建立的产量预测框架可以应用于其他作物品种和地区,并为利益相关者提供足够准确的产量预测。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号