首页> 外文期刊>Precision Agriculture >An approach to forecast grain crop yield using multi-layered, multi-farm data sets and machine learning
【24h】

An approach to forecast grain crop yield using multi-layered, multi-farm data sets and machine learning

机译:使用多层多农场数据集和机器学习预测粮食作物产量的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Many broadacre farmers have a time series of crop yield monitor data for their fields which are often augmented with additional data, such as soil apparent electrical conductivity surveys and soil test results. In addition there are now readily available national and global datasets, such as rainfall and MODIS, which can be used to represent the crop-growing environment. Rather than analysing one field at a time as is typical in precision agriculture research, there is an opportunity to explore the value of combining data over multiple fields/farms and years into one dataset. Using these datasets in conjunction with machine learning approaches allows predictive models of crop yield to be built. In this study, several large farms in Western Australia were used as a case study, and yield monitor data from wheat, barley and canola crops from three different seasons (2013, 2014 and 2015) that covered similar to 11 000 to similar to 17 000 hectares in each year were used. The yield data were processed to a 10 m grid, and for each observation point associated predictor variables in space and time were collated. The data were then aggregated to a 100 m spatial resolution for modelling yield. Random forest models were used to predict crop yield of wheat, barley and canola using this dataset. Three separate models were created based on pre-sowing, mid-season and late-season conditions to explore the changes in the predictive ability of the model as more within-season information became available. These time points also coincide with points in the season when a management decision is made, such as the application of fertiliser. The models were evaluated with cross-validation using both fields and years for data splitting, and this was assessed at the field spatial resolution. Cross-validated results showed the models predicted yield relatively accurately, with a root mean square error of 0.36 to 0.42 t ha(-1), and a Lin's concordance correlation coefficient of 0.89 to 0.92 at the field resolution. The models performed better as the season progressed, largely because more information about within-season data became available (e.g. rainfall). The more years of yield data that were available for a field, the better the predictions were, and future work should use a longer time-series of yield data. The generic nature of this method makes it possible to apply to other agricultural systems where yield monitor data is available. Future work should also explore the integration of more data sources into the models, focus on predicting at finer spatial resolutions within fields, and the possibility of using the yield forecasts to guide management decisions.
机译:许多抗屠杀农民都有一个时间序列的作物产量监测数据,其田地经常被额外的数据增强,例如土壤表观电导率调查和土壤测试结果。此外,现在还有国家和全球数据集,如降雨和MODIS,可用于代表种植环境的环境。而不是在精确农业研究中的典型中分析一个领域,而且有机会探讨将数据与多个字段/场和多年来的数据组合到一个数据集中。使用这些数据集与机器学习方法结合使用允许建造的作物产量的预测模型。在这项研究中,西澳大利亚的几个大型农场被用作案例研究,并从小麦,大麦和油菜厂从三个不同季节(2013,2014和2015)的疗养数据库数据(2013,2014和2015),这些数据类似于11 000到类似于17 000使用每年的公顷。将产量数据处理到10米网格,并且对于每个观察点,在空间和时间中的每个观察点相关的预测变量被整理。然后将数据汇总到100米的空间分辨率以进行建模产量。随机森林模型用于使用该数据集预测小麦,大麦和油菜的作物产量。根据预播,中期和季节条件创建了三种独立的模型,以探讨模型的预测能力变化,随着季节内部信息的可用。当制定管理决定时,这些时间点也与本赛季的点相吻合,例如施肥的应用。使用对数据分割的字段和多年进行交叉验证评估模型,并在现场空间分辨率下进行评估。交叉验证结果显示模型相对准确地预测产量,具有0.36至0.42 t ha(-1)的根均方误差,并且LIN在现场分辨率下为0.89至0.92的林的一致性系数。随着赛季的进展情况更好的模型,很大程度上是因为有关季节内部数据的更多信息,所以可以获得(例如降雨)。对于场所可用的收益数据越多,预测越好,未来的工作应使用较长的时间系列产量数据。该方法的通用性质使得可以应用于其他农业系统,其中可以提供产量监控数据。未来的工作还应该探讨更多数据来源的集成到模型中,专注于在领域内更精细的空间分辨率预测,以及使用产量预测来指导管理决策的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号