...
首页> 外文期刊>Plant methods >Principal variable selection to explain grain yield variation in winter wheat from features extracted from UAV imagery
【24h】

Principal variable selection to explain grain yield variation in winter wheat from features extracted from UAV imagery

机译:利用从无人机影像中提取的特征来解释冬小麦籽粒产量变化的主变量选择

获取原文

摘要

Abstract BackgroundAutomated phenotyping technologies are continually advancing the breeding process. However, collecting various secondary traits throughout the growing season and processing massive amounts of data still take great efforts and time. Selecting a minimum number of secondary traits that have the maximum predictive power has the potential to reduce phenotyping efforts. The objective of this study was to select principal features extracted from UAV imagery and critical growth stages that contributed the most in explaining winter wheat grain yield. Five dates of multispectral images and seven dates of RGB images were collected by a UAV system during the spring growing season in 2018. Two classes of features (variables), totaling to 172 variables, were extracted for each plot from the vegetation index and plant height maps, including pixel statistics and dynamic growth rates. A parametric algorithm, LASSO regression (the least angle and shrinkage selection operator), and a non-parametric algorithm, random forest, were applied for variable selection. The regression coefficients estimated by LASSO and the permutation importance scores provided by random forest were used to determine the ten most important variables influencing grain yield from each algorithm.ResultsBoth selection algorithms assigned the highest importance score to the variables related with plant height around the grain filling stage. Some vegetation indices related variables were also selected by the algorithms mainly at earlier to mid growth stages and during the senescence. Compared with the yield prediction using all 172 variables derived from measured phenotypes, using the selected variables performed comparable or even better. We also noticed that the prediction accuracy on the adapted NE lines ( r =?0.58–0.81)?was higher than the other lines ( r =?0.21–0.59) included in this study with different genetic backgrounds.ConclusionsWith the ultra-high resolution plot imagery obtained by the UAS-based phenotyping we are now able to derive more features, such as the variation of plant height or vegetation indices within a plot other than just an averaged number, that are potentially very useful for the breeding purpose. However, too many features or variables can be derived in this way. The promising results from this study suggests that the selected set from those variables can have comparable prediction accuracies on the grain yield prediction than the full set of them but possibly resulting in a better allocation of efforts and resources on phenotypic data collection and processing.
机译:抽象背景自动表型技术正在不断推进育种过程。但是,在整个生长季节收集各种次要特征并处理大量数据仍然需要大量的精力和时间。选择具有最大预测能力的最少数量的次级性状有可能减少表型研究的努力。这项研究的目的是选择从无人机图像中提取的主要特征和关键生长阶段,这些特征在解释冬小麦籽粒产量方面贡献最大。 UAV系统在2018年春季生长季节收集了五个日期的多光谱图像和七个日期的RGB图像。每个样地从植被指数和植物高度中提取了两类特征(变量),共172个变量地图,包括像素统计信息和动态增长率。将参数算法LASSO回归(最小角度和收缩选择算子)和非参数算法随机森林应用于变量选择。分别使用LASSO估计的回归系数和随机森林提供的排列重要性得分来确定影响算法产量的十个最重要变量。结果两种选择算法均将最高重要性得分分配给与籽粒灌浆周围植物高度相关的变量阶段。该算法还选择了一些与植被指数相关的变量,这些变量主要是在生长的早期到中期以及衰老期间。与使用来自测得表型的所有172个变量进行的产量预测相比,使用选定的变量表现可比甚至更好。我们还注意到,在具有不同遗传背景的本研究中,适应的NE品系(r =?0.58–0.81)?的预测准确性高于其他研究品系(r =?0.21-0.59)。通过基于UAS的表型获得的样地图像,我们现在能够推导出更多的特征,例如,除了平均数以外,样地中植物高度或植被指数的变化,这可能对繁殖目的非常有用。但是,以这种方式可以导出太多的特征或变量。这项研究的有希望的结果表明,从这些变量中选出的一组与全部变量相比,在谷物产量预测上可以具有可比的预测精度,但可能导致在表型数据收集和处理方面的努力和资源得到更好的分配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号