首页> 外文期刊>TAG Theoretical and Applied Genetics >Prediction of maize single-cross hybrid performance: support vector machine regression versus best linear prediction
【24h】

Prediction of maize single-cross hybrid performance: support vector machine regression versus best linear prediction

机译:玉米单杂交种性能预测:支持向量机回归与最佳线性预测

获取原文
获取原文并翻译 | 示例
           

摘要

Accurate prediction of the phenotypic performance of a hybrid plant based on the molecular fingerprints of its parents should lead to a more cost-effective breeding programme as it allows to reduce the number of expensive field evaluations. The construction of a reliable prediction model requires a representative sample of hybrids for which both molecular and phenotypic information are accessible. This phenotypic information is usually readily available as typical breeding programmes test numerous new hybrids in multi-location field trials on a yearly basis. Earlier studies indicated that a linear mixed model analysis of this typically unbalanced phenotypic data allows to construct ɛ-insensitive support vector machine regression and best linear prediction models for predicting the performance of single-cross maize hybrids. We compare these prediction methods using different subsets of the phenotypic and marker data of a commercial maize breeding programme and evaluate the resulting prediction accuracies by means of a specifically designed field experiment. This balanced field trial allows to assess the reliability of the cross-validation prediction accuracies reported here and in earlier studies. The limits of the predictive capabilities of both prediction methods are further examined by reducing the number of training hybrids and the size of the molecular fingerprints. The results indicate a considerable discrepancy between prediction accuracies obtained by cross-validation procedures and those obtained by correlating the predictions with the results of a validation field trial. The prediction accuracy of best linear prediction was less sensitive to a reduction of the number of training examples compared with that of support vector machine regression. The latter was, however, better at predicting hybrid performance when the size of the molecular fingerprints was reduced, especially if the initial set of markers had a low information content. Communicated by M. Cooper.
机译:基于其亲本的分子指纹对杂种植物表型表现的准确预测,应该会导致更具成本效益的育种计划,因为它可以减少昂贵的田间评估次数。可靠的预测模型的构建需要具有代表性的杂交体样本,才能获得分子和表型信息。这种表型信息通常很容易获得,因为典型的育种计划每年在多地点田间试验中测试众多新杂种。较早的研究表明,对这种典型的不平衡表型数据进行线性混合模型分析,可以构建不敏感的支持向量机回归模型和用于预测单杂交玉米杂交种性能的最佳线性预测模型。我们使用商业玉米育种计划的表型和标记数据的不同子集来比较这些预测方法,并通过专门设计的田间实验评估所得的预测准确性。这项平衡的现场试验可以评估此​​处和早期研究中报告的交叉验证预测准确性的可靠性。通过减少训练混合体的数量和分子指纹的大小,进一步检查了两种预测方法的预测能力的局限性。结果表明,通过交叉验证程序获得的预测准确性与通过将预测与验证现场试验的结果相关联而获得的预测准确性之间存在相当大的差异。与支持向量机回归相比,最佳线性预测的预测精度对训练示例数量的减少较不敏感。但是,当分子指纹的大小减小时,尤其是在初始标记物的信息含量较低的情况下,后者在预测杂种性能方面更好。由M. Cooper沟通。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号