首页> 外文学位 >Predictions of Genetic Merit in Tree Breeding Using Factor Analytic Linear Mixed Models and Blended Genomic Relationship Matrices.
【24h】

Predictions of Genetic Merit in Tree Breeding Using Factor Analytic Linear Mixed Models and Blended Genomic Relationship Matrices.

机译:使用因子分析线性混合模型和混合基因组关系矩阵预测树木育种的遗传优势。

获取原文
获取原文并翻译 | 示例

摘要

Increase in computer power and efficiency in DNA sequencing technologies is providing new opportunities to plant and animal breeders to fit more complex statistical models for predictions of genetic merit of individuals. Such models can be powerful to account for heterogeneity in the data and as a result can increase the accuracy of predictions and genetic gains from breeding programs.;In this study, I first evaluated the efficiency of factor analytic (FA) linear mixed models for a large, multi-environmental trial of loblolly pine (Pinus taeda L.). Height was assessed on 37,269 trees at age six years in a diallel experiment. Among models fitted, FA models produced the smallest AIC model fit statistics. FA models captured both the variance and covariance at the genetic level better than models with simpler covariance structures, and they provided more accurate predictions of genotypes. The mean narrow-sense heritability estimates for height was about 0.20 when more complex variance structures were used, compared to 0.13 when simpler variance structures were employed. FA models were parsimonious compared to US structures. The FA models provided a natural framework for modeling genotype by environment interaction. Genotype by environment interaction was non-significant as suggested by high genetic correlations both for additive (0.83) and dominance (0.91) effects.;Molecular marker data, especially single-nucleotide polymorphic (SNP) markers have been commonly used to predict genetic merit in animal breeding. However, marker data probably have missing genotypes and they need to be imputed. The effects of percent (level) and pattern (random or structured) of missing data, and mating designs on the accuracy imputation of genotypes were investigated. I used linkage based BLUP to impute missing genotypes for an empirical (unbalanced) data set for loblolly pine. For simulated (balanced) data sets, both BLUP and Hidden Markov Model (HMM) approaches were used. The actual data had 178 clones that were genotyped at 3,461 biallelic SNP markers. The simulated data consist of double-pair and half-diallel mating design with 2880 and 2940 individuals, respectively. For empirical data, accuracy of imputation was higher for the structured pattern of missing data at any level of missing percentages. Regardless of the pattern of the missing data, imputation accuracy was less than 0.70 when the data had greater than 40% missing values. For the simulated data, the imputation accuracy was not affected by mating design for the BLUP approach when the pattern of the missing data was structured. For HMM approach, when the pattern of the missing data was evenly-spaced, the mating design had no effect on the imputation accuracy.;Combining information from pedigree and DNA markers might improve prediction accuracies in tree breeding programs. In the third chapter, a cloned population of loblolly pine and simulated data sets were used to examine the efficiency of blended additive genetic relationships and realized genomic relationships were examined. Cloned 166 individuals were genotyped at 3,461 SNP markers out of a total 354 clones. For simulated data 300 or 600 trees genotyped at 1000 markers out of 1200. For the empirical data, the accuracy of predictions based on the ABLUP was 0.79. Predictions based on HBLUP had accuracy of 0.72 to 0.76, depending on the genomic relationships used. For the simulated data set, the accuracy values for HBLUP models were higher compared to ABLUP model. Also, as the genotyped population size increased the accuracies increased. HBLUP uses all the available phenotype, pedigree, and genotype data in a single-step and is easy to implement for genomic based selection. The major advantage of using genomic relationships matrices is that markers can capture the Mendelian sampling effect within full-sib families for selection.
机译:DNA测序技术中计算机功能的增强和效率的提高,为动植物育种者提供了新的机会,使其可以使用更复杂的统计模型来预测个体的遗传价值。这样的模型可以强大地解决数据中的异质性,从而可以提高预测的准确性和育种程序的遗传增益。在本研究中,我首先评估了因子分析(FA)线性混合模型的效率。大火炬松(Pinus taeda L.)的多环境试验。在一项年龄为6岁的实验中,对37,269棵树进行了高度评估。在拟合的模型中,FA模型产生的最小的AIC模型拟合统计量。 FA模型比具有简单协方差结构的模型更好地捕获了遗传水平的方差和协方差,并且它们提供了更准确的基因型预测。当使用更复杂的变异结构时,高度的平均狭义遗传力估计值约为0.20,而使用简单变异结构时为0.13。与美国的结构相比,FA模型简朴。 FA模型为通过环境相互作用对基因型进行建模提供了自然的框架。如加性(0.83)和优势(0.91)效应的高遗传相关性所表明,环境相互作用的基因型不显着。动物繁殖。但是,标记数据可能缺少基因型,因此需要进行估算。研究了缺失数据的百分比(水平)和模式(随机或结构化)以及配对设计对基因型准确性推算的影响。我使用基于链接的BLUP为火炬松的经验(不平衡)数据集估算缺失的基因型。对于模拟(平衡)数据集,同时使用BLUP和隐马尔可夫模型(HMM)方法。实际数据有178个克隆,以3,461个双等位基因SNP标记进行基因分型。模拟数据包括分别具有2880和2940个个体的双对和半月形交配设计。对于经验数据,在任何百分比的缺失百分比水平下,缺失数据的结构化模式的估算准确性都较高。无论缺失数据的模式如何,当缺失值大于40%时,插补精度均小于0.70。对于模拟数据,当构造丢失数据的模式时,插补精度不受BLUP方法配合设计的影响。对于HMM方法,当丢失数据的模式均匀分布时,交配设计对插补精度没有影响。;将谱系和DNA标记信息相结合可能会提高树木育种程序的预测准确性。在第三章中,使用克隆的火炬松种群和模拟数据集来检验混合加性遗传关系的效率,并检验了已实现的基因组关系。在总共354个克隆中,以3,461个SNP标记对166个个体进行了基因分型。对于模拟数据,以1200个中的1000个标记进行基因分型的300或600棵树。对于经验数据,基于ABLUP的预测的准确性为0.79。基于HBLUP的预测的准确度为0.72至0.76,具体取决于所使用的基因组关系。对于模拟数据集,与ABLUP模型相比,HBLUP模型的精度值更高。而且,随着基因型人口规模的增加,准确性也随之提高。 HBLUP只需一步即可使用所有可用的表型,谱系和基因型数据,并且易于实现基于基因组的选择。使用基因组关系矩阵的主要优点是标记可以捕获全同胞族内的孟德尔采样效应以供选择。

著录项

  • 作者

    Ogut, Funda.;

  • 作者单位

    North Carolina State University.;

  • 授予单位 North Carolina State University.;
  • 学科 Agriculture Forestry and Wildlife.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 196 p.
  • 总页数 196
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:42:48

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号