...
首页> 外文期刊>Heredity: An International Journal of Genetics >A robust multiple-locus method for quantitative trait locus analysis of non-normally distributed multiple traits
【24h】

A robust multiple-locus method for quantitative trait locus analysis of non-normally distributed multiple traits

机译:用于非正态分布多个性状数量性状基因座分析的鲁棒多位点方法

获取原文
获取原文并翻译 | 示例
           

摘要

Linear regression-based quantitative trait loci/association mapping methods such as least squares commonly assume normality of residuals. In genetics studies of plants or animals, some quantitative traits may not follow normal distribution because the data include outlying observations or data that are collected from multiple sources, and in such cases the normal regression methods may lose some statistical power to detect quantitative trait loci. In this work, we propose a robust multiple-locus regression approach for analyzing multiple quantitative traits without normality assumption. In our method, the objective function is least absolute deviation (LAD), which corresponds to the assumption of multivariate Laplace distributed residual errors. This distribution has heavier tails than the normal distribution. In addition, we adopt a group LASSO penalty to produce shrinkage estimation of the marker effects and to describe the genetic correlation among phenotypes. Our LAD-LASSO approach is less sensitive to the outliers and is more appropriate for the analysis of data with skewedly distributed phenotypes. Another application of our robust approach is on missing phenotype problem in multiple-trait analysis, where the missing phenotype items can simply be filled with some extreme values, and be treated as outliers. The efficiency of the LAD-LASSO approach is illustrated on both simulated and real data sets.
机译:基于线性回归的定量特征位点/关联映射方法(例如最小二乘法)通常采用残差的正态性。在动植物遗传学研究中,某些定量性状可能不遵循正态分布,因为数据包括离群的观测值或从多个来源收集的数据,在这种情况下,正常回归方法可能会失去检测定量性状基因座的统计能力。在这项工作中,我们提出了一种鲁棒的多位点回归方法,可以在没有正态假设的情况下分析多个定量特征。在我们的方法中,目标函数是最小绝对偏差(LAD),它对应于多元拉普拉斯分布残差误差的假设。该分布的尾部比正态分布重。此外,我们采用组LASSO罚分来产生标记效应的收缩估计并描述表型之间的遗传相关性。我们的LAD-LASSO方法对异常值不太敏感,并且更适合于分析具有偏态分布表型的数据。我们的鲁棒方法的另一个应用是在多特征分析中的缺失表型问题上,其中缺失表型项可以简单地用一些极值填充,并被视为离群值。 LAD-LASSO方法的效率在模拟和真实数据集上都有说明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号