...
首页> 外文期刊>Journal of human genetics >Establishment of a standardized system to perform population structure analyses with limited sample size or with different sets of SNP genotypes.
【24h】

Establishment of a standardized system to perform population structure analyses with limited sample size or with different sets of SNP genotypes.

机译:建立标准化系统,以有限的样本量或不同的SNP基因型集进行种群结构分析。

获取原文
获取原文并翻译 | 示例

摘要

Recent studies have demonstrated that principal component analysis (PCA) can detect the presence of population mixture and admixture in a sample and thus can be used to correct population stratification in genome-wide association studies (GWAS). We propose a complementary approach to PCA that compensates for potential weaknesses associated with PCA, so that one can perform population structure analyses using limited numbers of subjects and single-nucleotide polymorphisms (SNPs). Our method first requires a PCA of the largest reference sample from a population to standardize the system. Once the system is established, it can perform PCA for each individual with a much smaller number of SNPs drawn from the same population. This is because of the introduction of the probabilistic PCA, so that the prediction of the principal components (PCs) is performed under a rigorous probabilistic framework. The subsequent linear discriminant analysis also helps to understand from which ancestries or subpopulations a given individual is more likely to derive, in terms of posterior probabilities given the predicted PCs. A real-world prototype of the system for the Japanese population is developed based on 19 260 subjects, which illustrates the potential usefulness of the system as an aid in the detection of population structures in validation samples, or to help with the correction of population stratification in GWAS.
机译:最近的研究表明,主成分分析(PCA)可以检测样本中种群混合物和混合物的存在,因此可以用于校正全基因组关联研究(GWAS)中的种群分层。我们为PCA提出了一种补充方法,该方法可以弥补与PCA相关的潜在缺陷,从而使人们可以使用数量有限的受试者和单核苷酸多态性(SNP)进行种群结构分析。我们的方法首先需要从总体中获得最大参考样品的PCA来标准化系统。一旦建立了系统,它就可以为每个人执行PCA,并从相同的群体中抽取数量少得多的SNP。这是因为引入了概率PCA,因此在严格的概率框架下对主成分(PC)进行了预测。随后的线性判别分析还可以帮助您根据给定预测PC的后验概率来了解给定个体更可能从哪些祖先或亚种群中获得。基于19 260个主题,开发了针对日本人口的系统的真实世界原型,这说明了该系统在检测验证样本中的人口结构或帮助校正人口分层方面的潜在实用性在GWAS中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号