Journal: The American Journal of Human Genetics

Estimating local ancestry in admixed populations.


Abstract

Large-scale genotyping of SNPs has shown great promise in identifying markers that could be linked to diseases. One of the major obstacles in performing these studies is that the underlying population substructure can produce spurious associations. Population substructure can be caused by the presence of two distinct subpopulations or by a single pool of admixed individuals. In this work, we focus on the latter, which is significantly harder to detect in practice. New advances in this research direction are expected to play a key role in identifying loci that differ among populations and are still associated with a disease. We evaluate current methods for inference of population substructure in such cases and show that they might be quite inaccurate even in relatively simple scenarios. We therefore introduce a new method, LAMP (Local Ancestry in adMixed Populations), which infers the ancestry of each individual at every single-nucleotide polymorphism (SNP). LAMP computes the ancestry structure for overlapping windows of contiguous SNPs and combines the results with a majority vote. Our empirical results show that LAMP is significantly more accurate and more efficient than existing methods for inferring locus-specific ancestries, enabling it to handle large-scale datasets. We further show that LAMP can be used to estimate the admixture of each individual. Our experimental evaluation indicates that this extension yields a considerably more accurate estimate of individual admixture than state-of-the-art methods such as STRUCTURE or EIGENSTRAT, which are frequently used to correct for population stratification in association studies.
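The window-and-vote step described in the abstract can be illustrated with a minimal Python sketch. This is not the LAMP implementation: the abstract does not say how ancestry is inferred within a window, so `infer_window` is a hypothetical placeholder supplied by the caller, and the names `majority_vote_ancestry`, `admixture_fraction`, `window_size`, and `step` are assumptions made for illustration. The sketch handles one individual with one ancestry label per window. The admixture helper simply averages the per-SNP calls, which is one plausible way to aggregate local ancestry and not necessarily the extension the authors evaluate.

```python
from collections import Counter
from typing import Callable, List


def majority_vote_ancestry(
    n_snps: int,
    window_size: int,
    step: int,
    infer_window: Callable[[int, int], int],
) -> List[int]:
    """Combine per-window ancestry calls into per-SNP calls by majority vote.

    infer_window(start, end) is a hypothetical stand-in for whatever
    per-window inference is used; it returns one ancestry label for the
    SNPs in the half-open range [start, end).
    """
    if window_size <= 0 or step <= 0 or step > window_size:
        raise ValueError("need 0 < step <= window_size so every SNP is covered")

    votes = [Counter() for _ in range(n_snps)]
    start = 0
    while start < n_snps:
        end = min(start + window_size, n_snps)
        label = infer_window(start, end)
        # Every SNP inside the window receives one vote for this label.
        for i in range(start, end):
            votes[i][label] += 1
        if end == n_snps:
            break
        start += step

    # Ties fall to the label that was voted for first at that SNP.
    return [v.most_common(1)[0][0] for v in votes]


def admixture_fraction(calls: List[int], population: int) -> float:
    """Fraction of SNPs assigned to `population` (an assumed aggregation)."""
    return sum(1 for c in calls if c == population) / len(calls)


# Toy data: the first six SNPs come from population 0, the rest from 1.
truth = [0] * 6 + [1] * 6


def toy_infer_window(start: int, end: int) -> int:
    # Pretend per-window inference: report the majority label in the window.
    return 1 if 2 * sum(truth[start:end]) > (end - start) else 0


calls = majority_vote_ancestry(len(truth), window_size=6, step=2,
                               infer_window=toy_infer_window)
```

With a window of 6 SNPs and a step of 2, each interior SNP is covered by up to three overlapping windows, so a single window that straddles the ancestry breakpoint is outvoted by its neighbours.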
