首页> 外文期刊>IEEE/ACM transactions on computational biology and bioinformatics >Detecting Population-Differentiation Copy Number Variants in Human Population Tree by Sparse Group Selection
【24h】

Detecting Population-Differentiation Copy Number Variants in Human Population Tree by Sparse Group Selection

机译:稀疏组选择检测人口树中的人口分化拷贝数变体

获取原文
获取原文并翻译 | 示例
           

摘要

Copy-number variants (CNVs) account for a substantial proportion of human genetic variations. Understanding the CNV diversities across populations is a computational challenge because CNV patterns are often present in several related populations and only occur in a subgroup of individuals within each of the population. This paper introduces a tree-guided sparse group selection algorithm (treeSGS) to detect population-differentiation CNV markers of subgroups across populations organized by a phylogenetic tree of human populations. The treeSGS algorithm detects CNV markers of populations associated with nodes from all levels of the tree such that the evolutionary relations among the populations are incorporated for more accurate detection of population-differentiation CNVs. We applied treeSGS algorithm to study the 1,179 samples from the 11 populations in Hapmap3 CNV data. The treeSGS algorithm accurately identifies CNV markers of each population and the collection of populations organized under the branches of the human population tree, validated by consistency among family trios and SNP characterizations of the CNVregions. Further comparison between the detected CNV markers and other population-differentiation CNVs reported in 1,000 genome data and other recent studies also shows that treeSGS can significantly improve the current annotations of population-differentiation CNV markers. TreeSGS package is available at https://github.com/kuanglab/treeSGS.
机译:复制编号变体(CNVS)占人类遗传变异的大量比例。了解跨人群的CNV多样性是一个计算挑战,因为CNV模式通常存在于若干相关群体中,并且只出现在每个人口中的个体的子组中。本文介绍了一种树木引导稀疏组选择算法(TRAGEGS),以检测跨人口系统群体组织组织组织的人群的人口分化CNV标记。树本算法检测与树的各个级别相关联的群体的CNV标记,使得群体中的进化关系被纳入更准确地检测人口分化CNV。我们应用了Treadgs算法来研究来自HAPMAP3 CNV数据的11个群体的1,179个样本。树木算法准确地识别每个人口的CNV标记,以及在人口树的分支下组织的群体的集合,通过家庭TRIOS和CNVREGIONS的SNP表征之间的一致性验证。在1,000个基因组数据和其他最近的研究中,检测到的CNV标记和其他人群分化CNV之间的进一步比较也表明,树木可以显着改善人口分化CNV标记的当前注释。 Treegs包裹在https://github.com/kuanglab/treesgs提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号