首页> 美国卫生研究院文献>Bioinformatics >A probabilistic method for the detection and genotyping of small indels from population-scale sequence data
【2h】

A probabilistic method for the detection and genotyping of small indels from population-scale sequence data

机译:一种从人口规模序列数据中检测小插入缺失并进行基因分型的概率方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: High-throughput sequencing technologies have made population-scale studies of human genetic variation possible. Accurate and comprehensive detection of DNA sequence variants is crucial for the success of these studies. Small insertions and deletions represent the second most frequent class of variation in the human genome after single nucleotide polymorphisms (SNPs). Although several alignment tools for the gapped alignment of sequence reads to a reference genome are available, computational methods for discriminating indels from sequencing errors and genotyping indels directly from sequence reads are needed.>Results: We describe a probabilistic method for the accurate detection and genotyping of short indels from population-scale sequence data. In this approach, aligned sequence reads from a population of individuals are used to automatically account for context-specific sequencing errors associated with indels. We applied this approach to population sequence datasets from the 1000 Genomes exon pilot project generated using the Roche 454 and Illumina sequencing platforms, and were able to detect a significantly greater number of indels than reported previously. Comparison to indels identified in the 1000 Genomes pilot project demonstrated the sensitivity of our method. The consistency in the number of indels and the fraction of indels whose length is a multiple of three across different human populations and two different sequencing platforms indicated that our method has a low false discovery rate. Finally, the method represents a general approach for the detection and genotyping of small-scale DNA sequence variants for population-scale sequencing projects.>Availability: A program implementing this method is available at >Contact: >Supplementary information: are available at Bioinformatics online.
机译:>动机:高通量测序技术使对人类遗传变异的人群规模研究成为可能。准确,全面地检测DNA序列变异对于这些研究的成功至关重要。小插入和缺失代表了人类基因组中仅第二核苷酸多态性(SNPs)之后第二大最常见的变异类别。尽管有几种比对工具可用于将序列读取与参考基因组进行空位比对,但仍需要用于从序列错误中区分插入缺失和从序列读取中直接对插入缺失进行基因分型的计算方法。>结果:我们描述了一种概率方法从人口规模序列数据中准确检测和检测短插入缺失的基因型。在这种方法中,使用了来自一群人的比对序列读数来自动解决与插入缺失相关的特定于上下文的测序错误。我们将这种方法应用于使用Roche 454和Illumina测序平台生成的1000个基因组外显子试验项目的种群序列数据集,并且能够检测到比以前报告的数量更大的插入缺失。与1000个基因组试验项目中鉴定出的插入缺失进行比较,证明了我们方法的敏感性。在不同人群和两个不同测序平台上,长度为3的倍数的indel数目和indel分数的一致性表明,我们的方法的错误发现率较低。最后,该方法代表了一种用于人群规模测序项目的小规模DNA序列变异的检测和基因分型的通用方法。>可用性:可在>联系方式上找到实现此方法的程序:< / strong> >补充信息:可在线访问生物信息学。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号