首页> 美国卫生研究院文献>Bioinformatics >PyroHMMvar: a sensitive and accurate method to call short indels and SNPs for Ion Torrent and 454 data
【2h】

PyroHMMvar: a sensitive and accurate method to call short indels and SNPs for Ion Torrent and 454 data

机译:PyroHMMvar:一种灵敏且准确的方法可调用短插入/缺失和INP来获取离子激流和454数据

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: The identification of short insertions and deletions (indels) and single nucleotide polymorphisms (SNPs) from Ion Torrent and 454 reads is a challenging problem, essentially because these techniques are prone to sequence erroneously at homopolymers and can, therefore, raise indels in reads. Most of the existing mapping programs do not model homopolymer errors when aligning reads against the reference. The resulting alignments will then contain various kinds of mismatches and indels that confound the accurate determination of variant loci and alleles.>Results: To address these challenges, we realign reads against the reference using our previously proposed hidden Markov model that models homopolymer errors and then merges these pairwise alignments into a weighted alignment graph. Based on our weighted alignment graph and hidden Markov model, we develop a method called PyroHMMvar, which can simultaneously detect short indels and SNPs, as demonstrated in human resequencing data. Specifically, by applying our methods to simulated diploid datasets, we demonstrate that PyroHMMvar produces more accurate results than state-of-the-art methods, such as Samtools and GATK, and is less sensitive to mapping parameter settings than the other methods. We also apply PyroHMMvar to analyze one human whole genome resequencing dataset, and the results confirm that PyroHMMvar predicts SNPs and indels accurately.>Availability and implementation: Source code freely available at the following URL: , implemented in C++ and supported on Linux.>Contact: or
机译:>动机:从离子洪流和454读取中识别短插入和缺失(indels)和单核苷酸多态性(SNP)是一个具有挑战性的问题,主要是因为这些技术容易在均聚物上错误地测序,并且可能,因此提高了indel的读取量。当将读数与参考进行比对时,大多数现有的映射程序都不会对均聚物错误进行建模。然后,产生的比对将包含各种错配和插入/缺失,这会混淆对变异基因座和等位基因的准确确定。>结果:为解决这些挑战,我们使用先前提出的隐马尔可夫模型对参考值进行了比对。该模型可模拟均聚物错误,然后将这些成对的比对合并为加权的比对图。根据我们的加权比对图和隐马尔可夫模型,我们开发了一种称为PyroHMMvar的方法,该方法可以同时检测到较短的indel和SNP,如人类重测序数据所示。具体来说,通过将我们的方法应用于模拟二倍体数据集,我们证明了PyroHMMvar所产生的结果要优于最新的方法(例如Samtools和GATK),并且比其他方法对映射参数设置的敏感性更低。我们还应用PyroHMMvar分析了一个人类全基因组重测序数据集,结果证实PyroHMMvar可以准确预测SNP和插入缺失。>可用性和实现方式:可通过以下URL免费获得源代码:,使用C ++和在Linux上受支持。>联系方式:

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号