首页> 外文期刊>IEEE/ACM transactions on computational biology and bioinformatics >Pluribus—Exploring the Limits of Error Correction Using a Suffix Tree
【24h】

Pluribus—Exploring the Limits of Error Correction Using a Suffix Tree

机译:Pluribus-使用后缀树探索错误校正的极限

获取原文
获取原文并翻译 | 示例
       

摘要

Next generation sequencing technologies enable efficient and cost-effective genome sequencing. However, sequencing errors increase the complexity of the de novo assembly process, and reduce the quality of the assembled sequences. Many error correction techniques utilizing substring frequencies have been developed to mitigate this effect. In this paper, we present a novel and effective method called PLURIBUS, for correcting sequencing errors using a generalized suffix trie. PLURIBUS utilizes multiple manifestations of an error in the trie to accurately identify errors and suggest corrections. We show that PLURIBUS produces the least number of false positives across a diverse set of real sequencing datasets when compared to other methods. Furthermore, PLURIBUS can be used in conjunction with other contemporary error correction methods to achieve higher levels of accuracy than either tool alone. These increases in error correction accuracy are also realized in the quality of the contigs that are generated during assembly. We explore, in-depth, the behavior of PLURIBUS, to explain the observed improvement in accuracy and assembly performance. PLURIBUS is freely available at http://compbio.case.edu/pluribus/.
机译:下一代测序技术可实现高效且经济高效的基因组测序。然而,测序错误增加了从头组装过程的复杂性,并降低了组装序列的质量。已经开发出许多利用子串频率的纠错技术来减轻这种影响。在本文中,我们提出了一种新颖且有效的方法,称为PLURIBUS,用于使用广义后缀trie纠正序列错误。 PLURIBUS利用Trie中错误的多种表现来准确识别错误并提出纠正建议。我们显示,与其他方法相比,PLURIBUS在各种各样的真实测序数据集中产生最少数量的假阳性。此外,PLURIBUS可以与其他现代纠错方法结合使用,以实现比单独使用任何一种工具更高的准确性。在组装过程中产生的重叠群的质量中也实现了纠错精度的这些提高。我们深入探讨PLURIBUS的行为,以解释观察到的精度和组装性能方面的改进。 PLURIBUS可从http://compbio.case.edu/pluribus/免费获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号