Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction

Laehnemann David; Borkhardt Arndt; McHardy Alice Carolyn

首页> 外文期刊>Briefings in bioinformatics >Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction

【24h】

Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction

机译：对DNA深度测序数据进行去噪-高通量测序错误及其纠正

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the six prominent sequencing platforms surveyed here: 454 pyrosequencing, Complete Genomics DNA nanoball sequencing, Illumina sequencing by synthesis, Ion Torrent semiconductor sequencing, Pacific Biosciences single-molecule real-time sequencing and Oxford Nanopore sequencing. There is a large variety of programs available for error removal in sequencing read data, which differ in the error models and statistical techniques they use, the features of the data they analyse, the parameters they determine from them and the data structures and algorithms they use. We highlight the assumptions they make and for which data types these hold, providing guidance which tools to consider for benchmarking with regard to the data properties. While no benchmarking results are included here, such specific benchmarks would greatly inform tool choices and future software development. The development of stand-alone error correctors, as well as single nucleotide variant and haplotype callers, could also benefit from using more of the knowledge about error profiles and from (re)combining ideas from the existing approaches presented here.

机译：表征常见的高通量测序平台所产生的错误并告诉技术伪像鉴定真正的遗传变异是两个相互依赖的步骤，这对于许多分析（例如单核苷酸变异调用，单倍型推断，序列组装和进化研究）都是必不可少的。随机和系统错误都可以显示此处调查的六个主要测序平台各自的特定发生情况：454焦磷酸测序，完整基因组DNA纳米球测序，Illumina合成测序，离子洪流半导体测序，Pacific Biosciences单分子实时测序和牛津纳米孔测序。有多种程序可用于对读取的数据进行序列化中的错误消除，这些程序在使用的错误模型和统计技术，分析的数据的特征，从中确定的参数以及使用的数据结构和算法方面有所不同。我们重点介绍它们所做的假设以及这些假设所适用的数据类型，并提供指导以考虑使用哪些工具进行数据属性基准测试。尽管此处未包含基准测试结果，但此类特定的基准测试将极大地指导工具选择和未来的软件开发。独立错误校正器以及单核苷酸变异体和单倍型调用者的开发，也可以从使用更多有关错误概况的知识以及（重新）组合此处介绍的现有方法中获得的思想中受益。

著录项

来源
《Briefings in bioinformatics》 |2016年第1期|共26页
作者
Laehnemann David; Borkhardt Arndt; McHardy Alice Carolyn;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类遗传学;
关键词
next-generation sequencing; high-throughput sequencing; error profile; error correction; error model; bias;

机译：下一代测序;高通量测序;错误概况;纠错;错误模型;偏差;

相似文献

外文文献
中文文献
专利

1. Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction [J] . Laehnemann David, Borkhardt Arndt, McHardy Alice Carolyn Briefings in bioinformatics . 2016,第1期

机译：对DNA深度测序数据进行去噪-高通量测序错误及其纠正
2. Data Denoising and Post-Denoising Corrections in Single Cell RNA Sequencing [J] . Agarwal Divyansh, Wang Jingshu, Zhang Nancy R. Statistical science . 2020,第1期

机译：单细胞RNA测序中的数据去噪和后衰老校正
3. A benchmark study on error-correction by read-pairing and tag-clustering in amplicon-based deep sequencing [J] . Tian-Hao Zhang, Nicholas C. Wu, Ren Sun BMC Genomics . 2016,第1期

机译：在基于扩增子的深度测序中通过读对和标签聚类进行纠错的基准研究
4. DNA sequencing error correction using spectral alignment [C] . Caesar Novaldo, Kusuma Wisnu Ananta, Wijaya Sony Hartono International Conference on Advanced Computer Science and Information Systems . 2013

机译：使用光谱比对校正DNA测序误差
5. Probabilistic insertion, deletion and substitution error correction using Markov inference in next generation sequencing reads [D] . Noroozi, Vahid 2016

机译：在下一代测序读取中使用马尔可夫推论进行概率插入，删除和取代错误校正
6. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction [O] . David Laehnemann, *, Arndt Borkhardt, -1

机译：对DNA深度测序数据进行去噪-高通量测序错误及其更正
7. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction [O] . David Laehnemann, Arndt Borkhardt, Alice Carolyn McHardy 2015

机译：去噪DNA深度测序数据 - 高通量测序误差及其校正
8. Iternative algorithm for correcting sequencing errors in DNA coding regions [R] . Xu, Y. , Mural, R. J. , Uberbacher, E. C. 1995

机译：用于校正DNa编码区中的测序错误的替代算法

Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction

摘要

著录项

相似文献

相关主题

期刊订阅