首页> 美国卫生研究院文献>Genome Biology >Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads

【2h】

Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads

机译：基于图和基于对齐的混合纠错方法在易错长读中的性能差异

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Illustration of alignment-based and graph-based method; results for model fitness and accuracy gain on simulated data. Schematic of alignment-based method. is a certain base on the long read, and is the corresponding base on the reference sequence. The real short reads are aligned to the long read (with of them being successfully aligned), and then the consensus is inferred at each base. Relationship of the successful alignment probability for short reads with the mismatch rate , lower threshold on perfect match -mer size and the upper threshold of mismatches . In spite of the changes of or/and , is near to one when p > 30%. This indicates that mismatch rate is the most dominant factor on . As increases from 10 to 20, the curves move upper (from blue to red and green), implying that increases with . Moreover, the divergence between the dashed and solid blue, red, and green lines also shows an increasing tendency, which means the effect of on also increases with . Schematic of graph-based error correction method. DBG is built based on short reads. Solid -mers are detected on the long reads. The fragment between two adjacent solid -mers is then aligned with the correlated path on the DBG. The path is used to correct the fragment when certain criteria are satisfied. Accuracy gain at each error rate for simulated long reads corrected by alignment-based method. The boxplots represent the accuracy gain distribution for long reads. The solid lines represent the theoretical values. The dashed gray lines (diagonal lines) correspond to perfect correction. Proportion of simulated long reads with solid -mer detected at each error rate level. The solid lines represent the theoretical values. The dashed lines represent the results on simulated long reads. Accuracy gain at each error rate for simulated long reads corrected by graph-based method. : long read length; : size of perfectly matched seed or solid -mer

机译：基于对齐和基于图的方法的说明；模拟数据的模型适应性和准确性增益的结果。基于对齐方式的方法示意图。是基于长期阅读的确定基础，是与参考序列相对应的基础。真正的短读与长读对齐（它们已成功对齐），然后在每个碱基处推断出共识。短阅读的成功比对概率与错配率，完美匹配分子大小的下限和错配上限的关系。尽管或/和变化，当p> 30％时仍接近1。这表明不匹配率是上最主要的因素。当从10增加到20时，曲线向上移动（从蓝色到红色和绿色），这意味着随增大。此外，蓝色，红色和绿色虚线与实线之间的散度也显示出增加的趋势，这意味着的影响也随增大。基于图的纠错方法的示意图。 DBG基于短读而构建。在长读数中检测到固态单体。然后将两个相邻的固态单体之间的片段与DBG上的相关路径对齐。当满足某些条件时，该路径用于更正片段。通过基于比对的方法校正的模拟长读在每种错误率下的准确度增益。箱线图表示长时间读取的精度增益分布。实线代表理论值。灰色虚线（对角线）对应于完美校正。在每个错误率水平上检测到的带有固态单体的模拟长读的比例。实线代表理论值。虚线表示模拟长读的结果。通过基于图的方法校正的模拟长读在每个错误率处的准确度增益。：读取时间长；：完美匹配的种子或固体的大小

著录项

期刊名称 Genome Biology
作者
Anqi Wang; Kin Fai Au;
展开▼
作者单位

展开▼
年(卷),期 2020(21),-1
年度 2020
页码 -1
总页数 8
原文格式 PDF
正文语种
中图分类生物学;
关键词

相似文献

外文文献
中文文献
专利

1. Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads [J] . Anqi Wang, Kin Fai Au Genome Biology . 2020,第1期

机译：基于图形和基于对准的混合误差校正方法的性能差异，用于易于易于的长读取
2. A hybrid and scalable error correction algorithm for indel and substitution errors of long reads [J] . Arghya Kusum Das, Sayan Goswami, Kisung Lee, BMC Genomics . 2019,第S11期

机译：长读取的indel和替换误差的混合和可伸缩误差校正算法
3. A method to avoid errors associated with the analysis of hypermutated viral sequences by alignment-based methods [J] . Journal of biomedical informatics. . 2015,第Null期

机译：一种避免基于比对的方法与超突变病毒序列分析相关的错误的方法
4. Extremely Biased Error Correction Method to Reduce Read Disturb Errors of 3D-TLC NAND Flash Memories by 60 [C] . Hiroki Aihara, Kyosuke Maeda, Shun Suzuki, IEEE International Memory Workshop . 2020

机译：极度偏置的纠错方法，可将3D-TLC NAND闪存的读取干扰错误降低60％
5. Genome Assembly of Long Error-Prone Reads Using de Bruijn Graphs and Repeat Graphs [D] . Yuan, Jeffrey. 2019

机译：使用de bruijn图表和重复图来读取长时间错误的基因组组合
6. A comparative evaluation of hybrid error correction methods for error-prone long reads [O] . Shuhua Fu, Anqi Wang, Kin Fai Au 2019

机译：易出错长读混合错误校正方法的比较评估
7. Figure S3: Difference and 95 CI of median predicted protein length of different assembly types to evaluate the impact of sequencing error and error correction of VirION reads with short-read data [O] . -1

机译：图S3：不同装配类型的中位数预测蛋白质长度的差异和95％CI，以评估序列误差和Viriv的纠错与短读数据的影响

Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads

摘要

著录项

相似文献

相关主题

期刊订阅