首页> 外文会议>WABI (Workshop) >Efficient Local Alignment Discovery amongst Noisy Long Reads

【24h】

Efficient Local Alignment Discovery amongst Noisy Long Reads

机译：高效的局部对齐发现在嘈杂的长读数中

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Long read sequencers portend the possibility of producing reference quality genomes not only because the reads are long, but also because sequencing errors and read sampling are almost perfectly random. However, the error rates are as high as 15%, necessitating an efficient algorithm for finding local alignments between reads at a 30% difference rate, a level that current algorithm designs cannot handle or handle inefficiently. In this paper we present a very efficient yet highly sensitive, threaded filter, based on a novel sort and merge paradigm, that proposes seed points between pairs of reads that are likely to have a significant local alignment passing through them. We also present a linear expected-time heuristic based on the classic O(nd) difference algorithm [1] that finds a local alignment passing through a seed point that is exceedingly sensitive, failing but once every billion base pairs. These two results have been combined into a software program we call DALIGN that realizes the fastest program to date for finding overlaps and local alignments in very noisy long read DNA sequencing data sets and is thus a prelude to de novo long read assembly.

机译：长读取序列器不仅是因为读数长，而且因为读取错误和读取采样几乎完全随意地产生参考质量基因组的可能性。然而，误差率高达15％，所以需要一种有效的算法，用于以30％的差异速率找到读取之间的局部对齐，当前算法设计无法处理或处理效率低下的级别。在本文中，我们介绍了一个基于新颖的排序和合并范例的非常有效但高度敏感的螺纹滤波器，该滤波器提出了可能具有通过它们的显着局部对准的读取对之间的种子点。我们还基于经典O（ND）差分算法[1]呈现线性预期时间启发式算法[1]，该算法通过超越非常敏感的种子点的局部对齐，每亿基对一次失败。这两个结果已被组合到软件程序中，我们调用Dalign，以实现最快的程序，以便在非常嘈杂的长读取DNA测序数据集中找到重叠和局部对准，因此是De Novo长读组件的前奏。

著录项

来源
《WABI (Workshop) 》|2014年||共16页
会议地点
作者
Gene Myers;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类生物科学的研究方法与技术 ;
关键词

相似文献

外文文献
中文文献
专利

1. Improved variant discovery through localre-alignment of short-read next-generationsequencing data using SRMA [J] . Nils Homer, Stanley F Nelson Genome Biology . 2010 ,第10期

机译：通过使用SRMA对短读的下一代测序数据进行本地重新对齐来改进变体发现
2. Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA [J] . Nils Homer, Stanley F Nelson Genome Biology . 2010 ,第10期

机译：通过使用SRMA对短读的下一代测序数据进行本地重新比对来改善变体发现
3. S-conLSH: alignment-free gapped mapping of noisy long reads [J] . Angana Chakraborty, Burkhard Morgenstern, Sanghamitra Bandyopadhyay BMC Bioinformatics . 2021 ,第1期

机译：s-conlsh：无噪音长读取的无对齐映射
4. Efficient Local Alignment Discovery amongst Noisy Long Reads [C] . Gene Myers International workshop on algorithms in bioinformatics . 2014

机译：嘈杂的长读中的高效本地比对发现
5. Novel Computational Methods for Improving Functional Analysis for Long Noisy Reads [D] . Du, Nan . 2019

机译：改善长噪声读取功能分析的新型计算方法
6. Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA [O] . Nils Homer, Stanley F Nelson 2010

机译：通过使用SRMA对短读的下一代测序数据进行本地重新比对来改善变体发现
7. Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA [O] . Nils Homer, Stanley F Nelson 2010

机译：通过使用SRMA对短读的下一代测序数据进行本地重新比对来改善变体发现

Efficient Local Alignment Discovery amongst Noisy Long Reads

摘要

著录项

相似文献

相关主题

期刊订阅