首页> 外文会议>Asia-Pacific Bioinformatics Conference >Crystallizing short-read assemblies around seeds
【24h】

Crystallizing short-read assemblies around seeds

机译:在种子周围结晶短读组件

获取原文

摘要

Background: New short-read sequencing technologies produce enormous volumes of 25-30 base paired-end reads. The resulting reads have vastly different characteristics than produced by Sanger sequencing, and require different approaches than the previous generation of sequence assemblers. In this paper, we present a short-read de novo assembler particularly targeted at the new ABI SOLiD sequencing technology.Results: This paper presents what we believe to be the first de novo sequence assembly resultson real data from the emerging SOLiD platform, introduced by Applied Biosystems. Our assembler SHORTY augments short-paired reads using a trivially small number (5 - 10) of seeds of length 300 - 500 bp. These seeds enable us to produce significant assemblies using short-read coverage no more than 100*, which can be obtained in a single run of these high-capacity sequencers. SHORTY exploits two ideas which we believe to be of interest to the short-read assembly community: (I) using single seed reads to crystallize assemblies, and (2) estimating intercontig distances accurately from multiple spanning paired-end reads.Conclusion: We demonstrate effective assemblies (N50 contig sizes -40 kb) of three different bacterial species using simulated SOLiD data. Sequencing artifacts limit our performance on real data, however our results on this data are substantially betterthan those achieved by competing assemblers.
机译:背景:新短读测序技术产生25-30个碱基配对末端读取的巨量。所得的读取具有非常不同的特性比通过Sanger测序产生的,并且需要比上一代序列装配的不同的方法。在本文中,我们提出了一个短读从头汇编尤其是针对新的SOLiD ABI测序technology.Results:本文介绍了我们认为是第一个从头序列来自新兴坚实的平台,通过引进组装resultson真实数据Applied Biosystems公司。我们的汇编SHORTY增强件短配对读取使用平凡少数(5 - 10) - 500bp的长度300的种子。这些种子使我们使用短的读取覆盖率不超过100 *,这可以在这些高容量定序器的单次运行来获得,以产生显著组件。 SHORTY利用两个思路,我们认为这是利益对短读总成社区:(i)使用单一的种子读结晶组件,和(2)估计从多个跨越配对末端reads.Conclusion准确intercontig距离:我们证明的三种不同的细菌物种有效组件(N50重叠群大小-40 KB)使用模拟固体数据。测序文物限制了我们对真实数据的表现,但是我们对这个数据结果基本上betterthan那些竞争装配来实现的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号