首页> 外文会议>Asia-pacific bioinformatics conference >EULER: From Assembling Short DNA Reads to Shotgun Protein Sequencing by Assembling Mass Spectra
【24h】

EULER: From Assembling Short DNA Reads to Shotgun Protein Sequencing by Assembling Mass Spectra

机译:欧拉:通过组装质谱来组装短DNA读取霰弹枪蛋白测序

获取原文

摘要

At present the prokaryote taxonomy is largely based on 16S rRNA phylogeny. There is an urCurrent sequencing technologies attempt to maximize read length without sacrificing basecalling accuracy. Since fragment assembly with inaccurate reads is difficult, the common practice is to trim the inaccurate tails of the reads. We present a new computational technique EULER-USR for assembling short reads that bypasses the problem of limited accuracy in the tails of reads. An important and counterintuitive implication of this result is that one may extend sequencing reactions "past their prime" to where the error rate grows above what is normally acceptable for fragment assembly. We compare EULER-USR with other short read assemblers and illustrate its applications for assembling mate-paired Illumina reads from bacterial genomes. We further address the problem of sequencing molecules that are not directlyinscribed in the genomes (e.g., antibodies or antibiotics-like non-ribosomal peptides) and propose to assemble them from tandem mass spectra. We show that our Eulerian approach to DNA sequencing can be generalized to Shotgun Protein Sequencing (SPS). We illustrate applications of SPS to de novo sequencing of antibodies (collaboration with Jennie Lill at Genentech). We further show how multistage mass-spectrometry enables high-throughput de novo sequencing of peptide-like natural products. This is a joint work with Mark Chaisson, Dima Brinza (DNA fragment assembly), Nuno Bandeira (protein sequencing), Julio Ng and Pieter Dorrestein (sequencing of natural products).
机译:目前,原毒素分类基本上基于16S rRNA文学发生。有一个恒定的测序技术试图最大化读取长度而不牺牲基准的精度。由于具有不准确读取的片段组装很难,但常见的做法是修剪读取的不准确的尾部。我们提出了一个新的计算技术Euler-USR,用于组装短读取,绕过读取的尾部有限的精度问题。对该结果的重要和反思意义是,可以将测序反应延伸到“过去它们的素数”到误差率高于通常可接受的碎片组件的情况。我们将Euler-USR与其他短读组件进行比较,并说明其用于从细菌基因组组装配对的Illumina读取的应用。我们进一步解决了在基因组中未生植物的测序分子(例如,抗体或抗生素样非核糖体肽)的问题,并提出从串联质谱组装它们。我们表明,我们的DNA测序方法可以推广到霰弹枪蛋白测序(SPS)。我们说明了SPS对抗体的Novo测序的应用(与Genentech的Jennie Lill合作)。我们进一步展示多级质谱仪如何能够实现肽样天然产物的高通量Novo测序。这是一个与Mark Chaisson,Dima Brinza(DNA片段组装),Nuno Bandeira(蛋白质测序),Julio Ng和Pieter Dorrestein(天然产物的测序)的联合工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号