Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads

Daniel H. Huson; Rewati Tappu; Adam L Bazinet; Chao Xie; Michael P. Cummings; Kay Nieselt; Rohan Williams

首页> 外文期刊>Microbiome >Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads

【24h】

Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads

机译：通过微生物组测序读取直系同源基因家族的快速简单的蛋白质比对引导组装

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

BackgroundMicrobiome sequencing projects typically collect tens of millions of short reads per sample. Depending on the goals of the project, the short reads can either be subjected to direct sequence analysis or be assembled into longer contigs. The assembly of whole genomes from metagenomic sequencing reads is a very difficult problem. However, for some questions, only specific genes of interest need to be assembled. This is then a gene-centric assembly where the goal is to assemble reads into contigs for a family of orthologous genes. MethodsWe present a new method for performing gene-centric assembly, called protein-alignment-guided assembly, and provide an implementation in our metagenome analysis tool MEGAN. Genes are assembled on the fly, based on the alignment of all reads against a protein reference database such as NCBI-nr. Specifically, the user selects a gene family based on a classification such as KEGG and all reads binned to that gene family are assembled. ResultsUsing published synthetic community metagenome sequencing reads and a set of 41 gene families, we show that the performance of this approach compares favorably with that of full-featured assemblers and that of a recently published HMM-based gene-centric assembler, both in terms of the number of reference genes detected and of the percentage of reference sequence covered. ConclusionsProtein-alignment-guided assembly of orthologous gene families complements whole-metagenome assembly in a new and very useful way.

机译：背景微生物组测序项目通常每个样本收集数千万个短读。根据项目目标，可以将短读段进行直接序列分析，也可以将其组合成更长的重叠群。从宏基因组测序读取中组装整个基因组是一个非常困难的问题。但是，对于某些问题，仅需要组装特定的目标基因。然后，这是一个以基因为中心的装配，其目标是将读段装配到直系同源基因家族的重叠群中。方法我们提出了一种新的执行以基因为中心的装配的方法，称为蛋白质比对引导装配，并在我们的元基因组分析工具MEGAN中提供了一种实现方法。基于蛋白质参考数据库（例如NCBI-nr）的所有读段的比对，基因可以即时组装。具体而言，用户根据分类（例如KEGG）选择一个基因家族，并组装到该基因家族的所有读段。结果使用已发表的合成社区元基因组测序读数和一组41个基因家族，我们显示出该方法的性能与功能齐全的汇编程序以及最近发布的基于HMM的以基因为中心的汇编程序相比具有优越的优势。检测到的参考基因的数量以及所覆盖参考序列的百分比。结论蛋白质比对引导的直系同源基因家族的装配以一种新的且非常有用的方式补充了全基因组装配。

著录项

来源
《Microbiome》 |2017年第1期|共1页
作者
Daniel H. Huson; Rewati Tappu; Adam L Bazinet; Chao Xie; Michael P. Cummings; Kay Nieselt; Rohan Williams;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类消化生理学;
关键词

相似文献

外文文献
中文文献
专利

1. Simple and Scalable Genome Analysis with Transposase Enzyme Linked Long-Read Sequencing (TELL-Seq): From Haplotype Phasing to De Novo Assembly in a Tube [J] . Tom Chen Journal of biomolecular techniques :JBT. . 2019,第Suppl期

机译：使用转座酶链接的长读测序（TELL-Seq）进行简单且可扩展的基因组分析：从单倍型测序到管中的De Novo组装
2. Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly [J] . Wang Ou, Chin Robert, Cheng Xiaofang, Genome research . 2019,第5期

机译：高DNA分子的高效且独特的二代测序读取读取，从而实现具有成本效益和准确的测序，单倍型和De Novo组装
3. MECAT : fast mapping, error correction, and de novo assembly for single-molecule sequencing reads [J] . Xiao Chuan-Le, Chen Ying, Xie Shang-Qian, Nature methods . 2017,第11期

机译：Mecat：用于单分子测序的快速映射，纠错和DE Novo组件读取
4. Read cloud sequencing elucidates microbiome dynamics in a hematopoietic cell transplant patient [C] . Joyce Kang, Benjamin Siranosian, Eli Moss, IEEE International Conference on Bioinformatics and Biomedicine . 2018

机译：读取云测序可阐明造血细胞移植患者的微生物组动态
5. Identification of a large set of single copy orthologous genes (COSII) across euasterid species by combining bioinformatics and phylogenetics and a study of genome evolution in the family Solanaceae in a phylogenetic context [D] . Wu, Feinan 2007

机译：通过结合生物信息学和系统发育学以及研究茄科中茄科的基因组进化研究，在整个泛型物种中鉴定了一大套单拷贝直系同源基因（COSII）
6. Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads [O] . Daniel H. Huson, Rewati Tappu, Adam L Bazinet, 2017

机译：微生物组测序读取直系同源基因家族的快速简单的蛋白质比对引导组装
7. Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads [O] . Daniel H. Huson, Rewati Tappu, Adam L Bazinet, 2017

机译：微生物组测序读取直系同源基因家族的快速简单的蛋白质比对引导组装

Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅