Metagenome assembly validation: Which metagenome contigs are bona fide?

Ji Y.; Li Y.-X.; Cai Y.-D.; Chou K.-C.

Abstract

In metagenomics, long contigs can improve metagenome gene prediction or metagenome sequence binning. They can also make gene function annotation more accurate because they provide substantial genomic context. However, repetitive sequences within or between genomes can cause metagenome contigs to be misassembled, so a method for validating them is essential. Here, we propose a computational method to validate metagenome contigs. After realigning the raw sequencing reads onto a contig, we compute a contig-ECDF (empirical cumulative distribution function) and its corresponding reference using a simulation-based method. Because the reference of a contig-ECDF is fixed for a given set of parameters, we use the difference between the contig-ECDF and its reference to check whether the contig is bona fide: the smaller the difference, the more likely the contig is bona fide. On simulated metagenome datasets, our method showed a good capacity to identify wrongly assembled contigs. We then applied the method to a real metagenome dataset sequenced from an in vitro-simulated microbial community with known constituent genomes. There, the method showed a strong ability to identify bona fide contigs, and small differences between contig-ECDFs and their references were significantly correlated with bona fide contigs. In summary, the method assigns each metagenome contig a score, and the smaller the score, the more likely the contig is bona fide. Validation on both simulated and real datasets showed that the method performs well.
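
The comparison described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say which quantity the contig-ECDF is built from, how the reference is simulated, or how the difference is scored. The sketch assumes the ECDF is taken over read start positions on the contig, that the reference is the average ECDF of reads placed uniformly at random, and that the score is the largest absolute gap between the two curves. The function names `ecdf`, `simulated_reference`, and `contig_score` are hypothetical.

```python
import bisect
import random


def ecdf(values, grid):
    """Empirical CDF of `values`, evaluated at each point of `grid`."""
    ordered = sorted(values)
    n = len(ordered)
    return [bisect.bisect_right(ordered, x) / n for x in grid]


def simulated_reference(contig_len, n_reads, read_len, grid, n_sims=200, seed=0):
    """Average ECDF of read starts under uniform random placement.

    Assumption: uniform placement is the null model for a correct contig.
    """
    rng = random.Random(seed)
    max_start = contig_len - read_len
    total = [0.0] * len(grid)
    for _ in range(n_sims):
        starts = [rng.randint(0, max_start) for _ in range(n_reads)]
        for i, value in enumerate(ecdf(starts, grid)):
            total[i] += value
    return [t / n_sims for t in total]


def contig_score(read_starts, contig_len, read_len, n_points=101):
    """Largest absolute gap between the contig ECDF and its reference.

    Assumption: a smaller score means the contig looks more like the null model.
    """
    max_start = contig_len - read_len
    grid = [max_start * i / (n_points - 1) for i in range(n_points)]
    observed = ecdf(read_starts, grid)
    reference = simulated_reference(contig_len, len(read_starts), read_len, grid)
    return max(abs(o - r) for o, r in zip(observed, reference))


if __name__ == "__main__":
    rng = random.Random(1)
    contig_len, read_len = 10_000, 100
    # Reads spread evenly over the contig.
    even = [rng.randint(0, contig_len - read_len) for _ in range(2_000)]
    # Made-up uneven case: most reads pile up on the first half of the contig.
    skewed = [rng.randint(0, 4_950) for _ in range(1_600)]
    skewed += [rng.randint(4_951, contig_len - read_len) for _ in range(400)]
    print("even placement score:  ", round(contig_score(even, contig_len, read_len), 3))
    print("skewed placement score:", round(contig_score(skewed, contig_len, read_len), 3))
```

In this toy setup the evenly placed reads should score close to zero and the skewed reads noticeably higher, which mirrors the abstract's rule that a smaller score indicates a more plausible contig. Real read start positions would come from the realignment step, and the actual statistic used in the paper may differ.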

Bibliographic details

Source: Current Bioinformatics, 2013, Issue 4, 13 pages
Authors: Ji Y.; Li Y.-X.; Cai Y.-D.; Chou K.-C.
Original format: PDF
Language: English
Chinese Library Classification: 58
Keywords: Bona fide contigs; Computational method; Datasets; Metagenome contigs; Metagenomics; Simulated metagenome

Similar documents

1. Metagenome assembly validation: Which metagenome contigs are bona fide? [J]. Ji Y., Li Y.-X., Cai Y.-D. Current Bioinformatics. 2013, Issue 4.
2. Metagenome from a Spirulina digesting biogas reactor: analysis via binning of contigs and classification of short reads [J]. Vimac Nolla-Ardèvol, Miriam Peces. BMC Microbiology. 2015, Issue 1.
3. ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies [J]. Clark Scott C., Egan Rob, Frazier Peter I. Bioinformatics. 2013, Issue 4.
4. GraphPE: Refining Metagenome Binning by Use of Paired-end Graph of Contigs [C]. Xianghui Liu, Rohan B. H. Williams. International Conference on Bioinformatics and Computational Biology. 2017.
5. Pearl in the mud: Genome assembly and binning of a cold seep Thiomargarita nelsonii cell and associated epibionts from an environmental metagenome [D]. Fliss, Palmer Scott. 2014.
6. Metagenome Assembly and Metagenome-Assembled Genome Sequences from the Rhizosphere of Maize Plants in Mafikeng South Africa [O]. Olubukola O. Babalola, Rebaona R. Molefe, Adenike E. Amoo. 2021.
7. Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes [O]. Nathan D Olson, Todd J Treangen, Christopher M Hill. 2017.