首页> 外文期刊>Applied Microbiology >CATCh, an Ensemble Classifier for Chimera Detection in 16S rRNA Sequencing Studies
【24h】

CATCh, an Ensemble Classifier for Chimera Detection in 16S rRNA Sequencing Studies

机译:CATCh,用于16S rRNA测序研究中嵌合体检测的整体分类器

获取原文
       

摘要

In ecological studies, microbial diversity is nowadays mostly assessed via the detection of phylogenetic marker genes, such as 16S rRNA. However, PCR amplification of these marker genes produces a significant amount of artificial sequences, often referred to as chimeras. Different algorithms have been developed to remove these chimeras, but efforts to combine different methodologies are limited. Therefore, two machine learning classifiers (reference-based and de novo CATCh) were developed by integrating the output of existing chimera detection tools into a new, more powerful method. When comparing our classifiers with existing tools in either the reference-based or de novo mode, a higher performance of our ensemble method was observed on a wide range of sequencing data, including simulated, 454 pyrosequencing, and Illumina MiSeq data sets. Since our algorithm combines the advantages of different individual chimera detection tools, our approach produces more robust results when challenged with chimeric sequences having a low parent divergence, short length of the chimeric range, and various numbers of parents. Additionally, it could be shown that integrating CATCh in the preprocessing pipeline has a beneficial effect on the quality of the clustering in operational taxonomic units.
机译:在生态学研究中,如今大多数微生物多样性是通过检测系统发育标记基因(例如16S rRNA)来评估的。但是,这些标记基因的PCR扩增会产生大量人工序列,通常称为嵌合体。已经开发了不同的算法来去除这些嵌合体,但是结合不同方法的努力受到限制。因此,通过将现有的嵌合体检测工具的输出整合到一种新的,功能更强大的方法中,开发了两个机器学习分类器(基于参考的分类器和从头分类的CATCh)。在基于参考模式或从头开始模式下将分类器与现有工具进行比较时,在广泛的测序数据(包括模拟,454焦磷酸测序和Illumina MiSeq数据集)上,我们的集成方法具有更高的性能。由于我们的算法结合了各种不同的嵌合体检测工具的优势,因此,当我们对亲本差异小,嵌合范围短,亲本数量众多的嵌合序列提出挑战时,我们的方法将产生更可靠的结果。此外,可以证明将CATCh集成到预处理流水线中对操作分类单元中的聚类质量具有有益的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号