...
首页> 外文期刊>mSystems >Controlling for Contaminants in Low-Biomass 16S rRNA Gene Sequencing Experiments
【24h】

Controlling for Contaminants in Low-Biomass 16S rRNA Gene Sequencing Experiments

机译:低生物量16S rRNA基因测序实验中污染物的控制

获取原文

摘要

Microbial communities are commonly studied using culture-independent methods, such as 16S rRNA gene sequencing. However, one challenge in accurately characterizing microbial communities is exogenous bacterial DNA contamination, particularly in low-microbial-biomass niches. Computational approaches to identify contaminant sequences have been proposed, but their performance has not been independently evaluated. To identify the impact of decreasing microbial biomass on polymicrobial 16S rRNA gene sequencing experiments, we created a mock microbial community dilution series. We evaluated four computational approaches to identify and remove contaminants, as follows: (i) filtering sequences present in a negative control, (ii) filtering sequences based on relative abundance, (iii) identifying sequences that have an inverse correlation with DNA concentration implemented in Decontam, and (iv) predicting the sequence proportion arising from defined contaminant sources implemented in SourceTracker. As expected, the proportion of contaminant bacterial DNA increased with decreasing starting microbial biomass, with 80.1% of the most diluted sample arising from contaminant sequences. Inclusion of contaminant sequences led to overinflated diversity estimates and distorted microbiome composition. All methods for contaminant identification successfully identified some contaminant sequences, which varied depending on the method parameters used and contaminant prevalence. Notably, removing sequences present in a negative control erroneously removed 20% of expected sequences. SourceTracker successfully removed over 98% of contaminants when the experimental environments were well defined. However, SourceTracker misclassified expected sequences and performed poorly when the experimental environment was unknown, failing to remove 97% of contaminants. In contrast, the Decontam frequency method did not remove expected sequences and successfully removed 70 to 90% of the contaminants. IMPORTANCE The relative scarcity of microbes in low-microbial-biomass environments makes accurate determination of community composition challenging. Identifying and controlling for contaminant bacterial DNA are critical steps in understanding microbial communities from these low-biomass environments. Our study introduces the use of a mock community dilution series as a positive control and evaluates four computational strategies that can identify contaminants in 16S rRNA gene sequencing experiments in order to remove them from downstream analyses. The appropriate computational approach for removing contaminant sequences from an experiment depends on prior knowledge about the microbial environment under investigation and can be evaluated with a dilution series of a mock microbial community.
机译:通常使用与培养无关的方法(例如16S rRNA基因测序)研究微生物群落。但是,准确表征微生物群落的挑战之一是外源细菌DNA污染,特别是在低微生物生物量生态位中。已经提出了确定污染物序列的计算方法,但是其性能尚未得到独立评估。为了确定微生物生物量减少对微生物16S rRNA基因测序实验的影响,我们创建了模拟微生物群落稀释系列。我们评估了四种识别和去除污染物的计算方法,如下所示:(i)阴性对照中存在的过滤序列,(ii)基于相对丰度的过滤序列,(iii)鉴定与DNA浓度负相关的序列Decontam,以及(iv)预测SourceTracker中实现的由定义的污染源引起的序列比例。不出所料,随着原始微生物生物量的减少,细菌细菌DNA的比例增加,其中80.1%的最稀释样品来自污染物序列。污染物序列的包含导致多样性估计过高,微生物组组成失真。所有用于污染物识别的方法都成功地识别了一些污染物序列,这些序列随所使用的方法参数和污染物发生率而变化。值得注意的是,去除阴性对照中存在的序列会错误地去除> 20%的预期序列。明确定义实验环境后,SourceTracker可以成功去除98%以上的污染物。但是,当实验环境未知时,SourceTracker对预期序列进行了错误分类,并且效果不佳,无法去除> 97%的污染物。相反,Decontam频率方法不能去除预期的序列,而是可以成功去除70%至90%的污染物。重要信息低微生物生物量环境中微生物的相对稀缺性使得准确确定群落组成具有挑战性。识别和控制污染物细菌DNA是从这些低生物量环境中了解微生物群落的关键步骤。我们的研究介绍了使用模拟社区稀释液系列作为阳性对照,并评估了四种计算策略,这些策略可以在16S rRNA基因测序实验中识别污染物,以便从下游分析中去除污染物。从实验中去除污染物序列的适当计算方法取决于有关所研究微生物环境的先验知识,并且可以通过模拟微生物群落的稀释系列进行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号