...
首页> 外文期刊>MBio >Random Sampling Process Leads to Overestimation of β-Diversity of Microbial Communities
【24h】

Random Sampling Process Leads to Overestimation of β-Diversity of Microbial Communities

机译:随机抽样过程导致高估了微生物群落的β多样性

获取原文

摘要

The site-to-site variability in species composition, known as β-diversity, is crucial to understanding spatiotemporal patterns of species diversity and the mechanisms controlling community composition and structure. However, quantifying β-diversity in microbial ecology using sequencing-based technologies is a great challenge because of a high number of sequencing errors, bias, and poor reproducibility and quantification. Herein, based on general sampling theory, a mathematical framework is first developed for simulating the effects of random sampling processes on quantifying β-diversity when the community size is known or unknown. Also, using an analogous ball example under Poisson sampling with limited sampling efforts, the developed mathematical framework can exactly predict the low reproducibility among technically replicate samples from the same community of a certain species abundance distribution, which provides explicit evidences of random sampling processes as the main factor causing high percentages of technical variations. In addition, the predicted values under Poisson random sampling were highly consistent with the observed low percentages of operational taxonomic unit (OTU) overlap (<30% and <20% for two and three tags, respectively, based on both Jaccard and Bray-Curtis dissimilarity indexes), further supporting the hypothesis that the poor reproducibility among technical replicates is due to the artifacts associated with random sampling processes. Finally, a mathematical framework was developed for predicting sampling efforts to achieve a desired overlap among replicate samples. Our modeling simulations predict that several orders of magnitude more sequencing efforts are needed to achieve desired high technical reproducibility. These results suggest that great caution needs to be taken in quantifying and interpreting β-diversity for microbial community analysis using next-generation sequencing technologies. >IMPORTANCE Due to the vast diversity and uncultivated status of the majority of microorganisms, microbial detection, characterization, and quantitation are of great challenge. Although large-scale metagenome sequencing technology such as PCR-based amplicon sequencing has revolutionized the studies of microbial communities, it suffers from several inherent drawbacks, such as a high number of sequencing errors, biases, poor quantitation, and very high percentages of technical variations, which could greatly overestimate microbial biodiversity. Based on general sampling theory, this study provided the first explicit evidence to demonstrate the importance of random sampling processes in estimating microbial β-diversity, which has not been adequately recognized and addressed in microbial ecology. Since most ecological studies are involved in random sampling, the conclusions learned from this study should also be applicable to other ecological studies in general. In summary, the results presented in this study should have important implications for examining microbial biodiversity to address both basic theoretical and applied management questions.
机译:物种组成的站点间差异(称为β多样性)对于理解物种多样性的时空格局以及控制群落组成和结构的机制至关重要。然而,由于测序错误,偏倚以及重现性和定量分析的不完善,使用基于测序的技术对微生物生态学中的β多样性进行量化是一个巨大的挑战。在此,基于通用采样理论,首先开发了一个数学框架,用于在已知或未知群落大小时模拟随机采样过程对量化β多样性的影响。同样,使用泊松采样下有限的采样努力下的类似球示例,发达的数学框架可以准确地预测某些物种丰度分布的同一群落的技术复制样品之间的低重现性,这为随机采样过程提供了明确的证据,因为导致技术变化百分比很高的主要因素。此外,在Poisson随机抽样下的预测值与观察到的较低的操作分类单元(OTU)重叠百分比高度一致(基于Jaccard和Bray-Curtis,两个和三个标签分别<30%和<20%相似性指标),进一步支持以下假设:技术复制品之间的可重复性差是由于与随机采样过程相关的伪影。最后,开发了一个数学框架来预测采样工作,以实现重复样本之间的所需重叠。我们的建模模拟预测,需要更多数量级的测序工作才能实现所需的高技术重现性。这些结果表明,在使用下一代测序技术对微生物群落分析进行定量和解释β多样性时,需要格外谨慎。 >重要性:由于大多数微生物的多样性和未培养状态,微生物的检测,表征和定量面临巨大挑战。尽管大规模的元基因组测序技术(例如基于PCR的扩增子测序)已经彻底改变了微生物群落的研究方法,但它仍面临着一些固有的缺陷,例如,测序错误数量众多,偏差,定量不佳以及技术变化的百分比很高,这可能会大大高估微生物的生物多样性。基于一般采样理论,本研究提供了第一个明确的证据,以证明随机采样过程在估计微生物β多样性中的重要性,而微生物生态学尚未充分认识和解决这一问题。由于大多数生态研究都涉及随机抽样,因此从该研究中得出的结论也应普遍适用于其他生态研究。总而言之,本研究提出的结果对于检查微生物生物多样性以解决基本的理论和应用管理问题均具有重要意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号