首页> 外文OA文献 >Deriving accurate microbiota profiles from human samples with low bacterial content through post-sequencing processing of Illumina MiSeq data
【2h】

Deriving accurate microbiota profiles from human samples with low bacterial content through post-sequencing processing of Illumina MiSeq data

机译:通过对Illumina MiSeq数据进行序列后处理,从细菌含量低的人类样品中得出准确的微生物群谱

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundThe rapid expansion of 16S rRNA gene sequencing in challenging clinical contexts has resulted in a growing body of literature of variable quality. To a large extent, this is due to a failure to address spurious signal that is characteristic of samples with low levels of bacteria and high levels of non-bacterial DNA. We have developed a workflow based on the paired-end read Illumina MiSeq-based approach, which enables significant improvement in data quality, post-sequencing. We demonstrate the efficacy of this methodology through its application to paediatric upper-respiratory samples from several anatomical sites.ResultsA workflow for processing sequence data was developed based on commonly available tools. Data generated from different sample types showed a marked variation in levels of non-bacterial signal and ‘contaminant’ bacterial reads. Significant differences in the ability of reference databases to accurately assign identity to operational taxonomic units (OTU) were observed. Three OTU-picking strategies were trialled as follows: de novo, open-reference and closed-reference, with open-reference performing substantially better. Relative abundance of OTUs identified as potential reagent contamination showed a strong inverse correlation with amplicon concentration allowing their objective removal. The removal of the spurious signal showed the greatest improvement in sample types typically containing low levels of bacteria and high levels of human DNA. A substantial impact of pre-filtering data and spurious signal removal was demonstrated by principal coordinate and co-occurrence analysis. For example, analysis of taxon co-occurrence in adenoid swab and middle ear fluid samples indicated that failure to remove the spurious signal resulted in the inclusion of six out of eleven bacterial genera that accounted for 80% of similarity between the sample types.ConclusionsThe application of the presented workflow to a set of challenging clinical samples demonstrates its utility in removing the spurious signal from the dataset, allowing clinical insight to be derived from what would otherwise be highly misleading output. While other approaches could potentially achieve similar improvements, the methodology employed here represents an accessible means to exclude the signal from contamination and other artefacts.
机译:背景技术在具有挑战性的临床环境中,16S rRNA基因测序的快速扩展导致了质量不断变化的文献的增长。在很大程度上,这是由于无法处理虚假信号所致,而虚假信号是细菌含量低且非细菌DNA含量高的样品的特征。我们已经开发出了基于基于Illumina MiSeq的配对末端读取方法的工作流程,该方法可以显着改善数据质量和后排序。我们通过将其应用到几个解剖部位的儿科上呼吸道样本中,证明了该方法的有效性。结果基于通用工具开发了用于处理序列数据的工作流程。从不同样品类型生成的数据显示非细菌信号和“污染物”细菌读数的水平明显不同。观察到参考数据库将身份准确分配给操作分类单位(OTU)的能力存在显着差异。尝试了以下三种OTU挑选策略:从头开始,开放参考和封闭参考,开放参考的性能要好得多。被确定为潜在试剂污染的OTU的相对丰度与扩增子浓度呈强烈反相关,从而可以将其客观去除。杂散信号的去除显示出通常包含低细菌水平和高人类DNA水平的样品类型的最大改进。主坐标和共现分析证明了预滤波数据和杂散信号去除的重大影响。例如,对腺样拭子和中耳液样本中的分类单元共现进行的分析表明,未能消除杂散信号导致11个细菌属中的6个被包括在内,这占样本类型之间相似性的80%。提出的工作流程对一组具有挑战性的临床样品的演示证明了其在从数据集中去除虚假信号的效用,从而可以从否则会极具误导性的输出中获得临床见解。尽管其他方法可能会实现类似的改进,但此处采用的方法代表了一种从污染和其他伪影中排除信号的便捷方法。

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号