...
首页> 外文期刊>BMC Bioinformatics >Rapid analysis of metagenomic data using signature-based clustering
【24h】

Rapid analysis of metagenomic data using signature-based clustering

机译:基于签名聚类的批量分析群体数据

获取原文

摘要

Sequencing highly-variable 16S regions is a common and often effective approach to the study of microbial communities, and next-generation sequencing (NGS) technologies provide abundant quantities of data for analysis. However, the speed of existing analysis pipelines may limit our ability to work with these quantities of data. Furthermore, the limited coverage of existing 16S databases may hamper our ability to characterise these communities, particularly in the context of complex or poorly studied environments. In this article we present the SigClust algorithm, a novel clustering method involving the transformation of sequence reads into binary signatures. When compared to other published methods, SigClust yields superior cluster coherence and separation of metagenomic read data, while operating within substantially reduced timeframes. We demonstrate its utility on published Illumina datasets and on a large collection of labelled wound reads sourced from patients in a wound clinic. The temporal analysis is based on tracking the dominant clusters of wound samples over time. The analysis can identify markers of both healing and non-healing wounds in response to treatment. Prominent clusters are found, corresponding to bacterial species known to be associated with unfavourable healing outcomes, including a number of strains of Staphylococcus aureus. SigClust identifies clusters rapidly and supports an improved understanding of the wound microbiome without reliance on a reference database. The results indicate a promising use for a SigClust-based pipeline in wound analysis and prediction, and a possible novel method for wound management and treatment.
机译:测序高度可变16S区域是微生物社区研究的常见且经常有效的方法,下一代测序(NGS)技术提供了丰富的分析数据量。然而,现有分析管道的速度可能限制我们使用这些数量的数据的能力。此外,现有16S数据库的有限覆盖可能会妨碍我们对这些社区的表征的能力,特别是在复杂或较差的环境中的背景下。在本文中,我们介绍了SigClust算法,一种涉及序列转换的新型聚类方法读入二进制签名。与其他公开的方法相比,SigClust会产生优异的聚类相干性和分离偏见读取数据,同时在基本上减少的时间框架内运行。我们展示了它在出版的Illumina数据集和伤口诊所中的患者中的大量标记伤口读取的效用。时间分析基于跟踪伤口样本的主要簇随时间。分析可以鉴定治疗的愈合和非愈合伤口的标记。发现突出的簇,对应于已知与不利的愈合结果相关的细菌种类,包括许多葡萄球菌菌株。 SigClust迅速识别群集,并支持对伤口微生物组的了解,而无需依赖参考数据库。结果表明伤口分析和预测中基于SigClust的管道的有希望的用途,以及可能的伤口管理和治疗方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号