...
首页> 外文期刊>PLoS Computational Biology >16S rRNA sequence embeddings: Meaningful numeric feature representations of nucleotide sequences that are convenient for downstream analyses
【24h】

16S rRNA sequence embeddings: Meaningful numeric feature representations of nucleotide sequences that are convenient for downstream analyses

机译:16S rRNA序列嵌入:有意义的核苷酸序列数字特征表示,便于下游分析

获取原文

摘要

Author summary Improvements in the way genomes are sequenced have led to an abundance of microbiome data. With the right approaches, researchers use these data to thoroughly characterize how microbes interact with each other and their host, but sequencing data is of a form (sequences of letters) not ideal for many data analysis approaches. We therefore present an approach to transform sequencing data into arrays of numbers that can capture interesting qualities of the data at the sub-sequence, full-sequence, and sample levels. This allows us to measure the importance of certain microbial sequences with respect to the type of microbe and the condition of the host. Also, representing sequences in this way improves our ability to use other complicated modeling approaches. Using microbiome data from human samples, we show that our numeric representations captured differences between various types of microbes, as well as differences in the body site location from which the samples were collected.
机译:作者摘要基因组测序方式的改进已导致大量的微生物组数据。通过正确的方法,研究人员可以利用这些数据来全面描述微生物彼此之间及其宿主之间的相互作用方式,但是测序数据的形式(字母序列)对于许多数据分析方法而言并不理想。因此,我们提出了一种将测序数据转换为数字数组的方法,该序列可以捕获子序列,全序列和样本级别的数据的有趣质量。这使我们能够测量某些微生物序列相对于微生物类型和宿主状况的重要性。同样,以这种方式表示序列可以提高我们使用其他复杂建模方法的能力。使用人类样品中的微生物组数据,我们显示出我们的数值表示法捕获了各种类型的微生物之间的差异,以及从中采集样品的身体部位的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号