首页> 美国卫生研究院文献>other >Edge Principal Components and Squash Clustering: Using the Special Structure of Phylogenetic Placement Data for Sample Comparison
【2h】

Edge Principal Components and Squash Clustering: Using the Special Structure of Phylogenetic Placement Data for Sample Comparison

机译:边缘主成分和壁球集群:利用亲缘布置数据的特殊结构的比较举例

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Principal components analysis (PCA) and hierarchical clustering are two of the most heavily used techniques for analyzing the differences between nucleic acid sequence samples taken from a given environment. They have led to many insights regarding the structure of microbial communities. We have developed two new complementary methods that leverage how this microbial community data sits on a phylogenetic tree. Edge principal components analysis enables the detection of important differences between samples that contain closely related taxa. Each principal component axis is a collection of signed weights on the edges of the phylogenetic tree, and these weights are easily visualized by a suitable thickening and coloring of the edges. Squash clustering outputs a (rooted) clustering tree in which each internal node corresponds to an appropriate “average” of the original samples at the leaves below the node. Moreover, the length of an edge is a suitably defined distance between the averaged samples associated with the two incident nodes, rather than the less interpretable average of distances produced by UPGMA, the most widely used hierarchical clustering method in this context. We present these methods and illustrate their use with data from the human microbiome.
机译:主成分分析(PCA)和层次聚类是分析从给定环境中提取的核酸序列样本之间差异的两种最常用的技术。他们导致了有关微生物群落结构的许多见解。我们开发了两种新的互补方法,这些方法利用了该微生物群落数据在系统树上的位置。边缘主成分分析能够检测出包含密切相关分类单元的样品之间的重要差异。每个主成分轴是系统发育树边缘上带符号权重的集合,并且可以通过对边缘进行适当的增厚和着色来轻松查看这些权重。壁球聚类输出(根)聚类树,其中每个内部节点对应于该节点下方叶子处原始样本的适当“平均值”。此外,边缘的长度是与两个入射节点关联的平均样本之间的适当定义的距离,而不是UPGMA(在此情况下使用最广泛的层次聚类方法)产生的距离的难以解释的平均值。我们介绍这些方法,并说明其与人类微生物组数据的结合使用。

著录项

  • 期刊名称 other
  • 作者单位
  • 年(卷),期 -1(8),3
  • 年度 -1
  • 页码 e56859
  • 总页数 15
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

  • 入库时间 2022-08-21 11:22:32

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号