首页> 外文期刊>Expert systems with applications >Clustering based distributed phylogenetic tree construction
【24h】

Clustering based distributed phylogenetic tree construction

机译:基于聚类的分布式系统树构建

获取原文
获取原文并翻译 | 示例
           

摘要

Phylogenetic tree construction has received much attention recently due to the availability of vast biological data. In this study, we provide a three step method to build phylogenetic trees. Firstly, a density-based clustering algorithm is used to provide clusters of the population at hand using the distance matrix which shows the distances of the species. Secondly, a phylogenetic tree for each cluster is constructed by using the neighbor-joining (NJ) algorithm and finally, the roots of the small phylogenetic trees are connected again by the NJ algorithm to form one large phylogenetic tree. To our knowledge, this is the first method for building phylogenetic trees that uses clustering prior to forming the tree. As such, it provides independent phylogenetic tree formation within each cluster as the second step, hence is suitable for parallel/distributed processing, enabling fast processing of very large biological data sets. The proposed method, clustered neighbor-joining (CNJ) is applied to 145 samples from the Y-DNA Haplogroup G. Distances between male samples are the variation in their set of Y-chromosomal short tandem repeat (STR) values. We show that the clustering method we use is superior to other clustering methods as applied to Y-DNA data and also independent, fast distributed construction of phylogenetic trees is possible with this method.
机译:由于大量生物数据的可获得性,系统发生树的构建最近受到了广泛的关注。在这项研究中,我们提供了一种建立系统树的三步法。首先,使用基于密度的聚类算法,使用距离矩阵显示物种的距离,以提供手头种群的集群。其次,使用邻接算法(NJ)构造每个聚类的系统树,最后,通过NJ算法再次将小系统树的根连接起来,从而形成一棵大型系统树。据我们所知,这是构建系统发育树的第一种方法,该方法在形成树之前先进行聚类。这样,作为第二步,它在每个群集中提供独立的系统树生成,因此适用于并行/分布式处理,从而可以快速处理非常大的生物学数据集。所提出的方法,聚簇邻居连接(CNJ)被应用于来自Y-DNA Haplogroup G的145个样品。男性样品之间的距离是它们的Y染色体短串联重复序列(STR)值的变化。我们表明,我们使用的聚类方法优于应用于Y-DNA数据的其他聚类方法,并且使用该方法可以独立,快速地构建系统发育树。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号