首页> 美国卫生研究院文献>Microbial Genomics >SynerClust: a highly scalable synteny-aware orthologue clustering tool
【2h】

SynerClust: a highly scalable synteny-aware orthologue clustering tool

机译:SynerClust:一种高度可扩展的可感知协同的直系同源物聚类工​​具

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Accurate orthologue identification is a vital component of bacterial comparative genomic studies, but many popular sequence-similarity-based approaches do not scale well to the large numbers of genomes that are now generated routinely. Furthermore, most approaches do not take gene synteny into account, which is useful information for disentangling paralogues. Here, we present SynerClust, a user-friendly synteny-aware tool based on synergy that can process thousands of genomes. SynerClust was designed to analyse genomes with high levels of local synteny, particularly prokaryotes, which have operon structure. SynerClust’s run-time is optimized by selecting cluster representatives at each node in the phylogeny; thus, avoiding the need for exhaustive pairwise similarity searches. In benchmarking against Roary, Hieranoid2, PanX and Reciprocal Best Hit, SynerClust was able to more completely identify sets of core genes for datasets that included diverse strains, while using substantially less memory, and with scalability comparable to the fastest tools. Due to its scalability, ease of installation and use, and suitability for a variety of computing environments, orthogroup clustering using SynerClust will enable many large-scale prokaryotic comparative genomics efforts.
机译:正确的直向同源物鉴定是细菌比较基因组研究的重要组成部分,但是许多流行的基于序列相似性的方法不能很好地适应现在常规生成的大量基因组。此外,大多数方法都没有考虑基因同义性,这对于解开旁白是有用的信息。在这里,我们介绍SynerClust,这是一种基于用户友好的,可感知协同作用的工具,该工具基于可处理数千个基因组的协同作用。 SynerClust设计用于分析具有高水平局部同义性的基因组,尤其是具有操纵子结构的原核生物。通过在系统发育的每个节点上选择簇代表,可以优化SynerClust的运行时间。因此,避免了详尽的成对相似搜索。在针对Roary,Hieranoid2,PanX和Reciprocal Best Hit进行基准测试时,SynerClust能够为包含各种菌株的数据集更完整地识别核心基因集,同时使用更少的内存,并且可扩展性与最快的工具相当。由于其可伸缩性,易于安装和使用以及适用于各种计算环境,因此使用SynerClust的正交群聚类将使许多大规模的原核比较基因组学工作成为可能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号