首页> 外文会议>Cairo International Biomedical Engineering Conference >Distributed suffix array construction algorithms: Comparison of two algorithms
【24h】

Distributed suffix array construction algorithms: Comparison of two algorithms

机译:分布式后缀数组构造算法:两种算法的比较

获取原文

摘要

The suffix array is an important indexing data structure for biological sequence analysis. The increasing size of genomic data necessitates the use of a computer cluster to speed up the computation. In this paper, we compare the performance of two distributed suffix array construction algorithms that can run on a computer cluster: the first is Futamura-Aluru-Kurtz algorithm and the second is Kulla-Sanders's. The performance of these two algorithms has not been compared earlier due to the lack of any available software and the difficulty of implementing them. In this paper, we answer a still open question about which algorithm would be better in practice, particularly for genomic data. We have implemented the two algorithms and made them available. The comparison results show that the Futamura-Aluru-Kurtz algorithm is more efficient in practice, despite the fact that the algorithm of Kulla-Sanders has better theoretical time complexity.
机译:后缀数组是用于生物序列分析的重要索引数据结构。基因组数据的大小不断增加,因此有必要使用计算机集群来加快计算速度。在本文中,我们比较了可以在计算机集群上运行的两种分布式后缀数组构造算法的性能:第一种是Futamura-Aluru-Kurtz算法,第二种是Kulla-Sanders算法。由于缺少任何可用软件且难以实现这两种算法,因此尚未对这两种算法的性能进行比较。在本文中,我们回答了一个尚待解决的问题,即哪种算法在实践中会更好,尤其是对于基因组数据而言。我们已经实现了这两种算法,并使它们可用。比较结果表明,尽管Kulla-Sanders算法具有更好的理论时间复杂度,但Futamura-Aluru-Kurtz算法在实践中效率更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号