首页> 外文期刊>Theoretical computer science >On-line construction of compact suffix vectors and maximal repeats
【24h】

On-line construction of compact suffix vectors and maximal repeats

机译:紧凑后缀向量和最大重复数的在线构建

获取原文
获取原文并翻译 | 示例
       

摘要

A suffix vector of a string is an index data structure equivalent to a suffix tree. It was first introduced by Monostori et al. in 2001 [K. Monostori, Efficient computational approach to identifying overlapping documents in large digital collections, Ph.D. Thesis, Monash University, 2002; K. Monostori, A. Zaslavsky, H. Schmidt, Suffix vector: Space-and-time-efficient alternative to suffix trees, in: CRPITS'02: Proceedings of the 25th Australasian Computer Science Conference, vol. 4, Darlinghurst, Australia, 2002, Australian Computer Society, Inc. pp 157-166; K. Monostori, A. Zaslavsky, I. Vajk, Suffix vector: A space-efficient suffix tree representation, in: P. Eades, T. Takaoka (Eds.), Proceedings of the 12th International Symposium on Algorithms and Computation, in: Lecture Notes in Computer Science, vol. 2223, Springer-Verlag, Berlin, 2001, pp. 707-718. Christchurch, New Zealand]. They proposed a linear construction algorithm of an extended suffix vector, then another linear algorithm to transform an extended suffix vector into a more space-economical compact suffix vector. We propose an on-line linear algorithm for directly constructing a compact suffix vector. Not only do we show that it is possible to directly build a compact suffix vector but we will also show that this on-line construction can be faster than the construction of the extended suffix vector. Then we formalize the relation between suffix vectors and compact suffix automata which leads to an efficient method for computing maximal repeats using suffix vectors.
机译:字符串的后缀向量是等效于后缀树的索引数据结构。它最初是由Monostori等人介绍的。在2001年[K. Monostori,一种用于识别大型数字馆藏中重叠文档的高效计算方法。论文,莫纳什大学,2002; K. Monostori,A。Zaslavsky,H.Schmidt,后缀矢量:后缀树的时空替代方案,见:CRPITS'02:第25届澳大利亚计算机科学会议论文集,第一卷。 4,澳大利亚达令赫斯特,2002年,澳大利亚计算机学会,第157-166页; K. Monostori,A.Zaslavsky,I.Vajk,后缀向量:一种节省空间的后缀树表示形式,在:P.Eades,T.Takaoka(编),第十二届国际算法与计算研讨会论文集,在:计算机科学讲义,第一卷。 2223,Springer-Verlag,柏林,2001,第707-718页。新西兰克赖斯特彻奇]。他们提出了扩展后缀矢量的线性构建算法,然后提出了将扩展后缀矢量转换为更节省空间的紧凑后缀矢量的另一种线性算法。我们提出了一种直接构造紧凑后缀向量的在线线性算法。我们不仅表明可以直接构建一个紧凑的后缀矢量,而且还表明该在线构建比扩展后缀矢量的构建更快。然后,我们将后缀矢量和紧凑型后缀自动机之间的关系形式化,这导致使用后缀矢量计算最大重复的有效方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号