首页> 外文会议>Annual Symposium on Combinatorial Pattern Matching(CPM 2006); 20060705-07; Barcelona(ES) >Obtaining Provably Good Performance from Suffix Trees in Secondary Storage
【24h】

Obtaining Provably Good Performance from Suffix Trees in Secondary Storage

机译:从辅助存储中的后缀树获得可证明的良好性能

获取原文
获取原文并翻译 | 示例

摘要

Designing external memory data structures for string databases is of significant recent interest due to the proliferation of biological sequence data. The suffix tree is an important indexing structure that provides optimal algorithms for memory bound data. However, string B-trees provide the best known asymptotic performance in external memory for substring search and update operations. Work on external memory variants of suffix trees has largely focused on constructing suffix trees in external memory or layout schemes for suffix trees that preserve link locality. In this paper, we present a new suffix tree layout scheme for secondary storage and present construction, substring search, insertion and deletion algorithms that are competitive with the string B-tree. For a set of strings of total length n, a pattern p and disk blocks of size B, we provide a substring search algorithm that uses O(|p|/B + log_B n) disk accesses. We present algorithms for insertion and deletion of all suffixes of a string of length m that take O(m log_B (n + m)) and O(m log_B n) disk accesses, respectively. Our results demonstrate that suffix trees can be directly used as efficient secondary storage data structures for string and sequence data.
机译:由于生物序列数据的激增,为字符串数据库设计外部存储器数据结构引起了人们的极大兴趣。后缀树是重要的索引结构,可为内存绑定数据提供最佳算法。但是,字符串B树在外部存储器中为子字符串搜索和更新操作提供了最著名的渐近性能。后缀树的外部存储器变体的工作主要集中于在外部存储器中构造后缀树或用于保留链接局部性的后缀树的布局方案。在本文中,我们提出了一种用于辅助存储的新后缀树布局方案,并提出了与字符串B树竞争的构造,子字符串搜索,插入和删除算法。对于一组总长度为n的字符串,一个模式p和大小为B的磁盘块,我们提供了使用O(| p | / B + log_B n)磁盘访问的子字符串搜索算法。我们介绍了用于插入和删除长度为m的字符串的所有后缀的算法,这些后缀分别需要O(m log_B(n + m))和O(m log_B n)磁盘访问。我们的结果表明,后缀树可以直接用作字符串和序列数据的有效二级存储数据结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号