首页> 外文期刊>Algorithmica >Linearized Suffix Tree: an Efficient Index Data Structure with the Capabilities of Suffix Trees and Suffix Arrays
【24h】

Linearized Suffix Tree: an Efficient Index Data Structure with the Capabilities of Suffix Trees and Suffix Arrays

机译:线性化后缀树:具有后缀树和后缀数组功能的高效索引数据结构

获取原文
获取原文并翻译 | 示例

摘要

Suffix trees and suffix arrays are fundamental full-text index data structures to solve problems occurring in string processing. Since suffix trees and suffix arrays have different capabilities, some problems are solved more efficiently using suffix trees and others are solved more efficiently using suffix arrays. We consider efficient index data structures with the capabilities of both suffix trees and suffix arrays without requiring much space. When the size of an alphabet is small, enhanced suffix arrays are such index data structures. However, when the size of an alphabet is large, enhanced suffix arrays lose the power of suffix trees. Pattern searching in an enhanced suffix array takes O(m|Σ|) time while pattern searching in a suffix tree takes O(mlog |Σ|) time where m is the length of a pattern and Σ is an alphabet. In this paper, we present linearized suffix trees which are efficient index data structures with the capabilities of both suffix trees and suffix arrays even when the size of an alphabet is large. A linearized suffix tree has all the functionalities of the enhanced suffix array and supports the pattern search in O(mlog |Σ|) time. In a different point of view, it can be considered a practical implementation of the suffix tree supporting O(mlog |Σ|)-time pattern search.
机译:后缀树和后缀数组是基本的全文本索引数据结构,用于解决字符串处理中出现的问题。由于后缀树和后缀数组具有不同的功能,因此使用后缀树可以更有效地解决一些问题,而使用后缀数组可以更有效地解决其他问题。我们考虑具有后缀树和后缀数组的功能的高效索引数据结构,而不需要太多空间。当字母的大小较小时,增强的后缀数组就是这样的索引数据结构。但是,当字母的大小很大时,增强的后缀数组会失去后缀树的功能。在增强的后缀数组中进行模式搜索需要O(m |Σ|)时间,而在后缀树中进行模式搜索则需要O(mlog |Σ|)时间,其中m是模式的长度,而Σ是字母。在本文中,我们提出了线性化的后缀树,它是高效的索引数据结构,即使字母表大小很大,它也具有后缀树和后缀数组的功能。线性后缀树具有增强后缀数组的所有功能,并支持O(mlog |Σ|)时间中的模式搜索。从不同的角度来看,可以将其视为支持O(mlog |Σ|)时间模式搜索的后缀树的实际实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号