...
首页> 外文期刊>SIAM Journal on Computing >A GENERALIZATION OF THE SUFFIX TREE TO SQUARE MATRICES, WITH APPLICATIONS
【24h】

A GENERALIZATION OF THE SUFFIX TREE TO SQUARE MATRICES, WITH APPLICATIONS

机译:平方树上的后缀树的广义化及其应用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We describe a new data structure, the Lsuffix tree, which generalizes McCreight's suffix tree for a string [J. Assoc. Comput. Mach.. 23 (1976), pp. 262-272] to a square matrix. All matrices have entries from a totally ordered alphabet Sigma. Based on the Lsuffix tree, we give efficient algorithms for the static versions of the following dual problems that arise in low-level image processing and visual databases. Two-dimensional pattern retrieval. We have a library of texts S = {TEXT(1)....TEXT(r)}, where TEXT(l) is an n(i) x n(i) matrix, 1 less than or equal to i less than or equal to r. We may preprocess the library, Then, given m x m, m less than or equal to n(i), 1 less than or equal to i less than or equal to r, pattern matrix PAT, we want to find all occurrences of PAT in TEXT, for all TEXT is an element of S. Let t(S) = Sigma(i=l)(r)n(i)(2) be the size of the library. The preprocessing step builds the Lsuffix tree for the matrices in S and then transforms it into an index (a trie defined over Sigma). It takes O(t(S)(log Sigma + log t(S))) time and O(t(S)) space. The index can be queried directly in O(m(2) log Sigma + torocc) time. where totocc is the total number of occurrences of PAT in TEXT, for all TEXT is an element of S. Two-dimensional dictionary marching. We have a dictionary of patterns DC = {PAT(1),..., PAT(J)}, where PAT(i) is of dimension m(i) x m(i), 1 less than or equal to i less than or equal to s. We may preprocess the dictionary. Then, given an n x n text matrix TEXT, we want to search for all occurrences of patterns in the dictionary in the text. Let t(DC) = Sigma(i=l)(s)m(i)(2) be the size of the dictionary and let (t) over bar(DC) be the sum of the m(i)'s. The preprocessing consists of building the Lsuffix tree for the matrices in DC. It takes O(t(DC) log Sigma + (t) over bar(DC) log (t) over bar(DC))) time and O(t(DC)) space. The search step takes O(n(2)(log Sigma + log (t) over bar(DC)) + totocc) time, where totocc is the total number of occurrences of patterns in the text. Both problems have a dynamic version in which the library and the dictionary, respectively, can be updated by insertion or deletion of square matrices in them. In a companion paper we will provide algorithms for the dynamic version. [References: 26]
机译:我们描述了一种新的数据结构Lsuffix树,该结构将McCreight的后缀树推广为一个字符串[J.副会长计算[Mach。23(1976),pp。262-272]形成方矩阵。所有矩阵的条目均来自完全有序的字母Sigma。基于Lsuffix树,我们为在低级图像处理和可视数据库中出现的以下双重问题的静态版本提供了有效的算法。二维模式检索。我们有一个文本库S = {TEXT(1).... TEXT(r)},其中TEXT(l)是n(i)xn(i)矩阵,小于或等于i小于或等于1等于r。我们可以对库进行预处理,然后,给定mxm,m小于或等于n(i),1小于或等于i小于或等于r,模式矩阵PAT,我们要查找TEXT中所有出现的PAT ,因为所有TEXT是S的元素。令t(S)= Sigma(i = l)(r)n(i)(2)为库的大小。预处理步骤为S中的矩阵构建Lsuffix树,然后将其转换为索引(在Sigma上定义的trie)。它需要O(t(S)(log Sigma + log t(S)))时间和O(t(S))空间。可以直接在O(m(2)log Sigma + torocc)时间中查询索引。其中,totocc是TEXT中发生PAT的总数,因为所有TEXT都是S的元素。二维字典行进。我们有一个模式字典DC = {PAT(1),...,PAT(J)},其中PAT(i)的尺寸为m(i)xm(i),小于1或等于i小于或等于s。我们可以对字典进行预处理。然后,给定n x n文本矩阵TEXT,我们要搜索文本中字典中所有出现的模式。令t(DC)= Sigma(i = 1)(s)m(i)(2)为字典的大小,令bar(DC)上的(t)为m(i)的总和。预处理包括为DC中的矩阵构建Lsuffix树。它花费O(t(DC)log Sigma +(t)超过bar(DC)log(t)超过bar(DC))时间和O(t(DC))空间。搜索步骤花费O(n(2)(log Sigma + bar(DC)上的log(t))+ totocc)时间,其中totocc是文本中模式出现的总数。这两个问题都有一个动态版本,其中可以通过在其中插入或删除平方矩阵来更新库和字典。在随附的论文中,我们将提供动态版本的算法。 [参考:26]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号