A GENERALIZATION OF THE SUFFIX TREE TO SQUARE MATRICES, WITH APPLICATIONS

Giancarlo R.

首页> 外文期刊>SIAM Journal on Computing >A GENERALIZATION OF THE SUFFIX TREE TO SQUARE MATRICES, WITH APPLICATIONS

【24h】

A GENERALIZATION OF THE SUFFIX TREE TO SQUARE MATRICES, WITH APPLICATIONS

机译：平方树上的后缀树的广义化及其应用

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe a new data structure, the Lsuffix tree, which generalizes McCreight's suffix tree for a string [J. Assoc. Comput. Mach.. 23 (1976), pp. 262-272] to a square matrix. All matrices have entries from a totally ordered alphabet Sigma. Based on the Lsuffix tree, we give efficient algorithms for the static versions of the following dual problems that arise in low-level image processing and visual databases. Two-dimensional pattern retrieval. We have a library of texts S = {TEXT(1)....TEXT(r)}, where TEXT(l) is an n(i) x n(i) matrix, 1 less than or equal to i less than or equal to r. We may preprocess the library, Then, given m x m, m less than or equal to n(i), 1 less than or equal to i less than or equal to r, pattern matrix PAT, we want to find all occurrences of PAT in TEXT, for all TEXT is an element of S. Let t(S) = Sigma(i=l)(r)n(i)(2) be the size of the library. The preprocessing step builds the Lsuffix tree for the matrices in S and then transforms it into an index (a trie defined over Sigma). It takes O(t(S)(log Sigma + log t(S))) time and O(t(S)) space. The index can be queried directly in O(m(2) log Sigma + torocc) time. where totocc is the total number of occurrences of PAT in TEXT, for all TEXT is an element of S. Two-dimensional dictionary marching. We have a dictionary of patterns DC = {PAT(1),..., PAT(J)}, where PAT(i) is of dimension m(i) x m(i), 1 less than or equal to i less than or equal to s. We may preprocess the dictionary. Then, given an n x n text matrix TEXT, we want to search for all occurrences of patterns in the dictionary in the text. Let t(DC) = Sigma(i=l)(s)m(i)(2) be the size of the dictionary and let (t) over bar(DC) be the sum of the m(i)'s. The preprocessing consists of building the Lsuffix tree for the matrices in DC. It takes O(t(DC) log Sigma + (t) over bar(DC) log (t) over bar(DC))) time and O(t(DC)) space. The search step takes O(n(2)(log Sigma + log (t) over bar(DC)) + totocc) time, where totocc is the total number of occurrences of patterns in the text. Both problems have a dynamic version in which the library and the dictionary, respectively, can be updated by insertion or deletion of square matrices in them. In a companion paper we will provide algorithms for the dynamic version. [References: 26]

机译：我们描述了一种新的数据结构Lsuffix树，该结构将McCreight的后缀树推广为一个字符串[J.副会长计算[Mach。23（1976），pp。262-272]形成方矩阵。所有矩阵的条目均来自完全有序的字母Sigma。基于Lsuffix树，我们为在低级图像处理和可视数据库中出现的以下双重问题的静态版本提供了有效的算法。二维模式检索。我们有一个文本库S = {TEXT（1）.... TEXT（r）}，其中TEXT（l）是n（i）xn（i）矩阵，小于或等于i小于或等于1等于r。我们可以对库进行预处理，然后，给定mxm，m小于或等于n（i），1小于或等于i小于或等于r，模式矩阵PAT，我们要查找TEXT中所有出现的PAT ，因为所有TEXT是S的元素。令t（S）= Sigma（i = l）（r）n（i）（2）为库的大小。预处理步骤为S中的矩阵构建Lsuffix树，然后将其转换为索引（在Sigma上定义的trie）。它需要O（t（S）（log Sigma + log t（S）））时间和O（t（S））空间。可以直接在O（m（2）log Sigma + torocc）时间中查询索引。其中，totocc是TEXT中发生PAT的总数，因为所有TEXT都是S的元素。二维字典行进。我们有一个模式字典DC = {PAT（1），...，PAT（J）}，其中PAT（i）的尺寸为m（i）xm（i），小于1或等于i小于或等于s。我们可以对字典进行预处理。然后，给定n x n文本矩阵TEXT，我们要搜索文本中字典中所有出现的模式。令t（DC）= Sigma（i = 1）（s）m（i）（2）为字典的大小，令bar（DC）上的（t）为m（i）的总和。预处理包括为DC中的矩阵构建Lsuffix树。它花费O（t（DC）log Sigma +（t）超过bar（DC）log（t）超过bar（DC））时间和O（t（DC））空间。搜索步骤花费O（n（2）（log Sigma + bar（DC）上的log（t））+ totocc）时间，其中totocc是文本中模式出现的总数。这两个问题都有一个动态版本，其中可以通过在其中插入或删除平方矩阵来更新库和字典。在随附的论文中，我们将提供动态版本的算法。 [参考：26]

著录项

来源
《SIAM Journal on Computing》 |1995年第3期|共43页
作者
Giancarlo R.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类应用数学;
关键词
Image processing; Two-dimensional information retrieval; Dictionary matching; Data structures; Pattern matching; Common ancestors;

机译：图像处理二维信息检索字典匹配数据结构模式匹配共同祖先;

相似文献

外文文献
中文文献
专利

1. A GENERALIZATION OF THE SUFFIX TREE TO SQUARE MATRICES, WITH APPLICATIONS [J] . Giancarlo R. SIAM Journal on Computing . 1995,第3期

机译：平方树上的后缀树的广义化及其应用
2. Compressed suffix arrays and suffix trees with applications to text indexing and string matching [J] . Grossi R, Vitter JS SIAM Journal on Computing . 2005,第2期

机译：压缩后缀数组和后缀树及其在文本索引和字符串匹配中的应用
3. Generalizations of suffix arrays to multi-dimensional matrices [J] . Kim DK., Kim YA., Park K. Theoretical computer science . 2003,第1a3期

机译：将后缀数组推广到多维矩阵
4. Compressed suffix arrays and suffix trees with applications to text indexing and string matching (extended abstract) [C] . Roberto Grossi, Jeffrey Scott Vitter Annual ACM symposium on Theory of computing;ACM symposium on Theory of computing . 2000

机译：压缩后缀数组和后缀树及其在文本索引和字符串匹配中的应用（扩展摘要）
5. Suffix trees and suffix arrays in primary and secondary storage [D] . Ko, Pang 2007

机译：主存储和辅助存储中的后缀树和后缀数组
6. Application of permanents of square matrices for DNA identification in multiple-fatality cases [O] . Maiko Narahara, Keiji Tamaki, Ryo Yamada 2013

机译：方阵永久物在多发性死亡病例DNA鉴定中的应用
7. On the Construction of Classes of Suffix Trees for Square Matrices: Algorithms and Applications [O] . GROSSI R, GIANCARLO R 1996

机译：方阵后缀树类的构建：算法与应用

A GENERALIZATION OF THE SUFFIX TREE TO SQUARE MATRICES, WITH APPLICATIONS

摘要

著录项

相似文献

相关主题

期刊订阅