Indexing weighted-sequences in large databases

Abstract

We present an index structure for managing weighted-sequences in large databases. A weighted-sequence is defined as a two-dimensional structure where each element in the sequence is associated with a weight. A series of network events, for instance, is a weighted-sequence in that each event has a timestamp. Querying a large sequence database by events' occurrence patterns is a first step towards understanding the temporal causal relationships among the events. The proposed index structure enables us to efficiently retrieve from the database all subsequences, possibly noncontiguous, that match a given query sequence both by events and by weights. The index method also takes into consideration the nonuniform frequency distribution of events in the sequence data. In addition, our method finds a broad range of applications in indexing scientific data consisting of multiple numerical columns for discovery of correlations among these columns. For instance, indexing a DNA microarray that records expression levels of genes under different conditions enables us to search for genes whose responses to various experimental perturbations follow a given pattern. We demonstrate, using real-world data sets, that our method is effective and efficient.
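
To make the query semantics concrete, the following is a minimal brute-force sketch in Python of matching a query against one weighted-sequence. It is not the index structure the abstract describes, which is not specified here. It assumes that each element is an (event, weight) pair, that weights are timestamps, and that a subsequence matches when its events equal the query's events in order and its time offsets from the first matched element equal the query's offsets within a tolerance. The names `find_matches`, `Element`, and `tol` are hypothetical.

```python
from typing import List, Tuple

# Assumed representation: (event symbol, weight such as a timestamp).
Element = Tuple[str, float]


def find_matches(data: List[Element], query: List[Element],
                 tol: float = 0.0) -> List[List[int]]:
    """Return index lists of possibly noncontiguous subsequences of `data`
    that match `query` by event and by weight offset (assumed semantics)."""
    matches: List[List[int]] = []
    if not query:
        return matches

    def extend(q_pos: int, start: int, chosen: List[int]) -> None:
        # All query elements matched: record the chosen positions.
        if q_pos == len(query):
            matches.append(list(chosen))
            return
        q_event, q_weight = query[q_pos]
        for i in range(start, len(data)):
            event, weight = data[i]
            if event != q_event:
                continue
            if chosen:
                # Compare offsets relative to the first matched element.
                data_gap = weight - data[chosen[0]][1]
                query_gap = q_weight - query[0][1]
                if abs(data_gap - query_gap) > tol:
                    continue
            chosen.append(i)
            extend(q_pos + 1, i + 1, chosen)
            chosen.pop()

    extend(0, 0, [])
    return matches


# Hypothetical event log: event "a" at time 0, "b" at time 2, and so on.
events = [("a", 0), ("b", 2), ("a", 3), ("c", 5), ("b", 5), ("c", 8)]
# Query: "a", then "b" 2 time units later, then "c" 5 units after "a".
query = [("a", 0), ("b", 2), ("c", 5)]
print(find_matches(events, query))  # [[0, 1, 3], [2, 4, 5]]
```

This scan enumerates candidates directly and can be very slow on long sequences, which is the kind of cost an index over a large database is meant to avoid.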

Bibliographic details

Source
Knowledge-Based Systems for Safety Critical Applications | 1994 | pp. 63-74 | 12 pages
Authors
Wang H.; Perng C.-S.; Fan W.; Park S.; Yu P.S.
Affiliation
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology

Similar literature

1. A comparative study of the origin, structure, and indexing language of the Persian and English keywords of articles indexed in the IranMedex database and their compliance with the Persian medical thesaurus and Medical Subject Headings [J]. Parastoo Parsaei-Mohammadi, Ali Hossein Ghasemi, Raziyeh Hassanzadeh-Beheshtabad. Journal of Education and Health Promotion. 2017, No. 1
2. The Adaptive Evolution Database (TAED): A New Release of a Database of Phylogenetically Indexed Gene Families from Chordates [J]. Hermansen Russell A., Oswald Benjamin P., Knight Stormy, Annals of Forest Science. 2017, No. 2
3. The Adaptive Evolution Database (TAED): A New Release of a Database of Phylogenetically Indexed Gene Families from Chordates [J]. Hermansen Russell A., Oswald Benjamin P., Knight Stormy, Journal of Molecular Evolution. 2017, No. 1a2
4. Indexing weighted-sequences in large databases [C]. Wang, H., Perng, . 2003
5. A Novel GPU Algorithm for Indexing Columnar Databases with Column Imprints [D]. Mannem, Manaswi. 2020
6. Geographically Indexed Referral Databases to Address Social Needs in the Emergency Department [O]. Alexa Curt, Hazar Khidir, Gia Ciccolo, 2021
7. Indexing Weighted-Sequences in Large Databases [O]. Haixun Wang, Chang-shing Perng, Wei Fan, 2003
