Distributed parallel generation of indices for very large text databases


Abstract

We propose a new algorithm for the parallel generation of suffix arrays for large text databases on high-bandwidth computer networks. Suffix arrays are full-text indexing structures that support very powerful query languages. Our algorithm is based on a parallel indirect mergesort (it is not a simple mergesort procedure) and is compared with a well-known sequential algorithm that runs very efficiently on a single machine. Although network-bound, the parallel version is, both theoretically and experimentally, a much better alternative than the sequential version, which is bound by disk I/O.
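To make the terms in the abstract concrete, the following is a minimal single-process sketch in Python, not the authors' algorithm. It assumes a suffix array is stored as a list of text positions ordered by the suffix each position starts, that "indirect" sorting means sorting those positions rather than the suffixes themselves, and that each partition of positions is sorted locally before a k-way merge. The function names (`build_suffix_array`, `build_partitioned`, `find`) are hypothetical, and the network distribution described in the abstract is not modeled.

```python
import heapq


def build_suffix_array(text):
    """Indirect sort: order the positions by the suffix each one points to."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def build_partitioned(text, parts):
    """Sort each block of positions locally, then k-way merge the sorted runs.

    Each run stands in for the work one machine might do; here everything
    runs in a single process for illustration only.
    """
    n = len(text)
    bounds = [n * p // parts for p in range(parts + 1)]
    runs = [
        sorted(range(bounds[p], bounds[p + 1]), key=lambda i: text[i:])
        for p in range(parts)
    ]
    # heapq.merge compares positions indirectly through their suffixes.
    return list(heapq.merge(*runs, key=lambda i: text[i:]))


def find(text, sa, pattern):
    """Return all positions where pattern occurs, via two binary searches."""
    m = len(pattern)

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(sa[start:lo])
```

For example, `find("banana", build_partitioned("banana", 3), "ana")` locates the pattern at positions 1 and 3. Slicing a fresh suffix for every comparison is far too costly for large texts; it is used here only to keep the sketch short.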


Bibliographic details

Source
International Conference on Algorithms and Architectures for Parallel Processing | 1997 | 8 pages
Authors
Kitajima J.P.; Resende M.D.; Institute of Electric and Electronic Engineer
Original format: PDF
Chinese Library Classification: Automation technology, computer technology

Similar literature

1. An efficient synchronous indexing technique for full-text retrieval in distributed databases [J]. Fadoua Hassen, Grissa Touzi Amel. Procedia Computer Science. 2017, No. 22
2. Index Structures for Distributed Text Databases [J]. M. Marin. Journal of Computer Science and Technology. 2004, No. 1
3. Parallel mining of association rules from text databases [J]. John D. Holt, Soon M. Chung. Journal of Supercomputing. 2007, No. 3
4. Distributed parallel generation of indices for very large text databases [C]. Kitajima, J.P., Resende, . 1997
5. SigSpace-Text: Parallel and distributed signature learning in text analytics [D]. Bandi, Rakesh Reddy. 2016
6. SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases [O]. Hirokazu Chiba, Ikuo Uchiyama. 2017
7. Parallel Generation of Inverted Files for Distributed Text Collections [O]. Berthier A. Ribeiro-neto, Joao Paulo Kitajima, Gonzalo Navarro. 1998
