首页>
外国专利>
System and method for multithreaded text indexing for next generation multi-core architectures
System and method for multithreaded text indexing for next generation multi-core architectures
展开▼
机译:用于下一代多核体系结构的多线程文本索引系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system and method for indexing documents in a data storage system includes generating a single document hash table in storage memory for a single document using an index construction in a multithreaded and scalable configuration wherein multiple threads are each assigned work to reduce synchronization between threads. The single document hash table includes partitioning the single document and indexing strings of partitioned portions of the single document to create a minor hash table for each document sub-part; generating a document level hash table from the minor hash tables; updating a stream level hash table for the strings which maps every string to a global identifier; and generating a term reordered array from the document level hash table.
展开▼