Selective Search: Efficient and Effective Search of Large Textual Collections

Kulkarni Anagha; Callan Jamie

首页> 外文期刊>ACM Transactions on Information Systems >Selective Search: Efficient and Effective Search of Large Textual Collections

【24h】

Selective Search: Efficient and Effective Search of Large Textual Collections

机译：选择性搜索：大型文本集的高效搜索

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The traditional search solution for large collections divides the collection into subsets (shards), and processes the query against all shards in parallel (exhaustive search). The search cost and the computational requirements of this approach are often prohibitively high for organizations with few computational resources. This article investigates and extends an alternative: selective search, an approach that partitions the dataset based on document similarity to obtain topic-based shards, and searches only a few shards that are estimated to contain relevant documents for the query. We propose shard creation techniques that are scalable, efficient, self-reliant, and create topic-based shards with low variance in size, and high density of relevant documents.

机译：针对大型集合的传统搜索解决方案将集合分为子集（分片），并并行处理针对所有分片的查询（穷举搜索）。对于具有较少计算资源的组织，此方法的搜索成本和计算要求通常过高。本文研究并扩展了另一种方法：选择性搜索，一种基于文档相似性对数据集进行分区以获取基于主题的分片的方法，并且仅搜索一些估计包含相关查询文档的分片。我们提出了可扩展，高效，自力更生的分片创建技术，并创建了基于主题的分片，这些分片的大小差异很小，相关文档的密度很高。

著录项

来源
《ACM Transactions on Information Systems》 |2015年第4期|17.1-17.33|共33页
作者
Kulkarni Anagha; Callan Jamie;
展开▼
作者单位

San Francisco State Univ, Dept Comp Sci, San Francisco, CA 94132 USA;

Carnegie Mellon Univ, Pittsburgh, PA 15213 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Design; Algorithms; Experimentation; Performance; Large-scale text search; selective search; partitioned search; distributed information retrieval; document collection organization; resource selection;

机译：设计;算法;实验;性能;大型文本搜索;选择性搜索;分区搜索;分布式信息检索;文档收集组织;资源选择;

相似文献

外文文献
中文文献
专利

1. TSS: Efficient Term Set Search in Large Peer-to-Peer Textual Collections [J] . Computers, IEEE Transactions on . 2010,第7期

机译：TSS：大型对等文本集合中的有效术语集搜索
2. Using a Google Search Appliance (GSA) to search digital library collections: a case study of the INIS Collection Search [J] . Dobrica Savic JLIS.it . 2014,第2期

机译：使用Google Search Appliance（GSA）搜索数字图书馆馆藏：以INIS馆藏搜索为例
3. Power-Efficient Nonvolatile Ternary Content Addressable Memory with Flexible Search Scope Using Reconfigurable Match Line Segment and Selective Search Line [J] . Kim Cheol, Ahn Sung-Gi, Min Jisu, Journal of nanoscience and nanotechnology . 2019,第10期

机译：功能高效的非易失性三元内容可寻址存储器，使用可重新配置匹配线段和选择性搜索线具有灵活搜索范围
4. Efficient Search in Large Textual Collections with Redundancy [C] . Jiangong Zhang, Torsten Suel Proceedings of the Sixteenth international world wide web conference(WWW2007) . 2007

机译：大型文本集中的有效搜索与冗余
5. Use of non-biased combinatorial libraries in the search of leads for: (1) Selective and efficient cleavage of DAlaDLac, depsipeptide found in the cell of vancomycin resistant bacteria. (2) Cyclization of an epoxy-alcohol to the energetically disfavored product. [D] . Chiosis, Gabriela. 1998

机译：无偏组合文库在寻找前导的过程中的用途：（1）选择性有效地裂解万古霉素抗性细菌细胞中发现的去污肽DAlaDLac。（2）将环氧-醇环化成能量上不利的产物。
6. Comparative homology agreement search: An effective combination of homology-search methods [O] . Intikhab Alam, Andreas Dress, Marc Rehmsmeier, 2004

机译：比较同源性一致性搜索：同源性搜索方法的有效组合
7. TSS: Efficient Term Set Search in Large Peer-to-Peer Textual Collections [O] . Chen, Hanhua, Yan, Jun, Jin, Hai, 2010

机译：TSS：大型对等文本集合中的有效术语集搜索

Selective Search: Efficient and Effective Search of Large Textual Collections

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅