Efficient Text Proximity Search

机译：高效文本邻近搜索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In addition to purely occurrence-based relevance models, term proximity has been frequently used to enhance retrieval quality of keyword-oriented retrieval systems. While there have been approaches on effective scoring functions that incorporate proximity, there has not been much work on algorithms or access methods for their efficient evaluation. This paper presents an efficient evaluation framework including a proximity scoring function integrated within a top-k query engine for text retrieval. We propose precomputed and materialized index structures that boost performance. The increased retrieval effectiveness and efficiency of our framework are demonstrated through extensive experiments on a very large text benchmark collection. In combination with static index pruning for the proximity lists, our algorithm achieves an improvement of two orders of magnitude compared to a term-based top-k evaluation, with a significantly improved result quality.

机译：除了纯粹的基于相关的相关模型之外，术语接近经常用于增强关键字的检索系统的检索质量。虽然已经采用了有效评分功能的方法，但是，算法上没有多大的工作，以获得其有效评估的算法或访问方法。本文介绍了一个有效的评估框架，包括集成在Top-K查询引擎内的接近评分功能，用于文本检索。我们提出了预先提升和物化指数结构，以提高性能。通过在非常大的文本基准集合上进行广泛的实验，证明了我们框架的检索效力和效率增加。结合静态指数对接近列表进行修剪，与基于术语的Top-K评估相比，我们的算法实现了两个数量级的提高，结果质量显着提高。

著录项

来源
《International Conference on String Processing and Information Retrieval》|2007年||共13页
会议地点
作者
Ralf Schenkel; Andreas Broschart; Seungwon Hwang; Martin Theobald; Gerhard Weikum;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据备份与恢复;
关键词

相似文献

外文文献
中文文献
专利

1. "Method and System for Converting Image Text Documents in Bit-Mapped Formats to Searchable Text and for Searching the Searchable Text" in Patent Application Approval Process [J] . Robotics and Machine Learning . 2013,第1期

机译：专利申请批准过程中的“将位图格式的图像文本文档转换为可搜索文本并搜索可搜索文本的方法和系统”
2. Patent Issued for Method and System for Converting Image Text Documents in Bit-Mapped Formats to Searchable Text and for Searching the Searchable Text [J] . Robotics and Machine Learning . 2012,第44期

机译：将位图格式的图像文本文档转换为可搜索文本并用于搜索可搜索文本的方法和系统已颁发专利
3. Efficient and Exact Local Search for Random Walk Based Top-K Proximity Query in Large Graphs [J] . Y. Wu, R. Jin, X. Zhang IEEE Transactions on Knowledge and Data Engineering . 2016,第5期

机译：大型图中基于随机游走的Top-K邻近查询的高效且精确的本地搜索
4. Efficient Text Proximity Search [C] . Ralf Schenkel, Andreas Broschart, Seungwon Hwang, International Conference on String Processing and Information Retrieval(SPIRE 2007); 20071029-31; Santiago(CL) . 2007

机译：高效的文本邻近搜索
5. In search of better proximity. [D] . Kumar, Nirman. 2014

机译：寻找更好的接近性。
6. Efficient and Exact Local Search for Random Walk Based Top-K Proximity Query in Large Graphs [O] . Yubao Wu, Ruoming Jin, Xiang Zhang -1

机译：大型图中基于随机游走的Top-K邻近查询的高效且精确的本地搜索
7. Efficient Text Proximity Search [O] . Schenkel R., Broschart A., Hwang S., 2007

机译：高效的文本邻近搜索

Efficient Text Proximity Search

摘要

著录项

相似文献

相关主题

期刊订阅