首页> 外国专利> EFFICIENT RETRIEVAL ALGORITHM BY QUERY TERM DISCRIMINATION

EFFICIENT RETRIEVAL ALGORITHM BY QUERY TERM DISCRIMINATION

机译:通过查询术语进行有效的检索算法

摘要

Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverted document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each of the narrowed subset of rows (that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.
机译:描述了一种有效的检索机制,其基于查询词辨别来快速定位文档(例如,对应于在线广告)。根据搜索词的最重要子集(例如,两个),根据其排名重要性来选择,例如,按倒排文档频率进行排名。然后使用最顶层的术语来缩小搜索的反向查询索引的行数,以查找文档标识符和相关分数,例如BM25算法离线计算的分数。例如,对于每个重要术语的每个文档标识符,可以通过比较文档标识符以在每个其他行中跳一个指针,然后在每个行之间缩小指针来执行在每个缩小的行子集(也包含该文档标识符)中的快速搜索。二进制搜索以查找特定文档。然后可以使用该组特定文档的分数来对它们的相对重要性进行排名,以作为结果返回。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号