首页> 外国专利> USING HISTORICAL INFORMATION TO IMPROVE SEARCH ACROSS HETEROGENEOUS INDICES

USING HISTORICAL INFORMATION TO IMPROVE SEARCH ACROSS HETEROGENEOUS INDICES

机译:使用历史信息来改进跨异构索引的搜索

摘要

A method, system and computer program product are disclosed for searching for data. In one embodiment, the invention provides a method comprising identifying a query and a search scope including a set of specified entities; and for each of these entities, estimating a number of documents that would be identified in a search through the entity to answer the query. On the basis of this estimating, a subset of the entities is formed. The query and this subset of entities are sent to a search engine to search the subset of entities to answer the query. In one embodiment, the estimating includes collecting statistical information from queries to build up a historical cache using heuristics or machine learning techniques, wherein the query includes a key word and a scope, and the historical cache contains a maximum number of returned results for an entity given the queries executed.
机译:公开了一种用于搜索数据的方法,系统和计算机程序产品。在一个实施例中,本发明提供了一种方法,包括识别查询和包括一组指定实体的搜索范围;以及对于这些实体中的每一个,估算在通过实体进行搜索以回答查询时将被识别的文档数量。基于此估计,形成实体的子集。该查询和实体的该子集被发送到搜索引擎以搜索实体的子集以回答查询。在一个实施例中,估计包括使用启发式或机器学习技术从查询中收集统计信息以建立历史缓存,其中查询包括关键字和范围,并且历史缓存包含实体的最大返回结果数给定执行的查询。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号