System and method for querying multiple, distributed databases by selective sharing of local relative significance information for terms related to the query

Abstract

A system, method, and various software products improve information retrieval from multiple document databases by retrieving, in response to a user query, a set of documents that globally satisfy the query, even though each database maintains its own document indices, term frequency information, and scoring functions. The global search result approximates, to any desired degree of error, the result that would have been obtained had the multiple document databases been globally indexed. This is done by sharing, at the time the query is executed, a small subset of information about the local relative significance of terms related to the user's query and, from this information, determining a global relative significance of those terms. Using the global relative significance, the individual document databases determine their query results, which are then merged into a global set of documents satisfying the query. The shared local relative significance information may be the inverse document frequency of each of a number of terms related to the query, or it may be the total frequency of each such term. Correspondingly, the global relative significance may be a global inverse document frequency, or a global term frequency from which the global inverse document frequency is calculated.
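
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class `LocalDatabase`, the functions `global_idf` and `federated_search`, the whitespace tokenizer, the `log(N / df)` form of the inverse document frequency, and the simple term-count-times-IDF score are all assumptions chosen for illustration. The sketch also reads "total frequency" as the number of local documents containing a term, which the abstract does not specify.

```python
import heapq
import math
from collections import Counter


class LocalDatabase:
    """Hypothetical stand-in for one independently indexed document database."""

    def __init__(self, docs):
        # docs maps a document id to its text; tokenization is plain whitespace splitting.
        self.term_counts = {doc_id: Counter(text.lower().split())
                            for doc_id, text in docs.items()}

    def local_stats(self, terms):
        """Return (local document count, local document frequency per query term)."""
        df = {t: sum(1 for counts in self.term_counts.values() if t in counts)
              for t in terms}
        return len(self.term_counts), df

    def search(self, terms, idf, k):
        """Score local documents with the shared global IDF and return the top k."""
        scored = []
        for doc_id, counts in self.term_counts.items():
            score = sum(counts[t] * idf[t] for t in terms)
            if score > 0:
                scored.append((score, doc_id))
        return heapq.nlargest(k, scored)


def global_idf(stats, terms):
    """Combine the per-database statistics into one global IDF per term."""
    total_docs = sum(n for n, _ in stats)
    idf = {}
    for t in terms:
        df = sum(local_df[t] for _, local_df in stats)
        # Assumed IDF form; a term found nowhere gets zero weight.
        idf[t] = math.log(total_docs / df) if df else 0.0
    return idf


def federated_search(databases, query, k=3):
    """Share local statistics for the query terms, then merge per-database results."""
    terms = list(dict.fromkeys(query.lower().split()))  # de-duplicate, keep order
    stats = [db.local_stats(terms) for db in databases]
    idf = global_idf(stats, terms)
    merged = []
    for db in databases:
        merged.extend(db.search(terms, idf, k))
    return [doc_id for _, doc_id in heapq.nlargest(k, merged)]
```

Only statistics for the query's own terms cross database boundaries in this sketch, which mirrors the "small subset of information" in the abstract. Because every database scores with the same global weights, the per-database scores are comparable and can be merged directly.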

Bibliographic details

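  • Publication number: US5826261A
  • Publication date: 1998-10-20
  • Original format: PDF
  • Applicant/patentee: SPENCER;GRAHAM;
  • Application number: US19960644302
  • Inventor: GRAHAM SPENCER
  • Filing date: 1996-05-10
  • Classification: G06F17/30
  • Country: US
  • Database entry time: 2022-08-22 02:38:21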
