【24h】

Metasearch. Properties of Common Documents Distributions

机译:元搜索。普通文件分配的属性

获取原文
获取原文并翻译 | 示例

摘要

The effectiveness of metasearch data fusion procedures depends crucially on the properties of common documents distributions. Because we usually know neither how different search engines assign relevance scores nor the similarity of these assignments, common documents of the individual ranked lists are the only base of combining search results. So it is very important to study the properties of common documents distributions. One of these properties is the Overlap Property (OP) of documents retrieved by different search engines. According to OP, the overlap between the relevant documents is usually greater than the overlap between non-relevant ones. Although OP was repeatedly observed and discussed, no theoretical explanation of this empirical property was elaborated. This paper considers formal research of properties of the common documents distributions. In particular, sufficient and necessary condition of OP is elaborated and it is proved that OP should take place practically under arbitrary circumstances.
机译:元搜索数据融合程序的有效性主要取决于通用文档分发的属性。因为我们通常既不知道不同的搜索引擎如何分配相关性分数,也不知道这些分配的相似性,所以各个排名列表的通用文档是组合搜索结果的唯一基础。因此,研究常用文档分配的属性非常重要。这些属性之一是由不同搜索引擎检索的文档的重叠属性(OP)。根据OP,相关文件之间的重叠通常大于不相关文件之间的重叠。尽管反复观察和讨论了OP,但仍未对这种经验特性进行理论解释。本文考虑对常用文档分布的属性进行形式化研究。特别是,详细阐述了OP的充分必要条件,并证明了OP应该实际上在任意情况下进行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号