【24h】

Efficient Lazy Algorithms for Minimal-Interval Semantics

机译:最小间隔语义的高效惰性算法

获取原文
获取原文并翻译 | 示例

摘要

Minimal-interval semantics [3] associates with each query over a document a set of intervals, called witnesses, that are incomparable with respect to inclusion (i.e., they form an antichain): witnesses define the minimal regions of the document satisfying the query. Minimal-interval semantics makes it easy to define and compute several sophisticated proximity operators, provides snippets for user presentation, and can be used to rank documents: thus, computing efficiently the antichains obtained by operations such as logic conjunction and disjunction is a basic issue. In this paper we provide the first algorithms for computing such operators that are linear in the number of intervals and logarithmic in the number of input antichains. The space used is linear in the number of antichains. Moreover, the algorithms are lazy — they do not assume random access to the input antichains. These properties make the usage of our algorithms feasible in large-scale web search engines.
机译:最小间隔语义[3]与文档中的每个查询关联一组称为见证的间隔,这些间隔在包含方面是无与伦比的(即,它们形成反链):见证定义了满足查询条件的文档的最小区域。最小间隔语义使定义和计算几个复杂的邻近运算符变得容易,为用户演示提供摘要,并可用于对文档进行排名:因此,有效地计算通过逻辑合取和析取等操作获得的反链是一个基本问题。在本文中,我们提供了用于计算此类算子的第一个算法,该算子的区间数为线性,输入反链数为对数。使用的空间在反链数量上是线性的。此外,这些算法是惰性的-它们不假定对输入反链的随机访问。这些属性使我们的算法在大规模Web搜索引擎中的使用成为可能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号