Conference: International Conference on String Processing and Information Retrieval

Efficient Lazy Algorithms for Minimal-Interval Semantics

Abstract

Minimal-interval semantics [3] associates with each query over a document a set of intervals, called witnesses, that are incomparable with respect to inclusion (i.e., they form an antichain): witnesses define the minimal regions of the document satisfying the query. Minimal-interval semantics makes it easy to define and compute several sophisticated proximity operators, provides snippets for user presentation, and can be used to rank documents: thus, computing efficiently the antichains obtained by operations such as logic conjunction and disjunction is a basic issue. In this paper we provide the first algorithms for computing such operators that are linear in the number of intervals and logarithmic in the number of input antichains. The space used is linear in the number of antichains. Moreover, the algorithms are lazy — they do not assume random access to the input antichains. These properties make the usage of our algorithms feasible in large-scale web search engines.
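
To make the disjunction operator concrete, here is a minimal Python sketch of a lazy OR over several input antichains. It is an illustration under stated assumptions, not the algorithm from the paper: the name `lazy_or` and the `(left, right)` tuple representation are hypothetical, and each input is assumed to be an antichain delivered in increasing order of left endpoint (so its right endpoints are also strictly increasing). The sketch merges the inputs by right endpoint with `heapq.merge`, which keeps one pending interval per input, and emits an interval only when its left endpoint exceeds that of the last emitted witness; any other interval necessarily contains an earlier witness and is therefore not minimal.

```python
import heapq
from typing import Iterable, Iterator, Optional, Tuple

# Hypothetical representation: a closed interval [left, right] of positions.
Interval = Tuple[int, int]


def lazy_or(*antichains: Iterable[Interval]) -> Iterator[Interval]:
    """Lazily yield the minimal intervals of the union of the inputs.

    Assumption: every input is an antichain sorted by left endpoint,
    hence also strictly increasing in right endpoint.
    """
    # Merge by right endpoint; on ties, the larger left endpoint (the
    # shorter interval) comes first. heapq.merge pulls items on demand
    # and holds a heap with one entry per input.
    merged = heapq.merge(*antichains, key=lambda iv: (iv[1], -iv[0]))

    last_left: Optional[int] = None
    for left, right in merged:
        # Every interval seen so far ends at or before `right`. If this
        # one starts at or before the last emitted witness, it contains
        # that witness (or duplicates it) and is skipped.
        if last_left is None or left > last_left:
            last_left = left
            yield (left, right)
```

For example, `list(lazy_or([(1, 5), (6, 9)], [(2, 3), (4, 10)]))` evaluates to `[(2, 3), (6, 9)]`: `(1, 5)` is dropped because it contains `(2, 3)`, and `(4, 10)` because it contains `(6, 9)`. Because the function is a generator over a heap-based merge, it never needs random access to its inputs, and each step costs time logarithmic in the number of inputs.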
