首页> 外文会议>International Conference on String Processing and Information Retrieval >Processing Conjunctive and Phrase Queries with the Set-Based Model
【24h】

Processing Conjunctive and Phrase Queries with the Set-Based Model

机译:处理基于集的模型的联合和短语查询

获取原文

摘要

The objective of this paper is to present an extension to the set-based model (SBM), which is an effective technique for computing term weights based on co-occurrence patterns, for processing conjunctive and phrase queries. The intuition that semantically related term occurrences often occur closer to each other is taken into consideration. The novelty is that all known approaches that account for co-occurrence patterns was initially designed for processing disjunctive (OR) queries, and our extension provides a simple, effective and efficient way to process conjunctive (AND) and phrase queries. This technique is time efficient and yet yields nice improvements in retrieval effectiveness. Experimental results show that our extension improves the average precision of the answer set for all collection evaluated, keeping computational cost small. For the TReC-8 collection, our extension led to a gain, relative to the standard vector space model, of 23.32% and 18.98% in average precision curves for conjunctive and phrase queries, respectively.
机译:本文的目的是向基于集合的模型(SBM)呈现一个扩展,这是基于共同发生模式计算术语权重的有效技术,用于处理联合和短语查询。考虑到语义相关术语发生的直觉通常会彼此接近。新颖性是,所有已知的方法都是用于共同发生模式的方法,最初是为处理析出(或)查询而设计的,我们的扩展提供了一种简单,有效和有效的方法来处理联合(和)和短语查询。这种技术是时间效率,但取得了良好的改善了检索效率。实验结果表明,我们的延长提高了所有收集所评估的答案集的平均精度,保持计算成本小。对于TREC-8集合,我们的扩展导致了相对于标准向量空间模型的增益,分别为23.32%和18.98%,分别为联合和短语查询的平均精度曲线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号