首页> 外文会议>ACM/IEEE on joint conference on digital libraries >An Exploration of Pattern-based Subtopic Modeling for Search Result Diversification
【24h】

An Exploration of Pattern-based Subtopic Modeling for Search Result Diversification

机译:基于模式的子特化模型探索搜索结果分流

获取原文

摘要

'Traditional information retrieval models do not necessarily provide users with optimal search experience because the top ranked documents may contain the same piece of relevant information, i.e., the same subtopic of a emery. The goal of search result diversification is to return search results that not only are relevant to the query but also cover different subtopics. Therefore, the subtopic modeling is an important research topic in search result diversification. In this paper, we propose a novel pattern based method to extract subtopics from retrieved documents. The basic idea is to explicitly model a query subtopic as a scmanticaJly meaningful text, unit in relevant documents. We apply a frequent pattern mining algorithm to efficiently extract these text units (patterns) from retrieved documents. We then model a query subtopic with a single pattern and rank subtopics based on their similarity with the query. These pattern based subtopics are then used to diversify search results.
机译:“传统信息检索模型不一定为用户提供最佳的搜索体验,因为顶部排名的文档可能包含相同的相关信息,即砂肌的相同细胞主题。搜索结果多样化的目标是返回搜索结果,不仅与查询相关,而且涵盖了不同的子主题。因此,副介质建模是搜索结果多元化中的重要研究主题。在本文中,我们提出了一种新的基于模式的方法,可以从检索到的文档中提取子主题。基本思想是将查询子主题模拟为相关文档中的SCMANTICAJLY有意义的文本。我们应用频繁的模式挖掘算法,以有效地从检索到的文档中提取这些文本单位(模式)。然后,我们根据与查询的相似性模拟一个具有单个图案的查询子主题和排名副主题。然后使用这些基于模式的副主管来分化搜索结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号