首页> 外文会议>International conference on advanced data mining and applications;ADMA 2011 >A Probabilistic Topic Model with Social Tags for Query Reformulation in Informational Search
【24h】

A Probabilistic Topic Model with Social Tags for Query Reformulation in Informational Search

机译:带有社会标签的概率主题模型用于信息搜索中的查询重构

获取原文

摘要

It is non-trivial to formulate a query that can precisely describe the goal of an informational search task. Query reformulation based on the query clustering approach addresses this issue by expanding a new query with related existing queries that were generated by other users. However, the query clustering approach is unable to cluster queries that are intrinsically related but neither contain common terms nor return common clicked Web page URLs. More importantly, it does not address the issue of ranking retrieved results according to their relevance to the search goal. In this paper, we present new query reformulation approach based on a novel probabilistic topic model to discovering the latent semantic relationships between the queries and the URLs. It can not only discover related queries that cannot be clustered by existing query clustering approaches but also rank retrieved results according to the similarities of probability distributions over the latent topics among the queries and the URLs. The results of our experiments have shown that this approach can significantly improve the performance of an informational search task in terms of search accuracy and search efficiency.
机译:制定一个可以精确描述信息搜索任务目标的查询并非易事。基于查询群集方法的查询重新编制通过使用其他用户生成的相关现有查询扩展新查询来解决此问题。但是,查询聚类方法无法对本质上相关但既不包含公共术语也不返回公共单击的Web页面URL的查询进行聚类。更重要的是,它没有解决根据检索结果与搜索目标的相关性对检索结果进行排名的问题。在本文中,我们提出了一种基于新的概率主题模型的新查询重构方法,以发现查询和URL之间的潜在语义关系。它不仅可以发现现有查询聚类方法无法聚类的相关查询,而且还可以根据查询和URL之间潜在主题上概率分布的相似性对检索结果进行排名。我们的实验结果表明,该方法可以在搜索准确性和搜索效率方面显着提高信息搜索任务的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号