Conference: ACM/IEEE-CS Joint Conference on Digital Libraries

Exploiting Real-Time Information Retrieval in the Microblogosphere

Abstract

Information-seeking behavior in microblogging environments such as Twitter differs from traditional web search. The best-performing microblog retrieval techniques attempt to exploit both the semantic and the temporal aspects of documents. In this paper, we present an effective approach, comprising query modeling, document modeling, and temporal re-ranking, to discover the most recent yet relevant information for a query. For query modeling, we introduce a two-stage pseudo-relevance feedback query expansion to overcome the severe vocabulary-mismatch problem of short-message retrieval in microblogs. For document modeling, we propose two ways to expand documents with the help of shortened URLs. For temporal re-ranking, we suggest several methods to evaluate the temporal aspects of documents. Experimental results demonstrate that our approach obtains significant improvements over baseline systems. Specifically, the proposed system gives 26.37% and 9.94% further increases in P@30 and MAP, respectively, over the best-performing result on highrel in the TREC'11 Real-Time Search Task.
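
The two measures quoted above, P@30 and MAP, follow standard definitions, and a minimal sketch may make them concrete. The code below is an illustration only, not the authors' evaluation code. The function names, the toy tweet identifiers, and the assumption that each ranked list contains no duplicate IDs are all hypothetical choices made for this example.

```python
from typing import Dict, List, Set


def precision_at_k(ranked: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of the top-k ranked documents that are relevant.

    The denominator is always k, even if fewer than k documents were returned.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for doc_id in ranked[:k] if doc_id in relevant)
    return hits / k


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Mean of the precision values at each rank where a relevant document
    appears, divided by the total number of relevant documents."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Average of per-query average precision over all queries in `runs`."""
    if not runs:
        return 0.0
    total = sum(
        average_precision(ranked, qrels.get(query_id, set()))
        for query_id, ranked in runs.items()
    )
    return total / len(runs)


# Hypothetical toy data: one query, four returned tweets, three relevant ones.
toy_run = {"q1": ["t1", "t2", "t3", "t4"]}
toy_qrels = {"q1": {"t1", "t3", "t9"}}

p_at_2 = precision_at_k(toy_run["q1"], toy_qrels["q1"], k=2)
map_score = mean_average_precision(toy_run, toy_qrels)
```

In this toy case only one of the top two tweets is relevant, so `p_at_2` is 0.5. Relevant tweets appear at ranks 1 and 3, and one relevant tweet is never retrieved, so the average precision is (1/1 + 2/3) / 3.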