Journal: 《计算机工程与设计》 (Computer Engineering and Design)

An Adaptive Microblog Message Indexing Schema with Threshold Optimization

Abstract

To improve the accuracy of microblog search, this paper proposes an adaptive microblog message indexing schema. First, the forwarding and reply relationships among messages are represented as trees, and these trees are encoded. Second, a content- and rank-based indexing schema is proposed that adaptively updates the in-memory index data as new messages arrive. Finally, to avoid scanning the entire microblog data set during retrieval, a Top-k threshold optimization method is proposed. Experiments on a Twitter data set show that the proposed schema reduces the time and space cost of indexing microblog messages, and that its performance remains relatively stable over time.
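
The abstract does not spell out how the Top-k threshold optimization works, so the following is only a generic sketch of threshold-based early termination, not the authors' algorithm. It assumes a hypothetical index layout in which each term maps to a posting list of (score, message id) pairs sorted by score in descending order, and in which a message carries the same query-independent rank score in every list. The function name `top_k` and the sample data are illustrative only.

```python
import heapq


def top_k(index, terms, k):
    """Return the k highest-scored messages matching any of `terms`.

    `index` is assumed (hypothetically) to map term -> list of
    (score, msg_id) pairs sorted by score in descending order.
    Also returns how many postings were actually examined.
    """
    heap = []      # min-heap holding the best k (score, msg_id) pairs so far
    seen = set()   # message ids already considered
    scanned = 0
    for term in terms:
        for score, msg_id in index.get(term, []):
            # Threshold test: once k results are held, heap[0] is the k-th
            # best score. The list is sorted, so no later posting can beat it.
            if len(heap) == k and score <= heap[0][0]:
                break
            scanned += 1
            if msg_id in seen:
                continue
            seen.add(msg_id)
            if len(heap) < k:
                heapq.heappush(heap, (score, msg_id))
            else:
                heapq.heapreplace(heap, (score, msg_id))
    return sorted(heap, reverse=True), scanned


if __name__ == "__main__":
    # Hypothetical posting lists, sorted by rank score (descending).
    sample_index = {
        "a": [(9, "m1"), (7, "m2"), (3, "m3"), (1, "m4")],
        "b": [(8, "m5"), (7, "m2"), (2, "m6")],
    }
    results, scanned = top_k(sample_index, ["a", "b"], 2)
    print(results, scanned)
```

In this sample, only 3 of the 7 postings are examined before both lists are cut off by the threshold, which illustrates how such a bound can avoid a full scan.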
