首页> 外文会议> >Adaptive Scoring Method based on Freshness for Fresh Information Retrieval
【24h】

Adaptive Scoring Method based on Freshness for Fresh Information Retrieval

机译:基于新鲜度的自适应信息新鲜度评分方法

获取原文

摘要

Fresh information is important for real business. In order to realize fresh information retrieval, we need not only to collect documents in a short time, but also to rank the results in the suitable order. However, conventional ranking methods are not suited for fresh information retrieval because they ignore temporal value of information. So, we have proposed the novel ranking method FTF IDF for fresh information retrieval. FTF IDF extends TF IDF by means of using FTF(fresh term frequency) instead of TF(term frequency). FTF differs from TF because FTF decreases as time goes. The speed of decreasing FTF is determined by the dumping factor. The dumping factor is sensitive against small changes of documents. So, we use a threshold to ignore such small changes. In some papers we published, we detect the optimal threshold manually. In this paper, we proposed an adaptive calculating method in order to detect threshold automatically. In this method, the optimal value is determined by iterating to test generated thresholds. In this paper, we describe our method and its evaluation.
机译:新鲜的信息对于实际业务很重要。为了实现新鲜的​​信息检索,我们不仅需要在短时间内收集文档,还需要按照适当的顺序对结果进行排名。但是,传统的排名方法不适合新鲜的信息检索,因为它们忽略了信息的时间价值。因此,我们提出了新颖的排序方法FTF IDF用于新鲜信息检索。 FTF IDF通过使用FTF(新术语频率)而不是TF(术语频率)来扩展TF IDF。 FTF与TF不同,因为FTF随着时间的流逝而减少。 FTF降低的速度由转储因子决定。倾销因素对文件的细微变化很敏感。因此,我们使用阈值来忽略这样的微小变化。在我们发表的一些论文中,我们手动检测最佳阈值。在本文中,我们提出了一种自适应计算方法以自动检测阈值。在这种方法中,最佳值是通过迭代确定生成的阈值来确定的。在本文中,我们描述了我们的方法及其评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号