首页> 外文会议>Annual international ACM SIGIR conference on Research and development in information retrieval >Robustness of adaptive filtering methods in a cross-benchmark evaluation
【24h】

Robustness of adaptive filtering methods in a cross-benchmark evaluation

机译:跨基准评估中自适应过滤方法的鲁棒性

获取原文

摘要

This paper reports a cross-benchmark evaluation of regularized logistic regression (LR) and incremental Rocchio for adaptive filtering. Using four corpora from the Topic Detection and Tracking (TDT) forum and the Text Retrieval Conferences (TREC) we evaluated these methods with non-stationary topics at various granularity levels, and measured performance with different utility settings. We found that LR performs strongly and robustly in optimizing T11SU (a TREC utility function) while Rocchio is better for optimizing Ctrk (the TDT tracking cost), a high-recall oriented objective function. Using systematic cross-corpus parameter optimization with both methods, we obtained the best results ever reported on TDT5, TREC10 and TREC11. Relevance feedback on a small portion (0.05~0.2%) of the TDT5 test documents yielded significant performance improvements, measuring up to a 54% reduction in Ctrk and a 20.9% increase in T11SU (with b=0.1), compared to the results of the top-performing system in TDT2004 without relevance feedback information.
机译:本文报告了正则逻辑回归(LR)和增量ROCCHIO的跨基准评估,用于自适应滤波。使用来自主题检测和跟踪(TDT)论坛的四个语料库和文本检索会议(TREC)我们在各种粒度级别的非固定主题评估了这些方法,并使用不同的实用程序设置进行测量性能。我们发现LR在优化T11SU(TREC实用程序函数)时执行强烈且强大,而Rocchio更好地优化CTRK(TDT跟踪成本),则是一个高召回的目标函数。使用两种方法的系统交叉语料库参数优化,我们获得了在TDT5,TREC10和TREC11上报告的最佳结果。关于小部分的相关反馈(0.05〜0.2%)的TDT5测试文献产生显着的性能改善,测量CTRK的减少54%,而T11SU的增加(B = 0.1)增加了20.9%(B = 0.1) TDT2004中的顶级执行系统,无相关反馈信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号