首页> 外文会议>International Symposium on Algorithms and Computation(ISAAC 2005); 20051219-21; Sanya(CN) >On the Complexity of Rocchio's Similarity-Based Relevance Feedback Algorithm
【24h】

On the Complexity of Rocchio's Similarity-Based Relevance Feedback Algorithm

机译:基于Rocchio相似度的相关反馈算法的复杂度

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we prove for the first time that the learning complexity of Rocchio's algorithm is O(d + d~2(log d+log n)) over the dis-cretized vector space {0,..., n — 1}~d, when the inner product similarity measure is used. The upper bound on the learning complexity for searching for documents represented by a monotone linear classifier (q, 0) over {0,... ,n-1}~d can be improved to O(d + 2k(n-1)(logd + log(n - 1))), where k is the number of nonzero components in q. An Ω((_2~d)log n) lower bound on the learning complexity is also obtained for Rocchio's algorithm over {0,..., n - 1}~d. In practice, Rocchio's algorithm often uses fixed query updating factors. When this is the case, the lower bound is strengthened to 2~(Ω(d)) over the binary vector space {0,1}~d. In general, if the query updating factors are bounded by O(n~c) for some constant c ≥ 0, an Ω(n~(d-1-c)/(n — 1)) lower bound is obtained over {0,... ,n— 1)~d.
机译:本文首次证明了Rocchio算法在离散向量空间{0,...,n_1上的学习复杂度为O(d + d〜2(log d + log n))。 }〜d,当使用内积相似性度量时。在{0,...,n-1}〜d上搜索单调线性分类器(q,0)表示的文档的学习复杂度上限可以提高为O(d + 2k(n-1) (logd + log(n-1))),其中k是q中非零分量的数量。 Rocchio算法在{0,...,n-1}〜d上也获得了学习复杂度的Ω((_ 2〜d)log n)下限。实际上,Rocchio的算法通常使用固定的查询更新因子。在这种情况下,下限在二进制矢量空间{0,1}〜d上增强为2〜(Ω(d))。通常,如果对于某些常数c≥0,查询更新因子由O(n〜c)限制,则在{0上获得Ω(n〜(d-1-c)/(n_1))下界,...,n— 1)〜d。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号