首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Active Ranking with Subset-wise Preferences
【24h】

Active Ranking with Subset-wise Preferences

机译:与子集合偏好有效排名

获取原文
           

摘要

We consider the problem of probably approximately correct (PAC) ranking $n$ items by adaptively eliciting subset-wise preference feedback. At each round, the learner chooses a subset of $k$ items and observes stochastic feedback indicating preference information of the winner (most preferred) item of the chosen subset drawn according to a Plackett-Luce (PL) subset choice model unknown a priori. The objective is to identify an $epsilon$-optimal ranking of the $n$ items with probability at least $1 - delta$. When the feedback in each subset round is a single Plackett-Luce-sampled item, we show $(epsilon, delta)$-PAC algorithms with a sample complexity of $Oleft(rac{n}{epsilon^2} ln rac{n}{delta} ight)$ rounds, which we establish as being order-optimal by exhibiting a matching sample complexity lower bound of $Omegaleft(rac{n}{epsilon^2} ln rac{n}{delta} ight)$—this shows that there is essentially no improvement possible from the pairwise comparisons setting ($k = 2$). When, however, it is possible to elicit top-$m$ ($leq k$) ranking feedback according to the PL model from each adaptively chosen subset of size $k$, we show that an $(epsilon, delta)$-PAC ranking sample complexity of $Oleft(rac{n}{m epsilon^2} ln rac{n}{delta} ight)$ is achievable with explicit algorithms, which represents an $m$-wise reduction in sample complexity compared to the pairwise case. This again turns out to be order-wise unimprovable across the class of symmetric ranking algorithms. Our algorithms rely on a novel {pivot trick} to maintain only $n$ itemwise score estimates, unlike $O(n^2)$ pairwise score estimates that has been used in prior work. We report results of numerical experiments that corroborate our findings.
机译:我们认为,通过自适应引出子集合偏好反馈,我们认为可能大致正确(PAC)排名N $项目。在每一轮时,学习者选择$ k $项目的子集,并观察到根据plackett-luce(pl)子集选择模型未知先验的所选子集的获胜者(最优选)项目的随机反馈。目标是识别$ epsilon $-Optimal排名,以概率至少为1美元 - delta $。当每个子集合的反馈是单个Plackett-Luce-Setpled项目时,我们为$( epsilon, delta)$ - pac算法,其具有$ o lex的样本复杂性( frac {n} { epsilon ^ 2} ln frac {n} { delta} 右),我们通过展示$ omega left的匹配样本复杂度( frac {n} { epsilon ^ 2} ln frac {n} { delta}右)$ - 这表明从成对比较设置($ k = 2 $)基本上没有改进。然而,当可以根据PL模型从每个自适应的尺寸为$ k $的PL模型引发顶级的排名反馈,我们展示了一个$( epsilon, delta )$ - PAC排名$ o left的样本复杂性( frac {n} {m epsilon ^ 2} ln frac {n} { delta} { delta}右)$可实现与显式算法代表$与成对案例相比,M $SWED样本复杂性。这再次结果在对称排名算法上无法令人满意的方式。我们的算法依赖于一个小说{Pivot Trick},仅维护$ N $ ItemWise评分估计,与已在事先工作中使用的$ o(n ^ 2)$配对分量估算不同。我们报告了数值实验的结果,证实了我们的研究结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号