首页> 外文会议>Annual conference on Neural Information Processing Systems >Online Rank Elicitation for Plackett-Luce: A Dueling Bandits Approach
【24h】

Online Rank Elicitation for Plackett-Luce: A Dueling Bandits Approach

机译:Plackett-Luce在线排名征集:决斗土匪方法

获取原文

摘要

We study the problem of online rank elicitation, assuming that rankings of a set of alternatives obey the Plackett-Luce distribution. Following the setting of the dueling bandits problem, the learner is allowed to query pairwise comparisons between alternatives, i.e., to sample pairwise marginals of the distribution in an online fashion. Using this information, the learner seeks to reliably predict the most probable ranking (or top-alternative). Our approach is based on constructing a surrogate probability distribution over rankings based on a sorting procedure, for which the pairwise marginals provably coincide with the marginals of the Plackett-Luce distribution. In addition to a formal performance and complexity analysis, we present first experimental studies.
机译:我们假设一组替代方案的排名服从Plackett-Luce分布,我们研究了在线排名引发的问题。在设置了强盗决斗问题之后,允许学习者查询替代方案之间的成对比较,即以在线方式对分布的成对边际进行采样。使用此信息,学习者将寻求可靠地预测最可能的排名(或最高替代)。我们的方法是基于基于排序过程的排名构建替代概率分布,对于该过程,成对边际可证明与Plackett-Luce分布的边际一致。除了正式的性能和复杂性分析之外,我们还提供了第一批实验研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号