Foundations and Trends in Information Retrieval

Bandit Algorithms in Information Retrieval



Abstract

Bandit algorithms, named after casino slot machines sometimes known as "one-armed bandits", fall into a broad category of stochastic scheduling problems. In the setting with multiple arms, each arm generates a reward with a given probability. The gambler's aim is to find the arm producing the highest payoff and then continue playing in order to accumulate the maximum reward possible. However, having only a limited number of plays, the gambler is faced with a dilemma: should he play the arm currently known to produce the highest reward or should he keep on trying other arms in the hope of finding a better paying one? This problem formulation is easily applicable to many real-life scenarios, hence in recent years there has been an increased interest in developing bandit algorithms for a range of applications. In information retrieval and recommender systems, bandit algorithms, which are simple to implement and do not require any training data, have been particularly popular in online personalization, online ranker evaluation and search engine optimization. This survey provides a brief overview of bandit algorithms designed to tackle specific issues in information retrieval and recommendation and, where applicable, it describes how they were applied in practice.
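The explore-versus-exploit dilemma described in the abstract can be made concrete with a small simulation. The sketch below, which assumes Bernoulli-reward arms and an epsilon-greedy strategy (one of the simplest bandit policies, not necessarily one discussed in the survey), plays a fixed budget of rounds and mostly pulls the arm with the highest estimated payoff while occasionally trying a random arm:

```python
import random

def epsilon_greedy(true_probs, epsilon=0.1, plays=10000, seed=0):
    """Simulate an epsilon-greedy gambler on Bernoulli arms.

    true_probs: hidden success probability of each arm (unknown to the player).
    epsilon:    fraction of plays spent exploring a random arm.
    Returns per-arm play counts and the total accumulated reward.
    """
    rng = random.Random(seed)
    n = len(true_probs)
    counts = [0] * n      # how often each arm was played
    values = [0.0] * n    # running mean reward estimate per arm
    total = 0.0
    for _ in range(plays):
        if rng.random() < epsilon:
            arm = rng.randrange(n)                            # explore
        else:
            arm = max(range(n), key=lambda a: values[a])      # exploit
        reward = 1.0 if rng.random() < true_probs[arm] else 0.0
        counts[arm] += 1
        # incremental mean update, so no reward history is stored
        values[arm] += (reward - values[arm]) / counts[arm]
        total += reward
    return counts, total

counts, total = epsilon_greedy([0.2, 0.5, 0.8])
```

After enough plays the estimates converge and the best-paying arm (here the one with probability 0.8) receives the bulk of the pulls, illustrating why such policies need no training data: they learn the payoffs online from their own plays.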


