Risk-Averse Stochastic Convex Bandit

Adrian Rivera Cardoso; Huan Xu

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Risk-Averse Stochastic Convex Bandit

【24h】

Risk-Averse Stochastic Convex Bandit

机译：规避风险的随机凸土匪

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivated by applications in clinical trials and finance, we study the problem of online convex optimization (with bandit feedback) where the decision maker is risk-averse. We provide two algorithms to solve this problem. The first one is a descent-type algorithm which is easy to implement. The second algorithm, which combines the ellipsoid method and a center point device, achieves (almost) optimal regret bounds with respect to the number of rounds. To the best of our knowledge this is the first attempt to address risk-aversion in the online convex bandit problem.

机译：受临床试验和金融应用的启发，我们研究了决策者具有规避风险的在线凸优化（带有强盗反馈）的问题。我们提供两种算法来解决此问题。第一个是易于实现的下降型算法。第二种算法结合了椭球方法和中心设备，相对于回合数实现了（几乎）最佳后悔边界。据我们所知，这是解决在线凸土匪问题中规避风险的首次尝试。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2018年第2009期|共9页
作者
Adrian Rivera Cardoso; Huan Xu;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Stochastic online optimization. Single-point and multi-point non-linear multi-armed bandits. Convex and strongly-convex case [J] . Gasnikov A. V., Krymova E. A., Lagunovskaya A. A., Automation and Remote Control . 2017,第2期

机译：随机在线优化。单点和多点非线性多武装匪徒。凸和强凸案
2. CONVERGENCE ANALYSIS OF SAMPLING-BASED DECOMPOSITION METHODS FOR RISK-AVERSE MULTISTAGE STOCHASTIC CONVEX PROGRAMS [J] . Guigues Vincent SIAM Journal on Optimization: A Publication of the Society for Industrial and Applied Mathematics . 2016,第4期

机译：风险规避多阶段随机凸方案基于抽样分解方法的收敛性分析
3. Stochastic convex optimization with bandit feedback [J] . Agarwal A., Foster D.P., Hsu D., SIAM Journal on Optimization: A Publication of the Society for Industrial and Applied Mathematics . 2013,第1期

机译：具有Bandit反馈的随机凸优化
4. Robust Risk-Averse Stochastic Multi-armed Bandits [C] . Odalric-Ambrym Maillard International conference on algorithmic learning theory . 2013

机译：健壮的规避风险的随机多武装土匪
5. Risk-Averse and Distributionally Robust Multistage Stochastic Optimization [D] . Duque Villarreal, Daniel. 2020

机译：风险厌恶和分布鲁棒多级随机优化
6. An Analysis of the Value of Information When Exploring Stochastic Discrete Multi-Armed Bandits [O] . Isaac J. Sledge, José C. Príncipe 2018

机译：探索随机离散多武装匪徒信息的价值分析
7. Robust Risk-averse Stochastic Multi-Armed Bandits [O] . Odalric-ambrym Maillard 2013

机译：强大的风险规避随机多臂土匪

Risk-Averse Stochastic Convex Bandit

摘要

著录项

相似文献

相关主题

期刊订阅