首页> 外文期刊>Automation and Remote Control >Stochastic online optimization. Single-point and multi-point non-linear multi-armed bandits. Convex and strongly-convex case
【24h】

Stochastic online optimization. Single-point and multi-point non-linear multi-armed bandits. Convex and strongly-convex case

机译:随机在线优化。 单点和多点非线性多武装匪徒。 凸和强凸案

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper the gradient-free modification of the mirror descent method for convex stochastic online optimization problems is proposed. The crucial assumption in the problem setting is that function realizations are observed with minor noises. The aim of this paper is to derive the convergence rate of the proposed methods and to determine a noise level which does not significantly affect the convergence rate.
机译:在本文中,提出了对凸的随机在线优化问题进行镜面落水方法的梯度修改。 问题设置中的至关重要假设是使用轻微噪声观察到函数实现。 本文的目的是推导出所提出的方法的收敛速度,并确定不会显着影响收敛速度的噪声水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号