【24h】

Certainty equivalence control with forcing: revisited

机译:强制性的等同性控制:重新探讨

获取原文

摘要

Summary form only given, as follows. Stochastic adaptiveoptimization problems are considered with the objective of minimizingthe rate of increase of the learning loss, i.e. the additional cost onehas to pay due to the inbuilt learning tasks in such problems. Inparticular, an examination is made of two problems: the multiarmedbandit problem, and the adaptive control of Markov chains. Previous workhas shown that the minimum rate of increase of the learning loss forthese problems is typically
机译:仅给出摘要表格,如下。考虑随机自适应优化问题,其目的是使学习损失的增加率最小化,即由于这种问题中内置的学习任务而不得不支付的额外费用。特别是,对两个问题进行了研究:多臂强盗问题和马尔可夫链的自适应控制。先前的工作表明,学习损失引起的问题的最小增长率通常是

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号