【24h】

Reinforcement learning algorithms as function optimizers

机译:强化学习算法作为功能优化器

获取原文

摘要

Any nonassociative reinforcement learning algorithm can be viewedas a method for performing function optimization through (possiblynoise-corrupted) sampling of function values. A description is given ofthe results of simulations in which the optima of several deterministicfunctions studied by D.H. Ackley (Ph.D. Diss., Carnegie-Mellon Univ.,1987) were sought using variants of REINFORCE algorithms. Resultsobtained for certain of these algorithms compare favorably to the bestresults found by Ackley
机译:可以将任何非关联性强化学习算法视为一种通过(可能会受到噪声破坏的)函数值采样执行函数优化的方法。给出了仿真结果的描述,其中使用REINFORCE算法的变体来寻求由D.H.Ackley(Ph.D.Diss。,Carnegie-Mellon Univ。,1987)研究的几种确定性函数的最优值。这些算法中的某些获得的结果与Ackley的最佳结果相吻合

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号