Reinforcement learning algorithms as function optimizers

机译：强化学习算法作为功能优化器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Any nonassociative reinforcement learning algorithm can be viewedas a method for performing function optimization through (possiblynoise-corrupted) sampling of function values. A description is given ofthe results of simulations in which the optima of several deterministicfunctions studied by D.H. Ackley (Ph.D. Diss., Carnegie-Mellon Univ.,1987) were sought using variants of REINFORCE algorithms. Resultsobtained for certain of these algorithms compare favorably to the bestresults found by Ackley

机译：可以将任何非关联性强化学习算法视为一种通过（可能会受到噪声破坏的）函数值采样执行函数优化的方法。给出了仿真结果的描述，其中使用REINFORCE算法的变体来寻求由D.H.Ackley（Ph.D.Diss。，Carnegie-Mellon Univ。，1987）研究的几种确定性函数的最优值。这些算法中的某些获得的结果与Ackley的最佳结果相吻合

著录项

来源
《2001 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2001. Proceedings, 2001》|2001年|p.89-95|共7页
会议地点 Washington, DC
作者
Williams, R.J.; Peng, J.;
展开▼
作者单位

Coll. of Comput. Sci. Northeastern Univ. Boston MA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-26 14:39:17

相似文献

外文文献
中文文献
专利

1. Hybrid algorithms based on combining reinforcement learning and metaheuristic methods to solve global optimization problems [J] . Seyyedabbasi Amir, Aliyev Royal, Kiani Farzad, Knowledge-Based Systems . 2021,第Jula8期

机译：基于组合钢筋学习的混合算法及弥补全球优化问题的综合算法
2. Rule-based reinforcement learning methodology to inform evolutionary algorithms for constrained optimization of engineering applications [J] . Radaideh Majdi I, Shirvan Koroush Knowledge-Based Systems . 2021,第Apra6期

机译：基于规则的强化学习方法，以通知进化算法，以了解工程应用的约束优化
3. Heuristic algorithms based on deep reinforcement learning for quadratic unconstrained binary optimization [J] . Chen Ming, Chen Yuning, Du Yonghao, Knowledge-Based Systems . 2020,第Nova5期

机译：基于深度加强学习的高强度学习的启发式算法
4. Reinforcement learning algorithms as function optimizers [C] . Williams, R.J., Peng, . 1989

机译：强化学习算法作为功能优化器
5. Genetic Algorithm as Function Optimizer in Reinforcement Learning and Sensor Odometry [D] . Sehgal, Adarsh 2019

机译：遗传算法在强化学习和传感器测程中的功能优化
6. Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care [O] . Arne Peine, Ahmed Hallawa, Johannes Bickenbach, 2021

机译：强化学习算法的开发与验证动态优化批判性教育中的机械通风
7. Maximum Power Point Tracking Based on Reinforcement Learning Using Evolutionary Optimization Algorithms [O] . Kostas Bavarinos, Anastasios Dounis, Panagiotis Kofinas 2021

机译：基于使用进化优化算法的强化学习的最大功率点跟踪

Reinforcement learning algorithms as function optimizers

摘要

著录项

相似文献

相关主题

期刊订阅