2011 IEEE International Conference on Systems, Man, and Cybernetics

Optimality principle broken by considering structured plant variation and relevant robust reinforcement learning


Abstract

In a general reinforcement learning problem, a plant (the state transition probabilities) is estimated, and a policy learned for the estimated plant is applied to the real plant. If the estimated plant differs from the real plant, the obtained policy may not work on the real plant. Therefore, a set of plants with variations is used for learning in order to obtain a policy that is robust against those variations. Bellman's principle of optimality does not hold when such a set of plants is used, so a typical dynamic programming algorithm cannot solve the problem. This study shows why the principle of optimality does not hold. It then formulates relaxed problems whose solutions can be obtained. Moreover, this study proposes methods to learn feasible policies efficiently. The effectiveness of the proposed approach is demonstrated by applying it to simple examples.
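The abstract names, but does not spell out, the robust objective: maximize the worst-case return over a set of plants rather than the return on a single estimated plant. The sketch below is a hypothetical NumPy illustration of that objective on a toy 3-state MDP; it is not the paper's algorithm, and the plant set, rewards, and all names are assumptions. Because the worst case is taken over one shared plant for the whole trajectory, the decision at each state can no longer be optimized independently by a per-state Bellman backup, which is the sense in which the principle of optimality breaks; the sketch therefore falls back on brute-force policy enumeration.

    # Minimal sketch (assumed, not the paper's method): robust policy search
    # over a finite set of plants. The toy MDP and all numbers are illustrative.
    import itertools
    import numpy as np

    n_states, n_actions, gamma = 3, 2, 0.9

    def make_plant(eps):
        """Transition tensor P[s, a, s'] for one structured variation eps.

        The same eps perturbs every state simultaneously, so the plant set
        is 'structured' rather than varying independently per state.
        """
        rng = np.random.default_rng(0)            # shared base dynamics
        base = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))
        return (1 - eps) * base + eps / n_states  # rows still sum to 1

    plants = [make_plant(e) for e in (0.0, 0.3, 0.6)]    # the plant set
    R = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])   # reward R[s, a]

    def policy_return(policy, P):
        """Exact discounted return of a deterministic policy on plant P."""
        Ppi = P[np.arange(n_states), policy]      # P(s' | s) under the policy
        Rpi = R[np.arange(n_states), policy]
        v = np.linalg.solve(np.eye(n_states) - gamma * Ppi, Rpi)
        return v.mean()                           # uniform initial state

    # Robust objective: max over policies of the min over plants. Enumeration
    # is used because a per-state Bellman backup cannot decouple the states
    # once the worst case is taken over a single shared plant.
    best_policy, best_worst = None, -np.inf
    for policy in itertools.product(range(n_actions), repeat=n_states):
        worst = min(policy_return(np.array(policy), P) for P in plants)
        if worst > best_worst:
            best_policy, best_worst = policy, worst

    print("robust policy:", best_policy, f"worst-case return: {best_worst:.3f}")

Brute-force enumeration of deterministic policies is feasible only for toy problems like this one, which is consistent with the abstract's motivation for relaxed problems whose solutions can be obtained efficiently.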

