A basic problem for intelligent systems is choosing which action to perform in a non-stationary environment. Because of the combinatorial complexity of the action space, an agent cannot consider every option available to it at every instant; it needs good policies that dictate the optimal action to perform in each situation. This paper proposes an algorithm, called UQ-learning, that better solves the action selection problem by combining reinforcement learning with a utility function. Reinforcement learning provides information about the environment, while the utility function is used to balance the exploration-exploitation dilemma. We evaluate our method on maze navigation tasks in a non-stationary environment. The results of simulated experiments show that the utility-based reinforcement learning approach is more effective and efficient than Q-learning and Recency-Based Exploration.
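The abstract does not spell out the form of the utility function, so the following is only a minimal sketch of one plausible reading: standard one-step Q-learning for learning from the environment, plus a hypothetical utility that adds a visit-count exploration bonus to the Q-value so that rarely tried actions remain attractive when the environment changes. The bonus form `beta / (1 + visits)` and the function names are assumptions for illustration, not the paper's actual method.

```python
def uq_select_action(q, counts, state, actions, beta=0.5):
    """Select the action maximizing utility = Q-value + exploration bonus.

    The bonus (a hypothetical choice: beta / (1 + visit count)) decays as
    a state-action pair is tried more often, balancing exploration and
    exploitation without a separate epsilon schedule.
    """
    def utility(a):
        return q.get((state, a), 0.0) + beta / (1.0 + counts.get((state, a), 0))

    best = max(actions, key=utility)
    counts[(state, best)] = counts.get((state, best), 0) + 1
    return best

def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update toward the bootstrapped target."""
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
```

In a maze navigation loop, `uq_select_action` would replace epsilon-greedy or recency-based selection, while `q_update` is applied after each transition exactly as in plain Q-learning.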