Smoothed Sarsa: Reinforcement learning for robot delivery tasks

机译：平滑的Sarsa：针对机器人交付任务的强化学习

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to find a sequence of actions in real time to accomplish these tasks efficiently. We introduce a novel reinforcement learning algorithm called Smoothed Sarsa that learns a good policy for these delivery tasks by delaying the backup reinforcement step until the uncertainty in the state estimate improves. The state space is modeled by a Dynamic Bayesian Network and updated using a Region-based Particle Filter. We take advantage of the fact that only discrete (topological) representations of entity locations are needed for decision-making, to make the tracking and decision making more efficient. Our experiments show that policy search leads to faster task completion times as well as higher total reward compared to a manually crafted policy. Smoothed Sarsa learns a policy orders of magnitude faster than previous policy search algorithms. We demonstrate our results on the Player/Stage simulator and on the Pioneer robot.

机译：我们在这项工作中的目标是为移动机器人做出高层决策。特别是，在给定优先级的对象交付任务队列的情况下，我们希望实时找到一系列操作来有效地完成这些任务。我们引入了一种新颖的强化学习算法，称为“平滑Sarsa”，它通过延迟备用强化步骤直到状态估计的不确定性得到改善，为这些交付任务学习了一个好的策略。状态空间由动态贝叶斯网络建模，并使用基于区域的粒子过滤器进行更新。我们利用以下事实：决策只需要实体位置的离散（拓扑）表示即可，从而使跟踪和决策更加有效。我们的实验表明，与手动制定的策略相比，策略搜索可导致更快的任务完成时间以及更高的总奖励。平滑Sarsa学习策略的速度比以前的策略搜索算法快几个数量级。我们在Player / Stage模拟器和Pioneer机器人上展示了我们的结果。

著录项

来源
《IEEE International Conference on Robotics and Automation;ICRA '09》|2009年|2125-2132|共8页
会议地点 Kobe(JP);Kobe(JP)
作者
Ramachandran Deepak; Gupta Rakesh;
展开▼
作者单位

Computer Science Dept., University of Illinois at Urbana-Champaign, IL-61801, USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Task scheduling, resource provisioning, and load balancing on scientific workflows using parallel SARSA reinforcement learning agents and genetic algorithm [J] . Asghari Ali, Sohrabi Mohammad Karim, Yaghmaee Farzin Journal of supercomputing . 2021,第3期

机译：使用并行Sarsa强化学习代理和遗传算法的科学工作流程任务调度，资源供应和负载平衡
2. Deep Reinforcement Learning with Sarsa and Q-Learning: A Hybrid Approach [J] . Zhi-xiong XU, Lei CAO, Xi-liang CHEN, IEICE transactions on information and systems . 2018,第9期

机译：使用Sarsa和Q学习进行深度强化学习：一种混合方法
3. Deep reinforcement learning with smooth policy update: Application to robotic cloth manipulation [J] . Tsurumine Yoshihisa, Cui Yunduan, Uchibe Eiji, Robotics and Autonomous Systems . 2019,第期

机译：深度加强学习，具有顺利的政策更新：在机器人布操控中的应用
4. Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks [C] . Deepak Ramachandran, Rakesh Gupta International Conference on Robotics and Automation . 2009

机译：Smoothed Sarsa：加固学习机器人交付任务
5. Deep Reinforcement Learning with Accelerated Reward Function Technique for Robotics Task Planning [D] . Shaikh, Shifa. 2021

机译：机器人任务规划加速奖励功能技术的深增强学习
6. Learning for a Robot: Deep Reinforcement Learning Imitation Learning Transfer Learning [O] . Jiang Hua, Liangcai Zeng, Gongfa Li, 2021

机译：学习机器人：深增强学习仿制学习转移学习
7. Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks [O] . Deepak Ramachandran, Rakesh Gupta 2013

机译：平滑的Sarsa：针对机器人交付任务的强化学习

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅