Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions

Bogert Kenneth; Doshi Prashant

首页> 外文期刊>Artificial intelligence >Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions

【24h】

Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions

机译：闭塞状态下的多机器人逆强化学习

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Inverse reinforcement learning (IRL), analogously to RL, refers to both the problem and associated methods by which an agent passively observing another agent's actions over time, seeks to learn the latter's reward function. The learning agent is typically called the learner while the observed agent is often an expert in popular applications such as in learning from demonstrations. Some of the assumptions that underlie current IRL methods are impractical for many robotic applications. Specifically, they assume that the learner has full observability of the expert as it performs its task; that the learner has full knowledge of the expert's dynamics; and that there is always only one expert agent in the environment. For example, these assumptions are particularly restrictive in our application scenario where a subject robot is tasked with penetrating a perimeter patrol by two other robots after observing them from a vantage point. In our instance of this problem, the learner can observe at most 10% of the patrol.

机译：逆强化学习（IRL）与RL类似，是指问题和相关方法，代理商通过这种方法被动地观察另一位代理商的行为，以寻求学习后者的奖励功能。学习代理通常称为学习者，而观察到的代理通常是流行应用程序中的专家，例如从演示中学习。对于许多机器人应用而言，当前基于IRL方法的一些假设是不切实际的。具体来说，他们假设学习者在执行任务时具有专家的完全可观察性；学习者完全了解专家的动态；并且环境中始终只有一个专家代理。例如，这些假设在我们的应用场景中特别受限制，在该应用场景中，目标机器人的任务是在从有利位置观察后再由其他两个机器人穿透外围巡逻。在我们这个问题的实例中，学习者最多可以观察到10％的巡逻。

著录项

来源
《Artificial intelligence》 |2018年第10期|46-73|共28页
作者
Bogert Kenneth; Doshi Prashant;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions [J] . Bogert Kenneth, Doshi Prashant Artificial intelligence . 2018,第Octa期

机译：闭塞状态下的多机器人逆强化学习
2. I2RL: online inverse reinforcement learning under occlusion [J] . Arora Saurabh, Doshi Prashant, Banerjee Bikramjit Autonomous agents and multi-agent systems . 2021,第1期

机译：I2RL：遮挡下的在线逆钢筋学习
3. Estimation of personal driving style via deep inverse reinforcement learning [J] . Daiko Kishikawa, Sachiyo Arai Artificial life and robotics . 2021,第3期

机译：深度逆钢筋学习估算个人驾驶风格
4. Multi-Robot Inverse Reinforcement Learning Under Occlusion with State Transition Estimation [C] . Kenneth Bogert, Prashant Doshi International Conference on Autonomous Agents and Multiagent Systems . 2015

机译：封闭下的多机器人反增强学习与状态转换估计
5. Using Reinforcement Learning in Multi-Robot SLAM. [D] . Dinnissen, Pierre. 2011

机译：在多机器人SLAM中使用强化学习。
6. Distributed Non-Communicating Multi-Robot Collision Avoidance via Map-Based Deep Reinforcement Learning [O] . Guangda Chen, Shunyi Yao, Jun Ma, 2020

机译：通过基于地图的深度增强学习分布式非传送多机器人碰撞避免
7. Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions [O] . Kenneth Bogert, Prashant Doshi 2018

机译：遮挡下的多机器人逆钢筋学习与状态转换估计

Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅