Trial and Error Experience Replay Based Deep Reinforcement Learning

Abstract

Sparse-reward environments are a common problem in reinforcement learning, and agents learn inefficiently in them with general-purpose methods. A new solution called trial-and-error experience replay is proposed. In this method, standard hindsight experience replay is combined with a curiosity-driven model, which improves sample efficiency even when extrinsic rewards are sparse. The method is demonstrated as an algorithm that controls a virtual robotic arm to reach a mobile goal. The analysis shows that the robotic arm can explore and learn from failure trajectories, so the agent mimics a human who fails repeatedly but still tries to learn something from the unexpected outcomes.
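
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea it names, not the authors' implementation. It assumes that hindsight relabeling uses the common "future" goal-sampling strategy, that the curiosity signal is the squared prediction error of a forward model, and that the state is simply the end-effector position. All function names, parameters, and constants (`sparse_reward`, `curiosity_bonus`, `relabel_with_hindsight`, `tol`, `scale`, `k`) are hypothetical.

```python
import random


def sparse_reward(achieved, goal, tol=0.05):
    """Extrinsic reward: 0 if the goal is reached within tol, otherwise -1."""
    dist = sum((a - g) ** 2 for a, g in zip(achieved, goal)) ** 0.5
    return 0.0 if dist <= tol else -1.0


def curiosity_bonus(predicted_next, actual_next, scale=0.1):
    """Intrinsic reward: scaled squared error of a forward-model prediction."""
    return scale * sum((p - a) ** 2 for p, a in zip(predicted_next, actual_next))


def relabel_with_hindsight(trajectory, forward_model, k=4, rng=random):
    """Build replay transitions from one (possibly failed) episode.

    trajectory: list of dicts with keys "state", "action", "next_state", "goal".
    forward_model: callable (state, action) -> predicted next state.
    Returns each original transition plus k hindsight copies per step.
    """
    replay = []
    for t, step in enumerate(trajectory):
        predicted = forward_model(step["state"], step["action"])
        bonus = curiosity_bonus(predicted, step["next_state"])

        # Original transition, scored against the goal actually pursued.
        reward = sparse_reward(step["next_state"], step["goal"]) + bonus
        replay.append({**step, "reward": reward})

        # Hindsight copies: pretend a state reached at this step or later
        # in the same episode had been the goal all along.
        for _ in range(k):
            future = trajectory[rng.randrange(t, len(trajectory))]
            new_goal = future["next_state"]
            reward = sparse_reward(step["next_state"], new_goal) + bonus
            replay.append({**step, "goal": new_goal, "reward": reward})
    return replay
```

Under these assumptions, an episode that never reaches its real goal still yields transitions with a non-negative reward, because some relabeled goals coincide with states the arm did reach, while the curiosity term adds a small bonus wherever the forward model predicts poorly.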

Bibliographic details

Source: IEEE International Conference on Smart Cloud, 2019, pp. 221-226 (6 pages)
Authors: Cheng Zhang; Liang Ma
Original format: PDF
Keywords: Manipulators; Neural networks; Task analysis; Machine learning; Trajectory; MIMICs

Similar literature

1. Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning [J]. Jakob Foerster, Nantas Nardelli, Gregory Farquhar. JMLR: Workshop and Conference Proceedings, 2017, No. 3.
2. Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge [J]. Lan Jiang, Hongyun Huang, Zuohua Ding. Acta Automatica Sinica (English Edition), 2020, No. 4.
3. Trial and Error Experience Replay Based Deep Reinforcement Learning [C]. Cheng Zhang, Liang Ma. IEEE International Conference on Smart Cloud, 2019.
4. Entropy-Based Experience Replay in Reinforcement Learning [D]. Dadvar, Mehdi. 2020.
5. Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay [O]. Evan Prianto, MyeongSeop Kim, Jae-Han Park. 2020.
6. Deep Reinforcement Learning With Quantum-Inspired Experience Replay [O]. Qing Wei, Hailan Ma, Chunlin Chen. 2021.
7. Enhanced Experience Replay for Deep Reinforcement Learning [R]. Doria, D., Dawson, B., Vindiola, M. 2015.
