机译:使用政策梯度优化和Q-Learning避免深增强学习碰撞
Mechatronics Engineering Department Faculty of Engineering Ain Shams University;
Mechatronics Engineering Department Faculty of Engineering Ain Shams University;
robot operating system; ROS; robotics; reinforcement learning; deep learning; deep Q-learning; trust region optimisation; proximal policy optimisation; PPO; trust region policy optimisation; TRPO; deep Q-learning network; DQN; Q-learning; autonomous; differential robot; obstacle avoidance; navigation; tensorflow;
机译:基于深Q学习的多艘船舶自动碰撞
机译:富裕环境中的避免避免,具有深入的加强学习
机译:基于深度加强的自主船舶的碰撞避免
机译:深度强化学习:从Q学习到深度Q学习
机译:关于游戏的深度加固学习:多重政策头部深度Q学的泛化
机译:通过基于地图的深度增强学习分布式非传送多机器人碰撞避免
机译:富裕环境中的避免避免,具有深入的加强学习