首页> 外文会议>International Symposium on Computing and Networking Workshops >FPGA Acceleration of ROS2-Based Reinforcement Learning Agents
【24h】

FPGA Acceleration of ROS2-Based Reinforcement Learning Agents

机译:基于ROS2的加强学习代理的FPGA加速

获取原文

摘要

Reinforcement learning agents have shown very good results in robot control and navigation tasks, allowing robots to learn how to interact with an environment appropriately in a model-free manner. However, real-world robot systems have strict latency, power, and cost constraints, thus requiring special hardware consideration for the demanding computations of neural networks. Furthermore, reinforcement learning networks should be able to interface efficiently with the various other robot components. To address these challenges, we propose a method for applying FPGA hardware accelerators to robotics reinforcement learning agents at the inference stage and seamlessly integrating the FPGA hardware module to the robot system by automatically wrapping it in a Robot Operating System 2 (ROS2) node. The proposed system is evaluated in three OpenAI gym control environments: Cartpole-v1, Acrobot-v1, and Pendulum-v0. In the evaluation, both quantized and non-quantized reinforcement learning neural networks are used, and the proposed FPGA system is observed to provide up to a 3.69x speed up and up to 52.7x better performance per watt when compared to an agent running on a ROS2 node on a modern CPU.
机译:强化学习代理在机器人控制和导航任务中表现出非常好的结果,允许机器人学习如何以无意义的方式适当地与环境进行互动。然而,现实世界机器人系统具有严格的延迟,功率和成本约束,因此需要对神经网络的苛刻计算进行特殊的硬件考虑。此外,加强学习网络应该能够用各种其他机器人组件有效地接口。为了解决这些挑战,我们提出了一种将FPGA硬件加速器应用于推理阶段的机器人加固学习代理的方法,并通过在机器人操作系统2(ROS2)节点中自动包装它来无缝地将FPGA硬件模块与机器人系统集成到机器人系统。所提出的系统在三个Openai健身房控制环境中进行评估:Cartpole-V1,Acrobot-V1和Pendulum-V0。在评估中,使用量化和非量化的增强学习神经网络,并且拟议的FPGA系统在与运行上运行的代理相比,每瓦的增速高达3.69倍的速度,最高可达52.7倍。 ROS2节点在现代CPU上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号