FPGA Acceleration of ROS2-Based Reinforcement Learning Agents

机译：基于ROS2的加强学习代理的FPGA加速

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reinforcement learning agents have shown very good results in robot control and navigation tasks, allowing robots to learn how to interact with an environment appropriately in a model-free manner. However, real-world robot systems have strict latency, power, and cost constraints, thus requiring special hardware consideration for the demanding computations of neural networks. Furthermore, reinforcement learning networks should be able to interface efficiently with the various other robot components. To address these challenges, we propose a method for applying FPGA hardware accelerators to robotics reinforcement learning agents at the inference stage and seamlessly integrating the FPGA hardware module to the robot system by automatically wrapping it in a Robot Operating System 2 (ROS2) node. The proposed system is evaluated in three OpenAI gym control environments: Cartpole-v1, Acrobot-v1, and Pendulum-v0. In the evaluation, both quantized and non-quantized reinforcement learning neural networks are used, and the proposed FPGA system is observed to provide up to a 3.69x speed up and up to 52.7x better performance per watt when compared to an agent running on a ROS2 node on a modern CPU.

机译：强化学习代理在机器人控制和导航任务中表现出非常好的结果，允许机器人学习如何以无意义的方式适当地与环境进行互动。然而，现实世界机器人系统具有严格的延迟，功率和成本约束，因此需要对神经网络的苛刻计算进行特殊的硬件考虑。此外，加强学习网络应该能够用各种其他机器人组件有效地接口。为了解决这些挑战，我们提出了一种将FPGA硬件加速器应用于推理阶段的机器人加固学习代理的方法，并通过在机器人操作系统2（ROS2）节点中自动包装它来无缝地将FPGA硬件模块与机器人系统集成到机器人系统。所提出的系统在三个Openai健身房控制环境中进行评估：Cartpole-V1，Acrobot-V1和Pendulum-V0。在评估中，使用量化和非量化的增强学习神经网络，并且拟议的FPGA系统在与运行上运行的代理相比，每瓦的增速高达3.69倍的速度，最高可达52.7倍。 ROS2节点在现代CPU上。

著录项

来源
《International Symposium on Computing and Networking Workshops》|2020年|106-112|共7页
会议地点
作者
Daniel Pinheiro Leal; Midori Sugaya; Hideharu Amano; Takeshi Ohkawa;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Neural networks; Robot control; Reinforcement learning; Hardware; Wrapping; Task analysis; Field programmable gate arrays;

机译：神经网络;机器人控制;强化学习;硬件;包装;任务分析;现场可编程门阵列;

相似文献

外文文献
中文文献
专利

1. Neural Network Based Reinforcement Learning Acceleration on FPGA Platforms [J] . Jiang Su, Jianxiong Liu, David B. Thomas, Computer architecture news . 2016,第4期

机译：在FPGA平台上基于神经网络的强化学习加速
2. A Reinforcement Learning-Based Markov-Decision Process (MDP) Implementation for SRAM FPGAs [J] . Ruan Aiwu, Shi Aokai, Qin Liang, Circuits and Systems II: Express Briefs, IEEE Transactions on . 2020,第10期

机译：SRAM FPGA的基于加强学习的Markov决策过程（MDP）实施
3. Reinforcement learning technique using agent state occurrence frequency with analysis of knowledge sharing on the agent’s learning process in multiagent environments [J] . H. S. Al-Dayaa, D. B. Megherbi The Journal of Supercomputing . 2012,第1期

机译：使用座席状态发生频率并分析多座席环境中座席学习过程中的知识共享的强化学习技术
4. Acceleration of Multi-agent Simulation on FPGAs [C] . Cui Lintao, Chen Jing, Hu Yu, 21st International Conference on Field Programmable Logic and Applications . 2011

机译：在FPGA上加速多智能体仿真
5. Acceleration of multi-agent simulation on FPGAs. [D] . Cui, Lintao. 2012

机译：在FPGA上加速多主体仿真。
6. Co-Evolution of Predator-Prey Ecosystems by Reinforcement Learning Agents [O] . Jeongho Park, Juwon Lee, Taehwan Kim, 2021

机译：加固学习代理捕食者 - 猎物生态系统的共同演变
7. Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems [O] . Alexis Asseman, Nicolas Antoine, Ahmet S. Ozcan 2021

机译：加速深度神经发展在分布式FPGA中加强学习问题

FPGA Acceleration of ROS2-Based Reinforcement Learning Agents

摘要

著录项

相似文献

相关主题

期刊订阅