首页> 外文会议>Chinese Automation Congress >Heuristic Gait Learning of Quadruped Robot Based on Deep Deterministic Policy Gradient Algorithm

【24h】

Heuristic Gait Learning of Quadruped Robot Based on Deep Deterministic Policy Gradient Algorithm

机译：基于深度确定性政策梯度算法的四足机器人启发式步态学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The gait control of the quadruped robot has always been a hot topic in the field of robot research. At present, the traditional control methods have many limitations such as low intelligence and poor autonomy. With the development of artificial intelligence technology, the application of reinforcement learning to the quadruped robot autonomous learning strategy provides a promising solution. Deep deterministic policy gradient (DDPG) algorithm has achieved good performance in continuous control tasks, but such value-based reinforcement learning algorithms have the problem of too high epoch estimates when performing function approximation, then reached a bad strategy actually. In order to solve the above-mentioned problem, this paper proposed a heuristic gait learning method for quadruped robot based on DDPG, inspired by the Double Q-learning algorithm, two independent critics were used to select the smaller value to update the parameters. The Open AI Gym platform was used for experimental verification, which proved that the proposed improved DDPG algorithm had better performance.

机译：四足机器人的步态控制一直是机器人研究领域的热门话题。目前，传统的控制方法具有许多局限性，例如低智力和自主性差。随着人工智能技术的发展，加强学习在四足机器人自主学习策略中的应用提供了有希望的解决方案。深度确定性政策梯度（DDPG）算法在连续控制任务中取得了良好的性能，但是这种基于价值的增强学习算法在执行函数近似时具有太高的时期估计的问题，然后实际达到了不良策略。为了解决上述问题，本文提出了一种基于DDPG的四足机器人的启发式步态学习方法，受到双Q学习算法的启发，使用了两个独立的批评者来选择更新参数的较小值。 Open AI Gym平台用于实验验证，这证明了提出的改进的DDPG算法具有更好的性能。

著录项

来源
《Chinese Automation Congress》|2020年|1046-1049|共4页
会议地点
作者
Mingchao Wang; Xiaogang Ruan; Xiaoqing Zhu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Reinforcement learning; Training; Legged locomotion; Approximation algorithms; Classification algorithms; Task analysis; Function approximation;

机译：加强学习;训练;腿运动;近似算法;分类算法;任务分析;函数近似;

相似文献

外文文献
中文文献
专利

1. Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm [J] . Junta Wu, Huiyun Li Mathematical Problems in Engineering: Theory, Methods and Applications . 2020,第1期

机译：具有多种深度确定性政策梯度算法的深度集成钢筋学习
2. Policy gradient learning for quadruped soccer robots [J] . A. Cherubini, F. Giannone, L. Iocchi, Robotics and Autonomous Systems . 2010,第7期

机译：四足足球机器人的策略梯度学习
3. A gait transition algorithm based on hybrid walking gait for a quadruped walking robot [J] . Lee Yoon Haeng, Duc Trong Tran, Hyun Jae-ho, Intelligent Service Robotics . 2015,第4期

机译：一种基于混合行走机器人混合行走步态的步态过渡算法
4. Collective Behavior for Cooperative Transport Task in a Robotic Swarm based on Deep Deterministic Policy Gradient Algorithms [C] . Hanjun Jiang, Toshiyuki Yasuda, Kazuhiro Ohkura システム制御情報学会研究発表講演会 . 2017

机译：基于深度确定性政策梯度算法的机器人群合作交通任务的集体行为
5. Quadrupedal Emotive Gaits in Robotics [D] . Hainsworth, Travis Brad. 2017

机译：机器人中的四桥情绪高速公路
6. Gait Optimization Method for Humanoid Robots Based on Parallel Comprehensive Learning Particle Swarm Optimizer Algorithm [O] . Chongben Tao, Jie Xue, Zufeng Zhang, 2020

机译：基于并行综合学习粒子群优化化算法的人形机器人步态优化方法
7. Ensemble Bootstrapped Deep Deterministic Policy Gradient for Vision-Based Robotic Grasping [O] . Weiwei Liu, Linpeng Peng, Junjie Cao, 2021

机译：基于视觉的机器人掌握的合奏自动启动深度确定性政策梯度

Heuristic Gait Learning of Quadruped Robot Based on Deep Deterministic Policy Gradient Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅