Controlling bicycle using deep deterministic policy gradient algorithm

机译：使用深度确定性策略梯度算法控制自行车

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Controlling a bicycle without human interaction is still a challenge for researchers. Most of the studies on this topic focus on the physical area of bicycle or designing controllers based on automatic control knowledge such as feedback controller, LQR controller. This study focuses on applying a state-of-the-art deep reinforcement learning algorithm called Deep Deterministic Policy Gradient to control the bicycle. The bicycle can use the learned controller (agent) to keep balancing or reach a specified goal.

机译：在没有人为干预的情况下控制自行车仍然是研究人员的挑战。关于此主题的大多数研究都集中在自行车的物理区域或基于自动控制知识的控制器的设计上，例如反馈控制器，LQR控制器。这项研究的重点是应用称为“深度确定性策略梯度”的最新深度强化学习算法来控制自行车。自行车可以使用学习到的控制器（代理）保持平衡或达到指定目标。

著录项

来源
《International Conference on Ubiquitous Robots and Ambient Intelligence》|2017年|413-417|共5页
会议地点
作者
Le Pham Tuyen; TaeChoong Chung;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Bicycles; Trajectory; Robots; Learning (artificial intelligence); Aerospace electronics; Process control; Computer architecture;

机译：自行车;轨迹;机器人;学习（人工智能）;航空电子;过程控制;计算机体系结构;

相似文献

外文文献
中文文献
专利

1. Adaptive neuro-fuzzy PID controller based on twin delayed deep deterministic policy gradient algorithm [J] . Shi Qian, Lam Hak-Keung, Xuan Chengbin, Neurocomputing . 2020,第Auga18期

机译：基于双延迟深度确定性政策梯度算法的自适应神经模糊PID控制器
2. Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm [J] . Junta Wu, Huiyun Li Mathematical Problems in Engineering: Theory, Methods and Applications . 2020,第1期

机译：具有多种深度确定性政策梯度算法的深度集成钢筋学习
3. AUV path following controlled by modified Deep Deterministic Policy Gradient [J] . Sun Yushan, Ran Xiangrui, Zhang Guocheng, Ocean Engineering . 2020,第Auga15期

机译：由修改的深度确定性政策梯度控制后的AUV路径
4. Controlling bicycle using deep deterministic policy gradient algorithm [C] . Le Pham Tuyen, TaeChoong Chung International Conference on Ubiquitous Robots and Ambient Intelligence . 2017

机译：使用深度确定性政策梯度算法控制自行车
5. Comparison of gradient-restoration algorithms for optimal control problems with nondifferential constraints and general boundary conditions [D] . Ko, Shuh-Hung 1994

机译：具有非微分约束和一般边界条件的最优控制问题的梯度恢复算法比较
6. Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking [O] . Chujun Liu, Andrew G. Lonsberry, Mark J. Nandor, 2019

机译：控制动态双足行走的深度确定性策略梯度的实现
7. Deep Deterministic Policy Gradient Algorithm Based on Convolutional Block Attention for Autonomous Driving [O] . Yanliang Jin, Qianhong Liu, Liquan Shen, 2021

机译：基于自动驾驶卷积块注意力的深度确定性政策梯度算法

Controlling bicycle using deep deterministic policy gradient algorithm

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅