Adaptive neuro-fuzzy PID controller based on twin delayed deep deterministic policy gradient algorithm

Shi Qian; Lam Hak-Keung; Xuan Chengbin; Chen Ming

首页> 外文期刊>Neurocomputing >Adaptive neuro-fuzzy PID controller based on twin delayed deep deterministic policy gradient algorithm

【24h】

Adaptive neuro-fuzzy PID controller based on twin delayed deep deterministic policy gradient algorithm

机译：基于双延迟深度确定性政策梯度算法的自适应神经模糊PID控制器

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents an adaptive neuro-fuzzy PID controller based on twin delayed deep deterministic policy gradient (TD3) algorithm for nonlinear systems. In this approach, the observation of the environment is embedded with information of a multiple input single output (MISO) fuzzy inference system (FIS) and have a specially defined fuzzy PID controller in neural network (NN) formation acting as the actor in the TD3 algorithm, which achieves automatic tuning of gains of fuzzy PID controller. From the control perspective, the controller combines the merits of both FIS and PID controller and utilizes reinforcement learning algorithm for optimizing parameters. From the reinforcement learning point of view, embedding the prior knowledge into the fuzzy PID controller incorporated in the actor network helps reduce the learning difficulty in the training process. The proposed method was tested on the cart-pole system in simulation environment with comparison of a linear PID controller, which demonstrates the robustness and generalization of the proposed approach. (C) 2020 Elsevier B.V. All rights reserved.

机译：本文介绍了基于双延迟深层确定性政策梯度（TD3）算法的自适应神经模糊PID控制器。在这种方法中，对环境的观察嵌入了多输入单输出（MISO）模糊推理系统（FIS）的信息，并且在神经网络（NN）形成中具有特殊定义的模糊PID控制器，其作为TD3中的actor算法，实现了模糊PID控制器的自动调整。从控制角度来看，控制器结合了FIS和PID控制器的优点，并利用了加强学习算法来优化参数。从加强学习的角度来看，将先验知识嵌入到参与者网络中的模糊PID控制器中有助于降低培训过程中的学习难度。在仿真环境中测试了所提出的方法，与线性PID控制器进行了比较，这证明了所提出的方法的鲁棒性和泛化。（c）2020 Elsevier B.v.保留所有权利。

著录项

来源
《Neurocomputing》 |2020年第18期|183-194|共12页
作者
Shi Qian; Lam Hak-Keung; Xuan Chengbin; Chen Ming;
展开▼
作者单位

Kings Coll London Dept Engn London WC2R 2LS England;

Kings Coll London Dept Engn London WC2R 2LS England;

Kings Coll London Dept Engn London WC2R 2LS England;

Kings Coll London Dept Engn London WC2R 2LS England;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Twin delayed deep deterministic policy gradient algorithm; Reinforcement learning; Fuzzy PID controller; Cart-pole system;

机译：双延迟深度确定性政策梯度算法;加固学习;模糊PID控制器;车杆系统;

相似文献

外文文献
中文文献
专利

1. An Intelligent Energy Management Strategy for Hybrid Vehicle with irrational actions using Twin Delayed Deep Deterministic Policy Gradient [J] . Zemin Eitan Liu, Quan Zhou, Yanfei Li, IFAC PapersOnLine . 2021,第10期

机译：使用双胞胎延迟的非理性行为的混合动力车辆智能能量管理策略深度确定性政策梯度
2. Agent-Based Modeling in Electricity Market Using Deep Deterministic Policy Gradient Algorithm [J] . Yanchang Liang, Chunlin Guo, Zhaohao Ding, Power Systems, IEEE Transactions on . 2020,第6期

机译：基于代理的电力市场建模使用深度确定性政策梯度算法
3. Consortium blockchains-based deep deterministic policy gradient algorithm for optimal electricity trading among households [J] . Yang Chen 中国邮电高校学报（英文版） . 2018,第006期

机译：基于联盟区块链的深度确定性策略梯度算法，实现家庭间最佳电力交易
4. A Novel Vehicle Platoon Following Controller Based on Deep Deterministic Policy Gradient Algorithms [C] . Guan Wang, Jianming Hu, Yusen Huo, COTA international conference of transportation professionals . 2018

机译：基于深度确定性策略梯度算法的新型汽车排跟随控制器
5. Adaptive neuro-fuzzy controller for passive nonlinear systems. [D] . Kumbla, Kishan Kumar. 1997

机译：被动非线性系统的自适应神经模糊控制器。
6. Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking [O] . Chujun Liu, Andrew G. Lonsberry, Mark J. Nandor, 2019

机译：控制动态双足行走的深度确定性策略梯度的实现
7. Deep Deterministic Policy Gradient Algorithm Based on Convolutional Block Attention for Autonomous Driving [O] . Yanliang Jin, Qianhong Liu, Liquan Shen, 2021

机译：基于自动驾驶卷积块注意力的深度确定性政策梯度算法

Adaptive neuro-fuzzy PID controller based on twin delayed deep deterministic policy gradient algorithm

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅