...
IFAC PapersOnLine

Deep Reinforcement Learning and Randomized Blending for Control under Novel Disturbances



Abstract

Enabling autonomous vehicles to maneuver in novel scenarios is a key unsolved problem. A well-known approach, Weighted Multiple Model Adaptive Control (WMMAC), uses a set of pre-tuned controllers and combines their control actions using a weight vector. Although WMMAC improves on traditional switched control by smoothing control oscillations, it depends on accurate fault isolation and cannot deal with unknown disturbances. A recent approach avoids state estimation by randomly assigning the controller weighting vector; however, this approach samples control weights from a uniform distribution, which is sub-optimal compared to state-estimation methods. In this article, we propose a framework that uses deep reinforcement learning (DRL) to learn weighted control distributions that optimize the performance of the randomized approach for both known and unknown disturbances. We show that RL-based randomized blending dominates pure randomized blending, a switched FDI-based (fault detection and isolation) architecture, and pre-tuned controllers on a quadcopter trajectory optimisation task in which we penalise deviations in both position and attitude.
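To make the blending idea concrete, the sketch below shows WMMAC-style weighted combination of pre-tuned controller actions, with weights drawn either uniformly at random or from a distribution shaped by a learned policy. The Dirichlet parameterization, function names, and all numerical values are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def blend_controls(controller_outputs, weights):
    """WMMAC-style blend: the applied action is a convex combination
    of the per-controller actions, weighted by the weight vector.
    controller_outputs: (n_controllers, action_dim); weights sum to 1."""
    return np.asarray(weights, dtype=float) @ np.asarray(controller_outputs, dtype=float)

def sample_uniform_weights(n_controllers, rng):
    """Pure randomized blending baseline: sample the weight vector
    uniformly from the probability simplex via a symmetric Dirichlet(1)."""
    return rng.dirichlet(np.ones(n_controllers))

def sample_learned_weights(alpha, rng):
    """Hypothetical DRL-shaped blending: a policy outputs Dirichlet
    concentration parameters `alpha`, biasing the sampled weights toward
    controllers that perform well under the current disturbance."""
    return rng.dirichlet(np.asarray(alpha, dtype=float))

rng = np.random.default_rng(0)
# Three pre-tuned controllers producing 2-D actions (toy values).
outputs = np.array([[1.0, 0.0],
                    [0.0, 1.0],
                    [0.5, 0.5]])
w = sample_uniform_weights(3, rng)       # uniform baseline
u = blend_controls(outputs, w)           # blended action actually applied
```

Because the weights lie on the simplex, the blended action always stays in the convex hull of the individual controllers' actions, which is what gives weighted blending its smoothness advantage over hard switching.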

