IEEE International Conference on Machine Learning and Applications

Multidisciplinary Optimization in Decentralized Reinforcement Learning



Abstract

Multidisciplinary Optimization (MDO) is one of the most popular techniques in aerospace engineering, where systems are complex and draw on knowledge from multiple fields. However, to the best of our knowledge, MDO has not been widely applied in decentralized reinforcement learning (RL) due to the `unknown' nature of RL problems. In this work, we apply MDO to decentralized RL. In our MDO design, each learning agent uses system identification to closely approximate the environment and thereby tackle the `unknown' nature of RL. The agents then apply MDO principles to compute the control solution using Monte Carlo and Markov Decision Process techniques. We examine two MDO design options suitable for multi-agent learning: the multidisciplinary feasible (MDF) option and the individual discipline feasible (IDF) option. Our results show that the IDF option successfully learns how to control the system, and that the MDO approach outperforms both fully decentralized and fully centralized approaches.
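The two-stage idea in the abstract — identify the unknown environment first, then solve the resulting Markov Decision Process for a control policy — can be illustrated with a minimal single-agent sketch. This is not the paper's MDF/IDF formulation; the scalar linear dynamics, the `step` function, and all parameter values below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical unknown environment: x' = a*x + b*u + noise.
# The true a, b are hidden from the learning agent.
A_TRUE, B_TRUE = 0.9, 0.5

def step(x, u):
    return A_TRUE * x + B_TRUE * u + 0.01 * rng.standard_normal()

# --- Stage 1: system identification ---
# Excite the system with random inputs and fit (a_hat, b_hat) by least squares,
# approximating the `unknown' environment with an explicit model.
X, U, Xn = [], [], []
x = 1.0
for _ in range(200):
    u = rng.uniform(-1.0, 1.0)
    xn = step(x, u)
    X.append(x); U.append(u); Xn.append(xn)
    x = xn
Phi = np.column_stack([X, U])
a_hat, b_hat = np.linalg.lstsq(Phi, np.array(Xn), rcond=None)[0]

# --- Stage 2: solve the identified MDP by value iteration ---
# Discretize the state and action spaces; transitions use the identified
# model (a_hat, b_hat), not the true environment.
states = np.linspace(-2.0, 2.0, 41)
actions = np.linspace(-1.0, 1.0, 9)
V = np.zeros_like(states)
gamma = 0.95
for _ in range(100):
    Q = np.empty((len(states), len(actions)))
    for i, s in enumerate(states):
        for j, u in enumerate(actions):
            sn = a_hat * s + b_hat * u            # predicted next state
            cost = s * s + 0.1 * u * u            # quadratic regulation cost
            Q[i, j] = cost + gamma * np.interp(sn, states, V)
    V = Q.min(axis=1)
policy = actions[np.argmin(Q, axis=1)]            # greedy policy on the grid

# Roll out the learned policy in the true environment.
x = 1.5
for _ in range(50):
    u = np.interp(x, states, policy)
    x = step(x, u)
```

In the paper's decentralized setting, each agent would run such an identification-then-optimization loop on its own subsystem, with the MDF or IDF option governing how the per-agent solutions are coordinated into a system-wide one.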
