Data-driven dynamic multi-objective optimal control: A Hamiltonian-inequality driven satisficing reinforcement learning approach

Majid Mazouchi; Yongliang Yang; Hamidreza Modares

首页> 外文期刊>IFAC PapersOnLine >Data-driven dynamic multi-objective optimal control: A Hamiltonian-inequality driven satisficing reinforcement learning approach

【24h】

Data-driven dynamic multi-objective optimal control: A Hamiltonian-inequality driven satisficing reinforcement learning approach

机译：数据驱动的动态多目标最优控制：汉密尔顿 - 不等式驱动符合增强型研究方法

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents an iterative data-driven algorithm for solving dynamic multi-objective (MO) optimal control problems arising in control of nonlinear continuous-time systems with multiple objectives. It is first shown that the Hamiltonian function corresponding to each objective can serve as a comparison function to compare the performance of admissible policies. Relaxed Hamilton-Jacobi-bellman (HJB) equations in terms of HJB inequalities are then solved in a dynamic constrained MO framework to find Pareto-optimal solutions. Relation to satisficing (good enough) decision-making framework is shown. A Sum-of-Square (SOS)-based iterative algorithm is developed to solve the formulated MO optimization with HJB inequalities. To obviate the requirement of complete knowledge of the system dynamics, a data-driven satisficing reinforcement learning approach is proposed to solve the SOS optimization problem in real-time using only the information of the system trajectories measured during a time interval without having full knowledge of the system dynamics. Finally, a simulation example is provided to show the effectiveness of the proposed algorithm.

机译：本文介绍了一种迭代数据驱动算法，用于求解具有多个目标的非线性连续时间系统的控制中出现的动态多目标（MO）最优控制问题。首先表明，对应于每个目标的哈密顿函数可以用作比较可允许策略性能的比较函数。在动态约束的MO框架中，可以解决在HJB不平等方面的哈米尔顿 - 雅各 - 贝尔曼（HJB）方程，以找到帕累托最优解决方案。与满足（足够好）决策框架的关系显示。基于广场（SOS）的迭代算法，以解决与HJB不等式的配制MO优化。为了避免系统动态的完整知识的要求，提出了一种数据驱动的满足增强学习方法，以实时解决SOS优化问题，仅使用时间间隔期间测量的系统轨迹的信息而没有完全了解系统动态。最后，提供了模拟示例以显示所提出的算法的有效性。

著录项

来源
《IFAC PapersOnLine》 |2020年第2期|共6页
作者
Majid Mazouchi; Yongliang Yang; Hamidreza Modares;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
Multi-objective optimizationPareto optimalityReinforcement learningSum-of-Square theory;

机译：多目标优化帕雷托最优性的战略学习 - 方形理论;

相似文献

外文文献
中文文献
专利

1. Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems With Unknown Dynamics Using Reinforcement Learning Method [J] . Huaguang Zhang, He Jiang, Yanhong Luo, Industrial Electronics, IEEE Transactions on . 2017,第5期

机译：使用强化学习方法的具有未知动力学的离散多智能体系统的数据驱动最优共识控制
2. Data-driven Optimal Control Strategy for Virtual Synchronous Generator via Deep Reinforcement Learning Approach [J] . Yushuai Li, Wei Gao, Weihang Yan, 现代电力系统与清洁能源学报(英文) . 2021,第004期

机译：深度加固学习方法对虚拟同步发电机的数据驱动的最优控制策略
3. Data-driven optimal energy management for a wind-solar-diesel-battery-reverse osmosis hybrid energy system using a deep reinforcement learning approach [J] . Zhang Guozhou, Hu Weihao, Cao Di, Energy Conversion & Management . 2021,第Jana期

机译：利用深增强学习方法对风力太阳能柴油 - 电池 - 电池反渗透混合动力系统的数据驱动的最佳能源管理
4. Data-Driven Solutions to Mixed H_2/H_∞ Control: A Hamilton-Inequality-Driven Reinforcement Learning Approach [C] . Yongliang Yang, Majid Mazouchi, Hamidreza Modares IEEE Conference on Control Technology and Applications . 2020

机译：混合H_2 /H_∞控制的数据驱动解决方案：汉密尔顿不等式驱动强化学习方法
5. Data-Driven Adaptive Traffic Signal Control via Deep Reinforcement Learning [D] . Tan, Tian. 2020

机译：通过深度增强学习数据驱动的自适应交通信号控制
6. A Multi-Objective Approach for Optimal Energy Management in Smart Home Using the Reinforcement Learning [O] . Muhammad Diyan, Bhagya Nathali Silva, Kijun Han 2020

机译：基于强化学习的智能家居最佳能源管理多目标方法
7. Data-driven control of micro-climate in buildings: An event-triggered reinforcement learning approach [O] . Ashkan Haji Hosseinloo, Alexander Ryzhov, Aldo Bischi, 2020

机译：建筑物中微气候的数据驱动控制：事件触发的加强学习方法
8. Multi-objective dynamic programming approach to constrained discrete-time optimal control [R] . Driessen, B. J. , Kwok, K. S. 1997

机译：约束离散时间最优控制的多目标动态规划方法

Data-driven dynamic multi-objective optimal control: A Hamiltonian-inequality driven satisficing reinforcement learning approach

摘要

著录项

相似文献

相关主题

期刊订阅