Distributed Adaptive Control: Beyond Single-Instant, Discrete Control Variables



Abstract

In extensive form noncooperative game theory, at each instant t, each agent i sets its state x_i independently of the other agents, by sampling an associated distribution, q_i(x_i). The coupling between the agents arises in the joint evolution of those distributions. Distributed control problems can be cast the same way. In those problems the system designer sets aspects of the joint evolution of the distributions to try to optimize the goal for the overall system. Now information theory tells us what the separate q_i of the agents are most likely to be if the system were to have a particular expected value of the objective function G(x_1, x_2, ...). So one can view the job of the system designer as speeding an iterative process. Each step of that process starts with a specified value of E_q(G), and the convergence of the q_i to the most likely set of distributions consistent with that value. After this the target value for E_q(G) is lowered, and then the process repeats. Previous work has elaborated many schemes for implementing this process when the underlying variables x_i all have a finite number of possible values and G does not extend to multiple instants in time. That work is also based on a fixed mapping from agents to control devices, so that the statistical independence of the agents' moves means independence of the device states. This paper extends that work to relax all of these restrictions, which broadens its applicability to include continuous spaces and Reinforcement Learning. This paper also elaborates how some of that earlier work can be viewed as a first-principles justification of evolution-based search algorithms.
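The iterative process described above can be sketched for the restricted setting that the abstract attributes to previous work (finitely many states per agent, a single instant, one agent per device). This is an illustrative sketch only, not the authors' algorithm. It assumes the standard maximum-entropy form q_i(x_i) ∝ exp(-E[G | x_i] / T), where lowering the temperature T plays the role of lowering the target value of E_q(G). The function name `pc_minimize`, the temperature schedule, the Monte Carlo estimates, and the step size are all hypothetical choices made for the example.

```python
import numpy as np

def pc_minimize(G, n_agents, n_states, temps=(2.0, 1.0, 0.5, 0.25, 0.1),
                n_samples=200, inner_iters=20, step=0.5, seed=0):
    """Hypothetical sketch: anneal a product distribution q = prod_i q_i toward low E_q(G)."""
    rng = np.random.default_rng(seed)
    # Each agent starts with a uniform distribution over its own states.
    q = np.full((n_agents, n_states), 1.0 / n_states)
    # Running estimates of E[G | x_i = s]; kept across iterations (a simplification).
    est = np.zeros((n_agents, n_states))
    for T in temps:                      # lowering T stands in for lowering the target E_q(G)
        for _ in range(inner_iters):     # let the q_i settle at the current T
            # Agents sample their states independently of one another.
            samples = np.stack(
                [rng.choice(n_states, size=n_samples, p=q[i]) for i in range(n_agents)],
                axis=1,
            )
            values = np.array([G(x) for x in samples])
            for i in range(n_agents):
                for s in range(n_states):
                    mask = samples[:, i] == s
                    if mask.any():       # unsampled states keep their previous estimate
                        est[i, s] = values[mask].mean()
                # Boltzmann distribution over the conditional expected objective.
                boltz = np.exp(-(est[i] - est[i].min()) / T)
                boltz /= boltz.sum()
                # Damped move toward the Boltzmann distribution; q[i] stays normalized.
                q[i] = (1.0 - step) * q[i] + step * boltz
    return q
```

A call such as `pc_minimize(lambda x: float(np.sum((x - np.array([2, 0, 1])) ** 2)), n_agents=3, n_states=3)` returns one row of probabilities per agent. The damped update is a design choice for the sketch: it keeps each agent from reacting too sharply to noisy sample estimates.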


Bibliographic details

Source
International Workshop on Monitoring, Security, and Rescue Techniques in Multiagent Systems | 2005 | 22 pages
Authors
David H. Wolpert; Stefan Bieniawski
Original format: PDF
Chinese Library Classification: Computing technology, computer technology

Similar literature


1. Distributed adaptive containment control for a class of discrete-time nonlinear multi-agent systems with unknown parameters and control gains [J] . Li Nannan, Fei Qing, Ma Hongbin Journal of the Franklin Institute . 2020, No. 13
2. Distributed Optimal Power Flow With Discrete Control Variables of Large Distributed Power Systems [J] . Lin C.-H., Lin S.-Y. IEEE Transactions on Power Systems . 2008, No. 3
3. Large stroke and high precision pneumatic-piezoelectric hybrid positioning control using adaptive discrete variable structure control [J] . Mao-Hsiung Chiang, Chung-Chieh Chen, Tan-Ni Tsou Mechatronics: The Science of Intelligent Machines . 2005, No. 5
4. Distributed Adaptive Control: Beyond Single-Instant, Discrete Control Variables [C] . David H. Wolpert, Stefan Bieniawski International Workshop on Monitoring, Security, and Rescue Techniques in Multiagent Systems . 2005
5. Iterative Learning Control and Adaptive Control for Systems with Unstable Discrete-time Inverse [D] . Wang, Bowen . 2019
6. How do treatments for chronic fatigue syndrome work? Exploration of instrumental variable methods for mediation analysis in PACE – a randomised controlled trial of adaptive pacing therapy, cognitive behaviour therapy, graded exercise therapy, and specialist medical care [O] . Kimberley Goldsmith, Trudie Chalder, Peter White . 2011
7. DIGITAL CONTROL SYSTEMS IMPLEMENTATION TECHNIQUES, VOLUME 70 OF CONTROL AND DYNAMIC SYSTEMS: ADVANCES IN THEORY AND APPLICATIONS, C. Leondes (ed), Academic Press, San Diego, 1995, 390 pp., ISBN 0-12-0127702, $99.00 DISCRETE-TIME CONTROL SYSTEM ANALYSIS AND DESIGN, VOLUME 71 OF CONTROL AND DYNAMIC SYSTEMS: ADVANCES IN THEORY AND APPLICATIONS, C. Leondes (ed), Academic Press, San Diego, 1995, 410 pp., ISBN 0-12-0127715, $99.00 DISCRETE-TIME CONTROL SYSTEM IMPLEMENTATION TECHNIQUES, VOLUME 72 OF CONTROL AND DYNAMIC SYSTEMS: ADVANCES IN THEORY AND APPLICATIONS, C. Leondes (ed), Academic Press, San Diego, 1995, 388 pp., ISBN 0-12-0127725, $99.00 TECHNIQUES IN DISCRETE-TIME STOCHASTIC CONTROL SYSTEMS, VOLUME 73 OF CONTROL AND DYNAMIC SYSTEMS: ADVANCES IN THEORY AND APPLICATIONS, C. Leondes (ed), Academic Press, San Diego, 1995, 380 pp., ISBN 0-12-0127734, $99.00 TECHNIQUES IN DISCRETE AND CONTINUOUS ROBUST SYSTEMS, VOLUME 74 OF CONTROL AND DYNAMIC SYSTEMS: ADVANCES IN THEORY AND APPLICATIONS, C. Leondes (ed), Academic Press, San Diego, 1995, 412 pp., ISBN 0-12-0127741, $99.00 [O] . D. SUBBARAM NAIDU 1997

8. Distributed Adaptive Control: Beyond Single-Instant, Discrete Variables [R] . Wolpert, D. H. , Bieniawski, S. 2005
