Compositionality of optimal control laws

Abstract

We present a theory of compositionality in stochastic optimal control, showing how task-optimal controllers can be constructed from certain primitives. The primitives are themselves feedback controllers pursuing their own agendas. They are mixed in proportion to how much progress they are making towards their agendas and how compatible their agendas are with the present task. The resulting composite control law is provably optimal when the problem belongs to a certain class. This class is rather general and yet has a number of unique properties, one of which is that the Bellman equation can be made linear even for non-linear or discrete dynamics. This gives rise to the compositionality developed here. In the special case of linear dynamics and Gaussian noise our framework yields analytical solutions (i.e. non-linear mixtures of LQG controllers) without requiring the final cost to be quadratic. More generally, a natural set of control primitives can be constructed by applying SVD to the Green's function of the Bellman equation. We illustrate the theory in the context of human arm movements. The ideas of optimality and compositionality are both very prominent in the field of motor control, yet they have been difficult to reconcile. Our work makes this possible.
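The abstract does not give equations, so the following is only a minimal sketch of the kind of compositionality it describes, under assumptions that are not taken from this page: a first-exit, discrete-state, linearly solvable control problem in which the desirability z = exp(-v) satisfies the linear equation z = exp(-q) P z on interior states. The 7-state chain, the state cost `q`, the final costs `g_prims`, the weights `w`, and the helper names `solve_desirability` and `optimal_policy` are all hypothetical choices made for illustration.

```python
import numpy as np

def solve_desirability(P, q, z_terminal, interior, terminal):
    """Solve z = exp(-q) * (P z) on interior states, with z fixed on terminal states."""
    P_ii = P[np.ix_(interior, interior)]   # interior -> interior transitions
    P_it = P[np.ix_(interior, terminal)]   # interior -> terminal transitions
    G = np.diag(np.exp(-q[interior]))
    # Rearranged linear system: (I - G P_ii) z_i = G P_it z_t
    z_i = np.linalg.solve(np.eye(len(interior)) - G @ P_ii, G @ P_it @ z_terminal)
    z = np.empty(P.shape[0])
    z[interior] = z_i
    z[terminal] = z_terminal
    return z

def optimal_policy(P, z, interior):
    """Controlled transitions u*(x'|x), proportional to p(x'|x) z(x')."""
    U = P[interior] * z[None, :]
    return U / U.sum(axis=1, keepdims=True)

# Assumed toy problem: random walk on a chain, both end states terminal.
n = 7
terminal = np.array([0, n - 1])
interior = np.arange(1, n - 1)
P = np.zeros((n, n))
for x in interior:
    P[x, [x - 1, x, x + 1]] = 1.0 / 3.0   # passive (uncontrolled) dynamics
q = np.full(n, 0.1)                        # state cost per step (assumed)

# Two primitives, each with its own final cost over the two terminal states.
g_prims = np.array([[0.0, 5.0],            # primitive 0 prefers the left end
                    [5.0, 0.0]])           # primitive 1 prefers the right end
w = np.array([0.3, 0.7])                   # task weights (assumed)

# Composite task: exp(-g) is a weighted sum of the primitives' exp(-g_k).
z_term_prims = np.exp(-g_prims)
z_term_comp = w @ z_term_prims

Z = np.array([solve_desirability(P, q, zt, interior, terminal) for zt in z_term_prims])
z_comp = solve_desirability(P, q, z_term_comp, interior, terminal)

# Linearity: the composite desirability is the same weighted sum of primitives.
z_gap = np.max(np.abs(z_comp - w @ Z))

# The composite policy is a state-dependent mixture of the primitive policies,
# with mixing weights proportional to w_k * z_k(x).
U_prims = [optimal_policy(P, zk, interior) for zk in Z]
U_comp = optimal_policy(P, z_comp, interior)
mix = w[:, None] * Z[:, interior]
mix = mix / mix.sum(axis=0)
U_mix = sum(m[:, None] * Uk for m, Uk in zip(mix, U_prims))
u_gap = np.max(np.abs(U_comp - U_mix))

print("desirability gap:", z_gap, "policy gap:", u_gap)
```

In this sketch the mixing weight of primitive k at state x is proportional to w_k z_k(x). The factor z_k(x) can be read as how much progress the primitive is making towards its own agenda, and w_k as how compatible that agenda is with the present task, which matches the verbal description in the abstract.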


Bibliographic details

Source: Conference on Neural Information Processing Systems; Annual conference on Neural Information Processing Systems | 2009 | pp. 1856-1864 | 9 pages
Author: Emanuel Todorov
Original format: PDF
CLC classification: Information processing

Similar literature

1. Application of multistep time decomposition at synthesis of suboptimal control laws for controlling multidimensional objects in the state space [J]. Karelin A. N. Приборы и системы: управление, контроль, диагностика. 2001, No. 7
2. Decomposition method of synthesis of optimal observation control laws in multipositional measuring systems [J]. V. V. Khutortsev, A. A. Fasolya. Automatic Control and Computer Sciences. 2001, No. 1
3. Concepts, Decompositions, and Optimal Control Laws for a Gaussian Team Problem [C]. Jan H. van Schuppen, Charalambos D. Charalambous. IEEE Annual Conference on Decision and Control. 2019
4. Development of a finite difference neighboring optimal control law and application to the optimal landing of a reusable launch vehicle [D]. Wetzel, Todd Andrew. 1996
5. Optimality and sub-optimality in a bacterial growth law [O]. Benjamin D. Towbin, Yael Korem, Anat Bren
6. A class of stochastic optimal control problems in Hilbert spaces: BSDEs and optimal control laws, state constraints, conditioned processes [O]. Fuhrman Marco. 2003
7. Approximating the linear quadratic optimal control law for hereditary systems with delays in the control [R]. Milman, Mark H. 1987
