【24h】

Compositionality of optimal control laws

机译:最优控制律的组成

获取原文

摘要

We present a theory of compositionality in stochastic optimal control, showing how task-optimal controllers can be constructed from certain primitives. The primitives are themselves feedback controllers pursuing their own agendas. They are mixed in proportion to how much progress they are making towards their agendas and how compatible their agendas are with the present task. The resulting composite control law is provably optimal when the problem belongs to a certain class. This class is rather general and yet has a number of unique properties - one of which is that the Bellman equation can be made linear even for non-linear or discrete dynamics. This gives rise to the compositionality developed here. In the special case of linear dynamics and Gaussian noise our framework yields analytical solutions (i.e. non-linear mixtures of LQG controllers) without requiring the final cost to be quadratic. More generally, a natural set of control primitives can be constructed by applying SVD to Green's function of the Bellman equation. We illustrate the theory in the context of human arm movements. The ideas of opti-mality and compositionality are both very prominent in the field of motor control, yet they have been difficult to reconcile. Our work makes this possible.
机译:我们提出了一种随机最优控制的组合理论,展示了如何从某些原语中构造出任务最优控制器。这些原语本身就是追求自己议程的反馈控制器。他们的混合情况与他们在议程上取得的进展以及议程与当前任务的相符程度成正比。当问题属于某个类别时,由此产生的复合控制定律被证明是最优的。该类相当笼统,但具有许多独特的属性-其中之一是即使对于非线性或离散动力学,也可以使Bellman方程线性化。这引起了此处开发的组合性。在线性动力学和高斯噪声的特殊情况下,我们的框架可以提供分析解决方案(即LQG控制器的非线性混合),而无需最终成本为二次方。更一般而言,可以通过将SVD应用于Bellman方程的Green函数来构造一组自然的控制原语。我们在人类手臂运动的背景下说明了这一理论。最优性和组合性的思想在电机控制领域都非常突出,但是很难调和。我们的工作使这成为可能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号