首页> 外文会议>International Conference on Control, Automation and Robotics >Learning abstraction of a swarm to control a parent system
【24h】

Learning abstraction of a swarm to control a parent system

机译:学习群体的抽象来控制父系统

获取原文

摘要

Many high-dimensional systems can be decomposed into a swarm of subsystems manipulating a parent system, each with its own dynamics. Rather than design control laws directly on this state space, we propose a method that uses deep learning to learn an abstract state of the swarm that encapsulates the interactions between the swarm and parent system. In addition, controllers for the swarm and parent system that utilize this abstract state are also learned. Further, the controller for the swarm is designed to be pseudo-distributed, where the policy for each member is only dependent on that member's state, the abstract representation of the swarm, and the desired abstract representation. We set up neural networks for each part of the architecture and assemble them into a recurrent neural network wherein the mapping to the abstract state and the control laws for the parent and swarm systems are learned simultaneously. This method is applied to an example problem consisting of a tilting plane with a swarm of robots driving on top of it, with the goal being to balance the plane. The results are compared to those of an iterative Linear Quadratic Regulator as well as a prescribed abstract state with hand-crafted controllers for the swarm and parent systems. Our results show that the performance of the learned method is comparable to these more demanding methods.
机译:许多高维系统可以分解成操纵父系统的群体系统,每个子系统都具有自己的动态。我们提出了一种使用深度学习来学习蜂拥而至的群体和父系统之间的交互的抽象状态的方法,而不是直接设计控制法律。此外,还学习了利用这种抽象状态的群和父系统的控制器。此外,用于群体的控制器被设计为伪分布,其中每个成员的策略仅取决于该成员的状态,群体的抽象表示以及所需的抽象表示。我们为架构的每个部分设置神经网络,并将它们组装到复发性神经网络中,其中同时学习映射到父母和群体系统的抽象状态和控制法。该方法应用于由倾斜平面组成的示例问题,其中一群机器人驱动在其顶部,目标是平衡平面。结果将结果与迭代线性二次调节器以及具有用于群体和父系系统的手工制作控制器的规定抽象状态。我们的研究结果表明,学习方法的性能与这些更苛刻的方法相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号