首页> 外文会议>American Control Conference >Counterexample-guided permissive supervisor synthesis for probabilistic systems through learning
【24h】

Counterexample-guided permissive supervisor synthesis for probabilistic systems through learning

机译:通过学习针对概率系统的反例指导的宽松主管综合

获取原文

摘要

Formal methods in robotic motion planning have emerged as a hot research topic recently due to its correct-by-design nature, and most results haven been based on nonprobabilistic discrete models. To better handle the environment uncertainties, sensor noise and actuator imperfection, control problems in probabilistic systems like Markov Chain (MC) and Markov Decision Process (MDP) have also been studied. Most existing methods are either based on probabilistic model checking or through reinforcement learning oriented optimization. On the other hand, in the literature of supervisory control of discrete event systems, people usually design supervisors with maximum permissive nature. In other words, a collection of schedulers, instead of a single one scheduler, that satisfy the given specification is designed at the same time. We are therefore motivated to propose a novel learning based automated supervisor synthesis framework to automatically generate permissive supervisor so that the supervised system satisfies the given specification. Our approach is based on a modified L* learning algorithm and runs iteratively. It is guaranteed to be correct and terminate in finite steps.
机译:机器人运动计划中的形式化方法由于其按设计正确的性质,最近已成为研究的热点,并且大多数结果都基于非概率离散模型。为了更好地处理环境不确定性,传感器噪声和执行器缺陷,还研究了概率系统(例如马尔可夫链(MC)和马尔可夫决策过程(MDP))中的控制问题。大多数现有的方法要么基于概率模型检查,要么基于面向强化学习的优化。另一方面,在离散事件系统监督控制的文献中,人们通常设计具有最大允许性质的监督器。换句话说,可以同时设计满足给定规范的一组调度程序,而不是单个调度程序。因此,我们有动机提出一种新颖的基于学习的自动化主管综合框架,以自动生成宽松的主管,从而使受监管的系统满足给定的规范。我们的方法基于改进的L *学习算法,并且可以迭代地运行。它可以保证正确无误并以有限的步长终止。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号