首页> 外文会议>International conference on model and data engineering >Reinforcement Learning-Based Methods for Falsification: A New Trend in Critical Controllers Verification
【24h】

Reinforcement Learning-Based Methods for Falsification: A New Trend in Critical Controllers Verification

机译:基于强化学习的伪造方法:关键控制器验证的新趋势

获取原文

摘要

The talk gives an overview of a relatively recent trend in critical embedded controller verification: the use of (possibly deep) reinforcement learning algorithms for property falsification. The central idea is to use temporal logics with real-valued robust semantics to formulate safety objectives, and to formulate the property falsification problem as reward optimization problem, which can be solved using reinforcement learning algorithms for optimal planning or optimal policy synthesis. After introducing basic definitions and concepts, we review a collection of landmark papers, then we illustrate the approach with results obtained on an significant Airbus case study. Last, we outline current challenges and future research directions.
机译:演讲概述了关键嵌入式控制器验证中相对较新的趋势:使用(可能是更深的)强化学习算法来伪造属性。中心思想是使用具有实值鲁棒语义的时态逻辑来制定安全目标,并将财产伪造问题制定为报酬优化问题,可以使用强化学习算法进行最优计划或最优策略综合来解决该问题。在介绍了基本的定义和概念之后,我们回顾了一系列具有里程碑意义的论文,然后通过在一个重要的空中客车案例研究中获得的结果来说明该方法。最后,我们概述了当前的挑战和未来的研究方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号