Reinforcement Learning-Based Methods for Falsification: A New Trend in Critical Controllers Verification

机译：基于强化学习的伪造方法：关键控制器验证的新趋势

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The talk gives an overview of a relatively recent trend in critical embedded controller verification: the use of (possibly deep) reinforcement learning algorithms for property falsification. The central idea is to use temporal logics with real-valued robust semantics to formulate safety objectives, and to formulate the property falsification problem as reward optimization problem, which can be solved using reinforcement learning algorithms for optimal planning or optimal policy synthesis. After introducing basic definitions and concepts, we review a collection of landmark papers, then we illustrate the approach with results obtained on an significant Airbus case study. Last, we outline current challenges and future research directions.

机译：演讲概述了关键嵌入式控制器验证中相对较新的趋势：使用（可能是更深的）强化学习算法来伪造属性。中心思想是使用具有实值鲁棒语义的时态逻辑来制定安全目标，并将财产伪造问题制定为报酬优化问题，可以使用强化学习算法进行最优计划或最优策略综合来解决该问题。在介绍了基本的定义和概念之后，我们回顾了一系列具有里程碑意义的论文，然后通过在一个重要的空中客车案例研究中获得的结果来说明该方法。最后，我们概述了当前的挑战和未来的研究方向。

著录项

来源
《International conference on model and data engineering》|2019年|a2-a2|共1页
会议地点
作者
Remi Delmas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Sampling-based Falsification And Verification Of Controllers For Continuous Dynamic Systems [J] . Peng Cheng, Vijay Kumar The International journal of robotics research . 2008,第11a12期

机译：基于采样的连续动态系统控制器的伪造与验证
2. Hierarchical Tracking by Reinforcement Learning-Based Searching and Coarse-to-Fine Verifying [J] . Zhong Bineng, Bai Bing, Li Jun, IEEE Transactions on Image Processing . 2019,第5期

机译：通过基于强化学习的搜索和粗到精的验证进行分层跟踪
3. Reinforcement learning-based shared control for walking-aid robot and its experimental verification [J] . Xu Wenxia, Huang Jian, Wang Yongji, Advanced Robotics: The International Journal of the Robotics Society of Japan . 2015,第21a22期

机译：基于增强学习的助步机器人共享控制及其实验验证
4. Reinforcement Learning-Based Methods for Falsification: A New Trend in Critical Controllers Verification [C] . Remi Delmas International conference on model and data engineering . 2019

机译：基于加强学习的伪造方法：关键控制器验证的新趋势
5. Dynamic tuning of PI-controllers based on model-free Reinforcement Learning methods. [D] . Abbasi Brujeni, Lena. 2010

机译：基于无模型强化学习方法的PI控制器的动态调整。
6. Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing [O] . Zhong Ma, Yuejiao Wang, Yidai Yang, 2018

机译：基于强化学习的非合作目标捕获卫星姿态稳定方法
7. Sampling-based falsification and verification of controllers for continuous dynamic systems [O] . Peng Cheng, Vijay Kumar 2006

机译：基于采样的伪造和验证连续动态系统的控制器
8. Use of (D, MUF) And Maximum-Likelihood Methods for Detecting Falsification and Diversion in Data-Verification Problems [R] . Goldman, A. S., Beedgen, R. 1982

机译：利用（D，mUF）和最大似然方法检测数据验证问题中的证伪和转移

Reinforcement Learning-Based Methods for Falsification: A New Trend in Critical Controllers Verification

摘要

著录项

相似文献

相关主题

期刊订阅