Reinforcement Learning-Based Methods for Falsification: A New Trend in Critical Controllers Verification

机译：基于加强学习的伪造方法：关键控制器验证的新趋势

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The talk gives an overview of a relatively recent trend in critical embedded controller verification: the use of (possibly deep) reinforcement learning algorithms for property falsification. The central idea is to use temporal logics with real-valued robust semantics to formulate safety objectives, and to formulate the property falsification problem as reward optimization problem, which can be solved using reinforcement learning algorithms for optimal planning or optimal policy synthesis. After introducing basic definitions and concepts, we review a collection of landmark papers, then we illustrate the approach with results obtained on an significant Airbus case study. Last, we outline current challenges and future research directions.

机译：谈话概述了临界嵌入式控制器验证中相对较近的趋势：使用（可能是深）加强学习算法的财产伪造。中心思想是使用具有实值强大语义的时间逻辑来制定安全目标，并将属性伪造问题作为奖励优化问题，可以使用加强学习算法来解决以获得最佳规划或最佳政策合成。在引入基本定义和概念后，我们审查了一个地标论文的集合，然后我们说明了在重要的空中客车案例研究中获得的结果。最后，我们概述了当前的挑战和未来的研究方向。

著录项

来源
《International conference on model and data engineering》|2019年|xv 349 p.|共1页
会议地点
作者
Remi Delmas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Sampling-based Falsification And Verification Of Controllers For Continuous Dynamic Systems [J] . Peng Cheng, Vijay Kumar The International journal of robotics research . 2008,第11a12期

机译：基于采样的连续动态系统控制器的伪造与验证
2. Hierarchical Tracking by Reinforcement Learning-Based Searching and Coarse-to-Fine Verifying [J] . Zhong Bineng, Bai Bing, Li Jun, IEEE Transactions on Image Processing . 2019,第5期

机译：通过基于强化学习的搜索和粗到精的验证进行分层跟踪
3. Reinforcement learning-based shared control for walking-aid robot and its experimental verification [J] . Xu Wenxia, Huang Jian, Wang Yongji, Advanced Robotics: The International Journal of the Robotics Society of Japan . 2015,第21a22期

机译：基于增强学习的助步机器人共享控制及其实验验证
4. Reinforcement Learning-Based Methods for Falsification: A New Trend in Critical Controllers Verification [C] . Remi Delmas International conference on model and data engineering . 2019

机译：基于强化学习的伪造方法：关键控制器验证的新趋势
5. Dynamic tuning of PI-controllers based on model-free Reinforcement Learning methods. [D] . Abbasi Brujeni, Lena. 2010

机译：基于无模型强化学习方法的PI控制器的动态调整。
6. Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing [O] . Zhong Ma, Yuejiao Wang, Yidai Yang, 2018

机译：基于强化学习的非合作目标捕获卫星姿态稳定方法
7. Sampling-based falsification and verification of controllers for continuous dynamic systems [O] . Peng Cheng, Vijay Kumar 2006

机译：基于采样的伪造和验证连续动态系统的控制器
8. Use of (D, MUF) And Maximum-Likelihood Methods for Detecting Falsification and Diversion in Data-Verification Problems [R] . Goldman, A. S., Beedgen, R. 1982

机译：利用（D，mUF）和最大似然方法检测数据验证问题中的证伪和转移

Reinforcement Learning-Based Methods for Falsification: A New Trend in Critical Controllers Verification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅