首页> 外文期刊>Advanced engineering informatics >An active inference approach to on-line agent monitoring in safety-critical systems
【24h】

An active inference approach to on-line agent monitoring in safety-critical systems

机译:安全关键系统中在线代理监视的主动推理方法

获取原文
获取原文并翻译 | 示例
       

摘要

The current trend towards integrating software agents in safety-critical systems such as drones, autonomous cars and medical devices, which must operate in uncertain environments, gives rise to the need of on-line detection of an unexpected behavior. In this work, on-line monitoring is carried out by comparing environmental state transitions with prior beliefs descriptive of optimal behavior. The agent policy is computed analytically using linearly solvable Markov decision processes. Active inference using prior beliefs allows a monitor proactively rehearsing on-line future agent actions over a rolling horizon so as to generate expectations to discover surprising behaviors. A Bayesian surprise metric is proposed based on twin Gaussian processes to measure the difference between prior and posterior beliefs about state transitions in the agent environment. Using a sliding window of sampled data, beliefs are updated a posteriori by comparing a sequence of state transitions with the ones predicted using the optimal policy. An artificial pancreas for diabetic patients is used as a representative example.
机译:当前将软件代理集成到必须在不确定环境中运行的安全关键系统(如无人机,无人驾驶汽车和医疗设备)中的趋势提出了对意外行为进行在线检测的需求。在这项工作中,通过将环境状态转换与描述最佳行为的先前信念进行比较来进行在线监控。代理策略是使用线性可解马尔可夫决策过程进行解析计算的。使用先前的信念进行主动推理可以使监视器在滚动的地平线上主动演练未来的在线代理行为,从而产生发现令人惊讶行为的期望。提出了一种基于双高斯过程的贝叶斯惊奇度量,以测量关于代理环境中状态转换的先验信念与后验信念之间的差异。使用采样数据的滑动窗口,通过比较状态转换序列与使用最佳策略预测的状态转换序列,可以更新后验信念。糖尿病患者的人造胰腺被用作代表性的例子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号