首页> 外文会议>German Conference on Artificial Intelligence >Enhancing Explainability of Deep Reinforcement Learning Through Selective Layer-Wise Relevance Propagation
【24h】

Enhancing Explainability of Deep Reinforcement Learning Through Selective Layer-Wise Relevance Propagation

机译:通过选择性分层明智的关联传播增强深度强化学习的可解释性

获取原文
获取外文期刊封面目录资料

摘要

Modern deep reinforcement learning agents are capable of achieving super-human performance in tasks like playing Atari games, solely based on visual input. However, due to their use of neural networks the trained models are lacking transparency which makes their inner workings incomprehensible for humans. A promising approach to gain insights into the opaque reasoning process of neural networks is the layer-wise relevance propagation (LRP) concept. This visualization technique creates saliency maps that highlight the areas in the input which were relevant for the agents' decision-making process. Since such saliency maps cover every possible cause for a prediction, they are often accentuating very diverse parts of the input. This makes the results difficult to understand for people without a machine-learning background. In this work, we introduce an adjustment to the bRP concept that utilizes only the most relevant neurons of each convolutional layer and thus generates more selective saliency maps. We test our approach with a dueling Deep Q-Network (DQN) agent, which we trained on three different Atari games of varying complexity. Since the dueling DQN approach considerably alters the neural network architecture of the original DQN algorithm, it requires its own LRP variant which will be presented in this paper.
机译:现代深层强化学习代理仅凭视觉输入就能在诸如玩Atari游戏之类的任务中实现超人的表现。但是,由于使用了神经网络,所以训练有素的模型缺乏透明度,这使得它们的内部工作方式对于人类来说是难以理解的。深入了解神经网络的不透明推理过程的一种有前途的方法是分层相关传播(LRP)概念。这种可视化技术可创建显着性图,以突出显示输入中与代理的决策过程相关的区域。由于此类显着性图涵盖了进行预测的所有可能原因,因此它们通常会加重输入中非常不同的部分。对于没有机器学习背景的人们来说,这使得结果难以理解。在这项工作中,我们对bRP概念进行了调整,该概念仅利用每个卷积层中最相关的神经元,从而生成更具选择性的显着性图。我们使用对决的深度Q网络(DQN)代理测试了我们的方法,该代理对三种复杂程度各异的Atari游戏进行了培训。由于对决DQN方法大大改变了原始DQN算法的神经网络体系结构,因此它需要自己的LRP变体,这将在本文中进行介绍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号