首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops >Inaccuracy of State-Action Value Function For Non-Optimal Actions in Adversarially Trained Deep Neural Policies
【24h】

Inaccuracy of State-Action Value Function For Non-Optimal Actions in Adversarially Trained Deep Neural Policies

机译:在对外地培训的深度神经政策中的非最佳行为的状态 - 行动价值函数不准确

获取原文

摘要

The introduction of deep neural networks as function approximator for the state-action value function has led to the creation of a new research area for self-learning systems that explore policies from high dimensional input. While the success of deep neural policies has resulted in the deployment of these policies in diversified application domains, there are significant concerns regarding their robustness towards specifically crafted malicious perturbations introduced to their inputs. Several studies have focused on making deep neural policies resistant to such perturbations via training with the existence of these perturbations (i.e. adversarial training). In this paper we focus on conducting an investigation on the state-action value function learned by state-of-the-art adversarially trained deep neural policies and vanilla trained deep neural policies. We perform several experiments in the OpenAI Baselines and we show that the state-action value functions learned by vanilla trained deep neural policies have better estimates for the non-optimal actions than the state-of-the-art adversarially trained deep neural policies. We believe our study lays out intriguing properties of adversarial training and could be critical step towards obtaining robust and reliable policies.
机译:深度神经网络作为状态逼近的函数近似值导致了为从高维输入探索政策的自学习系统创建了新的研究区域。虽然深度神经政策的成功导致在多元化的应用领域中部署了这些政策,但对其对其投入的特制恶意扰动的稳健性有重大问题。几项研究专注于通过培训进行这些扰动(即对抗培训)来促进这种扰动的深度神经政策。在本文中,我们专注于对通过最先进的离境培训的深度神经政策和香草培训的深层神经政策进行了对国家行动价值函数的调查。我们在Openai基准中执行了几个实验,我们表明Vanilla训练有素的深度神经政策学习的国家行动价值函数对非最佳行为的估计而不是最先进的离境培训的深层神经政策。我们相信我们的研究提出了对抗性培训的兴趣性质,可能是获得强大且可靠的政策的关键步骤。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号