首页> 外文会议>IEEE International Conference on Data Science in Cyberspace >Adversarial Examples Construction Towards White-Box Q Table Variation in DQN Pathfinding Training
【24h】

Adversarial Examples Construction Towards White-Box Q Table Variation in DQN Pathfinding Training

机译:DQN寻路培训中针对白盒Q表变化的对抗性示例构建

获取原文
获取外文期刊封面目录资料

摘要

As a new research hotspot in the field of artificial intelligence, deep reinforcement learning (DRL) has achieved certain success in various fields such as robot control, computer vision, natural language processing and so on. At the same time, the possibility of its application being attacked and whether it have a strong resistance to strike has also become a hot topic in recent years. Therefore, we select the representative Deep Q Network (DQN) algorithm in deep reinforcement learning, and use the robotic automatic pathfinding application as a countermeasure application scenario for the first time, and attack DQN algorithm against the vulnerability of the adversarial samples. In this paper, we first use DQN to find the optimal path, and analyze the rules of DQN pathfinding. Then, we propose a method that can effectively find vulnerable points towards White-Box Q table variation in DQN pathfinding training. Finally, we build a simulation environment as a basic experimental platform to test our method, through multiple experiments, we can successfully find the adversarial examples and the experimental results show that the supervised method we proposed is effective.
机译:作为人工智能领域的新研究热点,深度强化学习(DRL)在机器人控制,计算机视觉,自然语言处理等各个领域都取得了一定的成功。同时,其应用受到攻击的可能性以及它是否具有强大的抵抗力也成为近年来的热门话题。因此,我们在深度强化学习中选择了具有代表性的Deep Q Network(DQN)算法,并首次将机器人自动寻路应用程序作为对策应用场景,并针对敌对样本的脆弱性攻击DQN算法。在本文中,我们首先使用DQN查找最佳路径,然后分析DQN寻路的规则。然后,我们提出了一种方法,该方法可以在DQN寻路训练中有效地找到针对白盒Q表变化的易受攻击点。最后,我们建立了一个仿真环境作为测试该方法的基本实验平台,通过多次实验,可以成功找到对抗性实例,实验结果表明,本文提出的监督方法是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号