IEEE/RSJ International Conference on Intelligent Robots and Systems

Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations

Abstract

Policy search methods in reinforcement learning have demonstrated success in scaling up to larger problems beyond toy examples. However, deploying these methods on real robots remains challenging due to the large sample complexity required during learning and their vulnerability to malicious intervention. We introduce Adversarially Robust Policy Learning (ARPL), an algorithm that leverages active computation of physically-plausible adversarial examples during training to enable robust policy learning in the source domain and robust performance under both random and adversarial input perturbations. We evaluate ARPL on four continuous control tasks and show superior resilience to changes in physical environment dynamics parameters and environment state as compared to state-of-the-art robust policy learning methods. Code, data, and additional experimental results are available at: stanfordvl.github.io/ARPL.
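The core mechanism the abstract describes, actively generating small, physically-plausible adversarial perturbations of the agent's state during training, can be illustrated with a fast-gradient-sign (FGSM-style) nudge on the observation. The sketch below is a minimal illustration under assumed choices, not the paper's implementation: `PolicyNet`, the squared-action-norm surrogate loss, `epsilon`, and the perturbation probability are all hypothetical stand-ins.

```python
# Minimal sketch: FGSM-style state perturbation during policy training.
# Assumptions (not from the paper): PyTorch, a deterministic MLP policy,
# a squared-action-norm surrogate loss, epsilon-bounded perturbations.
import torch
import torch.nn as nn

class PolicyNet(nn.Module):
    """Hypothetical MLP policy mapping states to action means."""
    def __init__(self, state_dim: int, action_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 64), nn.Tanh(),
            nn.Linear(64, action_dim),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)

def adversarial_state(policy: nn.Module, state: torch.Tensor,
                      epsilon: float = 0.01) -> torch.Tensor:
    """Return `state` nudged by epsilon along the gradient sign of a
    surrogate loss, i.e. the direction that most changes the policy."""
    state = state.clone().detach().requires_grad_(True)
    # Squared action norm as a stand-in objective; the paper's actual
    # loss choice may differ -- this only illustrates the gradient-sign step.
    loss = policy(state).pow(2).sum()
    loss.backward()
    return (state + epsilon * state.grad.sign()).detach()

# Usage: with some probability, feed the policy a perturbed observation
# instead of the true one during training rollouts.
policy = PolicyNet(state_dim=4, action_dim=1)
obs = torch.randn(4)
if torch.rand(()).item() < 0.5:   # perturbation frequency, an assumed knob
    obs = adversarial_state(policy, obs)
action = policy(obs)
```

Keeping epsilon small is what makes such a perturbation "physically plausible" in spirit: the perturbed state stays within a small neighborhood of states the system could actually visit, rather than being an arbitrary adversarial input.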
