IEEE/RSJ International Conference on Intelligent Robots and Systems

Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations


Abstract

Policy search methods in reinforcement learning have demonstrated success in scaling up to larger problems beyond toy examples. However, deploying these methods on real robots remains challenging due to the large sample complexity required during learning and their vulnerability to malicious intervention. We introduce Adversarially Robust Policy Learning (ARPL), an algorithm that leverages active computation of physically-plausible adversarial examples during training to enable robust policy learning in the source domain and robust performance under both random and adversarial input perturbations. We evaluate ARPL on four continuous control tasks and show superior resilience to changes in physical environment dynamics parameters and environment state as compared to state-of-the-art robust policy learning methods. Code, data, and additional experimental results are available at: stanfordvl.github.io/ARPL.
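The abstract describes the mechanism only at a high level: adversarial examples are computed actively during training and fed back into policy learning. As a rough illustration of that loop, here is a minimal PyTorch sketch of a gradient-based (FGSM-style) observation perturbation. The function name, the `epsilon` bound, and the action-norm proxy loss are assumptions made for illustration, not details taken from the paper; the authors' actual code is linked above.

```python
# A minimal sketch of the core ARPL idea: perturb the agent's state
# observation with a gradient-based (FGSM-style) adversarial example during
# training rollouts. This is NOT the authors' implementation; `policy`,
# `epsilon`, and the action-norm proxy loss are all illustrative assumptions.
import torch
import torch.nn as nn


def adversarial_state(policy: nn.Module, state: torch.Tensor,
                      epsilon: float = 0.01) -> torch.Tensor:
    """Return a norm-bounded perturbation of `state` along the direction
    the policy's output is most sensitive to (fast gradient sign method)."""
    state = state.clone().detach().requires_grad_(True)
    action = policy(state)
    # Proxy objective: the squared norm of the action. Its gradient w.r.t.
    # the state points in directions that most change the policy's output.
    loss = action.pow(2).sum()
    loss.backward()
    with torch.no_grad():
        perturbed = state + epsilon * state.grad.sign()
    return perturbed.detach()


# Hypothetical usage inside a rollout: replace the true observation with
# its adversarial counterpart for a fraction of steps, so the policy is
# trained against worst-case, norm-bounded input perturbations.
# state = adversarial_state(policy, state) if torch.rand(()) < 0.5 else state
```

Bounding the perturbation by a small `epsilon` is what keeps such adversarial states "physically plausible" in spirit: the attack stays within a small, realistic neighborhood of the true state rather than producing arbitrary inputs.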
