Robustness Assessment of Asynchronous Advantage Actor-Critic Based on Dynamic Skewness and Sparseness Computation: A Parallel Computing View

Tong Chen; Ji-Qiang Liu; He Li; Shuo-Ru Wang; Wen-Jia Niu; En-Dong Tong; Liang Chang; Qi Alfred Chen; Gang Li

Abstract

Reinforcement learning, as a form of autonomous learning, is driving artificial intelligence (AI) toward practical applications. Having demonstrated the potential to significantly improve upon synchronously parallel learning, the parallel-computing-based asynchronous advantage actor-critic (A3C) opens a new door for reinforcement learning. Unfortunately, the influence of this acceleration on A3C's robustness has been largely overlooked. In this paper, we perform the first robustness assessment of A3C based on parallel computing. By perceiving the policy's actions, we construct a global matrix of action probability deviation and define two novel measures, skewness and sparseness, which together form an integral robustness measure. Building on this static assessment, we then develop a dynamic robustness assessment algorithm through situational whole-space state sampling over changing episodes. Extensive experiments with different combinations of agent number and learning rate are conducted on an A3C-based pathfinding application, demonstrating that the proposed assessment can effectively measure the robustness of A3C, achieving an accuracy of 83.3%.
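
The abstract names the ingredients of the static assessment (a matrix of action probability deviation, plus skewness and sparseness measures) but does not give their definitions. The following is only a minimal sketch under stated assumptions: the deviation is taken as the absolute difference between a reference policy and an observed policy, skewness is the standard Fisher-Pearson sample skewness, and sparseness is Hoyer's measure. The function names and the toy policies are hypothetical, and the paper's own definitions and its combined robustness measure may differ.

```python
import math

def deviation_matrix(reference_probs, observed_probs):
    """Absolute per-state, per-action probability deviation (assumed definition)."""
    return [[abs(p - q) for p, q in zip(ref_row, obs_row)]
            for ref_row, obs_row in zip(reference_probs, observed_probs)]

def skewness(values):
    """Fisher-Pearson sample skewness m3 / m2**1.5; 0.0 when there is no variance."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return 0.0 if m2 == 0 else m3 / m2 ** 1.5

def hoyer_sparseness(values):
    """Hoyer sparseness: 0 for equal entries, 1 for a single nonzero entry (needs n > 1)."""
    n = len(values)
    l1 = sum(abs(v) for v in values)
    l2 = math.sqrt(sum(v * v for v in values))
    if l2 == 0:
        return 0.0
    return (math.sqrt(n) - l1 / l2) / (math.sqrt(n) - 1)

# Hypothetical toy policies: 3 sampled states, 4 actions each (rows sum to 1).
reference = [[0.7, 0.1, 0.1, 0.1],
             [0.1, 0.7, 0.1, 0.1],
             [0.1, 0.1, 0.7, 0.1]]
observed = [[0.6, 0.2, 0.1, 0.1],
            [0.1, 0.7, 0.1, 0.1],
            [0.4, 0.1, 0.4, 0.1]]

deviations = deviation_matrix(reference, observed)
flat = [d for row in deviations for d in row]  # treat the matrix as one sample
print("skewness:", skewness(flat))
print("sparseness:", hoyer_sparseness(flat))
```

Under these assumptions, a high sparseness value means the deviation is concentrated in a few state-action pairs, while a positive skewness means a few large deviations dominate many small ones. How the two are weighted into a single robustness score is not stated in the abstract and is not reproduced here.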

Bibliographic details

Source
Journal of Computer Science and Technology (《计算机科学技术学报：英文版》) | 2021, Issue 5 | pp. 1002-1021 | 20 pages
Authors
Tong Chen; Ji-Qiang Liu; He Li; Shuo-Ru Wang; Wen-Jia Niu; En-Dong Tong; Liang Chang; Qi Alfred Chen; Gang Li
Author affiliations

Beijing Key Laboratory of Security and Privacy in Intelligent Transportation, Beijing Jiaotong University, Beijing 100044, China

Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin 541004, China

Donald Bren School of Information and Computer Sciences, University of California, Irvine 92697, U.S.A.

Centre for Cyber Security Research and Innovation, Deakin University, Geelong, VIC 3216, Australia

Original format: PDF
Text language: chi
Chinese Library Classification: Computing technology, computer technology
Keywords
robustness assessment; skewness; sparseness; asynchronous advantage actor-critic; reinforcement learning
