
Sample-Efficient Deep Reinforcement Learning with Directed Associative Graph

Abstract

Reinforcement learning can be modeled mathematically as a Markov decision process. Consequently, the interaction samples and the connection relations between them are the two main types of information available for learning. However, most recent work on deep reinforcement learning treats samples independently, both within and across episodes. In this paper, to exploit more of this sample information, we propose an additional learning system based on a directed associative graph (DAG). The DAG is built over all trajectories in real time and captures the complete connection relations among samples across all episodes. By planning along the directed edges of the DAG, we obtain another perspective for estimating state-action values, especially for pairs unknown to the deep neural network (DNN) and to episodic memory (EM). A mixed loss function combining the three learning systems (DNN, EM, and DAG) improves the efficiency of parameter updates in the proposed algorithm. Experiments show that our algorithm significantly outperforms state-of-the-art algorithms in both performance and sample efficiency on the test environments. Furthermore, the convergence of the algorithm is proved in the appendix, and its long-term performance as well as the effect of the DAG are verified.
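To make the mechanism described in the abstract concrete, the following minimal Python sketch illustrates one way such a system could be organized. It is an illustration based only on the abstract: the state node keys, the greedy max-backup planning rule, the number of sweeps, and the loss weights lam_em and lam_dag are all assumptions, not the paper's actual design.

```python
# Minimal sketch (assumptions flagged inline): a directed graph over all
# observed transitions, value backups along its edges, and a mixed loss
# combining DNN, episodic-memory (EM), and DAG targets.
from collections import defaultdict

import torch
import torch.nn.functional as F


class DirectedAssociativeGraph:
    """Directed graph over every transition seen in any episode."""

    def __init__(self, gamma=0.99):
        self.gamma = gamma
        self.edges = defaultdict(list)   # state -> [(action, reward, next_state)]
        self.value = defaultdict(float)  # graph-based state-value estimates

    def add_transition(self, s, a, r, s_next):
        # Built in real time: each environment step adds one directed edge,
        # so states shared between episodes connect their trajectories.
        self.edges[s].append((a, r, s_next))

    def plan(self, sweeps=10):
        # Assumed planning rule: repeated greedy backups along directed
        # edges, propagating returns backward through the whole graph.
        for _ in range(sweeps):
            for s, outgoing in self.edges.items():
                self.value[s] = max(r + self.gamma * self.value[s2]
                                    for _, r, s2 in outgoing)

    def q_estimate(self, s, a):
        # Graph-based estimate of a state-action pair, available even when
        # the pair is unknown to the DNN, provided it was ever visited.
        returns = [r + self.gamma * self.value[s2]
                   for a2, r, s2 in self.edges.get(s, []) if a2 == a]
        return max(returns) if returns else None


def mixed_loss(q_pred, td_target, em_target, dag_target,
               lam_em=0.1, lam_dag=0.1):
    """Hypothetical mixed loss over the three systems (DNN, EM, DAG)."""
    loss = F.mse_loss(q_pred, td_target)          # ordinary TD regression
    if em_target is not None:                     # episodic-memory term
        loss = loss + lam_em * F.mse_loss(q_pred, em_target)
    if dag_target is not None:                    # DAG planning term
        loss = loss + lam_dag * F.mse_loss(q_pred, dag_target)
    return loss
```

In this sketch, a state-action pair that the DNN has never fit well can still receive a graph-based estimate as long as it appears somewhere in the stored trajectories, which mirrors the abstract's claim that the DAG offers another perspective for pairs unknown to the DNN and EM.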
