International Journal of Computational Intelligence and Applications

A Novel Adaptive Sampling Strategy for Deep Reinforcement Learning


Abstract

Reinforcement learning, as an effective method for solving complex sequential decision-making problems, plays an important role in areas such as intelligent decision-making and behavioral cognition. The sample experience replay mechanism has contributed to the development of deep reinforcement learning by reusing past samples to improve sample efficiency. However, the existing prioritized experience replay mechanism changes the sample distribution in the replay buffer, because it assigns higher sampling frequencies to specific transitions, and it cannot be applied to actor-critic and other on-policy reinforcement learning algorithms. To address this, we propose an adaptive factor based on TD-error, which further increases sample utilization by giving larger attention weights to samples with larger TD-errors, and which can be flexibly embedded into the original Deep Q-Network (DQN) and Advantage Actor-Critic (A2C) algorithms to improve their performance. We then evaluated the proposed architecture on CartPole-V1 and six Atari game environments. The results, obtained under both fixed-temperature and annealing-temperature conditions, show that the improved algorithms outperform the vanilla DQN and the original A2C in cumulative reward and learning speed.
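The abstract does not spell out the adaptive factor beyond its being TD-error based with a temperature parameter, but one plausible reading is a softmax attention weighting over absolute TD-errors within a training batch, with the temperature controlling how sharply large-error samples are emphasized. A minimal sketch under that assumption (the function names and the loss form are hypothetical, not taken from the paper):

```python
import numpy as np

def td_error_attention_weights(td_errors, temperature=1.0):
    """Softmax attention weights over absolute TD-errors.

    Samples with larger |TD-error| receive larger weight; a higher
    temperature flattens the distribution toward uniform weighting.
    """
    scores = np.abs(np.asarray(td_errors, dtype=float)) / temperature
    scores -= scores.max()  # subtract max for numerical stability
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum()

def weighted_td_loss(td_errors, temperature=1.0):
    """Batch loss: attention-weighted sum of squared TD-errors."""
    td = np.asarray(td_errors, dtype=float)
    weights = td_error_attention_weights(td, temperature)
    return float(np.sum(weights * td ** 2))
```

Under this reading, a fixed-temperature run keeps `temperature` constant throughout training, while an annealing run decays it over time so that later batches focus increasingly on high-error transitions. Because the weighting is computed per batch rather than by resampling the buffer, it does not alter the sampling distribution, which is consistent with the abstract's claim that the factor also applies to on-policy methods such as A2C.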
