Exploitation-Oriented Learning with Deep Learning - Introducing Profit Sharing to a Deep Q-Network

Kazuteru Miyazaki

首页> 外文期刊>Journal of Advanced Computatioanl Intelligence and Intelligent Informatics >Exploitation-Oriented Learning with Deep Learning - Introducing Profit Sharing to a Deep Q-Network

【24h】

Exploitation-Oriented Learning with Deep Learning - Introducing Profit Sharing to a Deep Q-Network

机译：深入学习的剥削为导向的学习 - 将利润分享到深度Q网络

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Currently, deep learning is attracting significant interest. Combining deep Q-networks (DQNs) and Q-learning has produced excellent results for several Atari 2600 games. In this paper, we propose an exploitation-oriented learning (XoL) method that incorporates deep learning to reduce the number of trial-and-error searches. We focus on a profit sharing (PS) method that is an XoL method, and combine it with a DQN to propose a DQNwithPS method. This method is compared with a DQN in Atari 2600 games. We demonstrate that the proposed DQN-with PS method can learn stably with fewer trial-and-error searches than required by only a DQN.

机译：目前，深度学习吸引了重大兴趣。组合Deep Q-Networks（DQN）和Q-Learning为几个Atari 2600游戏产生了出色的结果。在本文中，我们提出了一种引发的剥削学习（XOL）方法，该方法包含深入学习，以减少试验和错误搜索的数量。我们专注于作为XOL方法的利润共享（PS）方法，并将其与DQN结合起来提出DQNWithps方法。将该方法与Atari 2600游戏中的DQN进行比较。我们证明所提出的DQN-With PS方法可以稳定地学习，而不是仅仅是DQN所需的试验和错误搜索。

著录项

来源
《Journal of Advanced Computatioanl Intelligence and Intelligent Informatics》 |2017年第125期|共7页
作者
Kazuteru Miyazaki;
展开▼
作者单位

National Institution for Academic Degrees and Quality Enhancement of Higher Education;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类其他计算机;
关键词
Reinforcement learning; Deep learning; Deep reinforcement learning; Profit sharing; Deep Q-network;

机译：加强学习;深入学习;深增强学习;利润分享;深Q-Network;

相似文献

外文文献
中文文献
专利

1. Exploitation-Oriented Learning with Deep Learning - Introducing Profit Sharing to a Deep Q-Network [J] . Kazuteru Miyazaki Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2017,第5a125期

机译：深入学习的剥削为导向的学习 - 将利润分享到深度Q网络
2. Proposal of a Deep Q-network with Profit Sharing [J] . Kazuteru Miyazaki Procedia Computer Science . 2018,第1期

机译：具有利润分享的深度Q网络的建议
3. Active one-shot learning by a deep Q-network strategy [J] . Neurocomputing . 2020,第Mara28期

机译：通过深度Q网络策略主动进行一次学习
4. A Proposal for Reducing the Number of Trial-and-Error Searches for Deep Q-Networks Combined with Exploitation-Oriented Learning [C] . Naoki Kodama, Kazuteru Miyazaki, Taku Harada IEEE International Conference on Machine Learning and Applications . 2018

机译：减少深度Q网络尝试开发的次数与面向开发的学习相结合的建议
5. Not Covering but Discovering: Agency and Shared Autonomy in Deep Project-Based Language Learning. [D] . Busciglio, Daniela F. 2015

机译：不涵盖但发现：基于项目的深度语言学习中的代理和共享自主权。
6. Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network [O] . Ning Liu, Ying Liu, Brent Logan, -1

机译：通过深度Q网络从医疗注册数据中学习动态治疗方案
7. Learning Document-Level Label Propagation and Instance Selection by Deep Q-Network for Interactive Named Entity Annotation [O] . Tingming Lu, Yaocheng Gui, Zhiqiang Gao 2021

机译：学习文档级标签传播和Diefy Q-Network的实例选择，用于交互式命名实体注释

Exploitation-Oriented Learning with Deep Learning - Introducing Profit Sharing to a Deep Q-Network

摘要

著录项

相似文献

相关主题

期刊订阅