Deep reinforcement learning: Framework, applications, and embedded implementations: Invited paper

机译：深度强化学习：框架，应用程序和嵌入式实现：邀请论文

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The recent breakthroughs of deep reinforcement learning (DRL) technique in Alpha Go and playing Atari have set a good example in handling large state and actions spaces of complicated control problems. The DRL technique is comprised of (i) an offline deep neural network (DNN) construction phase, which derives the correlation between each state-action pair of the system and its value function, and (ii) an online deep Q-learning phase, which adaptively derives the optimal action and updates value estimates. In this paper, we first present the general DRL framework, which can be widely utilized in many applications with different optimization objectives. This is followed by the introduction of three specific applications: the cloud computing resource allocation problem, the residential smart grid task scheduling problem, and building HVAC system optimal control problem. The effectiveness of the DRL technique in these three cyber-physical applications have been validated. Finally, this paper investigates the stochastic computing-based hardware implementations of the DRL framework, which consumes a significant improvement in area efficiency and power consumption compared with binary-based implementation counterparts.

机译：深度强化学习（DRL）技术在Alpha Go和Atari游戏中的最新突破为处理大型状态和复杂控制问题的动作空间树立了典范。 DRL技术包括（i）离线深度神经网络（DNN）构建阶段，该阶段得出系统的每个状态-动作对及其值函数之间的相关性，以及（ii）在线深度Q学习阶段，自适应地得出最佳行动并更新价值估算值。在本文中，我们首先介绍了通用DRL框架，该框架可广泛用于具有不同优化目标的许多应用程序。接下来介绍了三个特定的应用程序：云计算资源分配问题，住宅智能电网任务调度问题以及建筑物HVAC系统最佳控制问题。 DRL技术在这三种网络物理应用程序中的有效性已得到验证。最后，本文研究了DRL框架的基于随机计算的硬件实现，与基于二进制的实现相比，DRL框架在面积效率和功耗上有显着改善。

著录项

来源
《IEEE/ACM International Conference on Computer-Aided Design》|2017年|847-854|共8页
会议地点
作者
Hongjia Li; Tianshu Wei; Ao Ren; Qi Zhu; Yanzhi Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Servers; Learning (artificial intelligence); Hardware; Stochastic processes; Machine learning; Resource management;

机译：服务器;学习（人工智能）;硬件;随机过程;机器学习;资源管理;

相似文献

外文文献
中文文献
专利

1. Deep reinforcement learning: Algorithm, applications, and ultra-low-power implementation [J] . Li Hongjia, Cai Ruizhe, Liu Ning, Nano communication networks . 2018,第JUNa期

机译：深度强化学习：算法，应用程序和超低功耗实现
2. The Implementation of Deep Reinforcement Learning in E-Learning and Distance Learning: Remote Practical Work [J] . Abdelali El Gourari, Mustapha Raoufi, Mohammed Skouri, Mobile information systems . 2021,第a期

机译：电子学习和远程学习深度加强学习的实施：远程实践工作
3. Reinforcement Renaissance The power of deep neural networks has sparked renewed interest in reinforcement learning, with applications to games, robotics, and beyond [J] . Krakovsky Marina Communications of the ACM . 2016,第8期

机译：强化文艺复兴深度神经网络的力量激发了人们对强化学习及其在游戏，机器人技术及其他领域的应用的新兴趣。
4. Deep reinforcement learning: Framework, applications, and embedded implementations: Invited paper [C] . Hongjia Li, Tianshu Wei, Ao Ren, IEEE/ACM International Conference on Computer-Aided Design . 2017

机译：深度钢筋学习：框架，应用和嵌入式实现：邀请纸
5. Methods and Applications of Deep Reinforcement Learning for Chemical Processes [D] . Hubbs, Christian D. 2021

机译：深增强学习的化学过程的方法和应用
6. A framework for learning about improvement: embedded implementation and evaluation design to optimize learning [O] . Danika Barry, Leighann E Kimble, Bejoy Nambiar, -1

机译：学习改进的框架：嵌入式实现和评估设计以优化学习
7. Knowledge-Assisted Deep Reinforcement Learning in 5G Scheduler Design: From Theoretical Framework to Implementation [O] . Zhouyou Gu, Changyang She, Wibowo Hardjawana, 2021

机译：5G调度设计中的知识辅助深度加固学习：从理论框架到实施

Deep reinforcement learning: Framework, applications, and embedded implementations: Invited paper

摘要

著录项

相似文献

相关主题

期刊订阅