Internet of Things Journal, IEEE

Parallel Reinforcement Learning With Minimal Communication Overhead for IoT Environments



Abstract

Many Internet of Things (IoT) applications require a distributed architecture for decision making, either because no centralized system is available, because connectivity to a centralized system is failure-prone, or because the latency of contacting such a system is too high for real-time applications. These IoT applications often fall in the domain of reinforcement learning (RL), e.g., autonomous robot navigation in smart factories and traffic signal control in smart cities. However, RL-based applications require a long learning time. To overcome this limitation and scale with the number of agents, parallel RL (PRL) algorithms run multiple RL agents in parallel on distributed environments. However, deploying PRL algorithms in such environments entails a communication overhead that increases the (actual) execution time. State-of-the-art PRL algorithms are designed to reduce the learning time while assuming no (or limited) communication overhead. In this article, we present a novel partitioning algorithm that minimizes the communication overhead of PRL running on IoT environments. To the best of our knowledge, this is the first work that focuses on reducing the communication overhead of distributed PRL algorithms without requiring any a priori knowledge about the structure of the problem. The proposed algorithm intelligently combines a dynamic state-partitioning strategy, which exploits the agents' exploration capabilities to build partition knowledge while learning, with an efficient mapping of agents to partitions, which reduces the communication among agents. Performance evaluations show that the proposed algorithm achieves almost no communication among PRL agents at the converged state.
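The abstract's core claim is that communication cost in PRL depends on how states are partitioned and mapped to agents: if each agent mostly transitions within its own partition, few Q-value exchanges are needed. The sketch below is a hypothetical, heavily simplified illustration of that idea, not the paper's algorithm: tabular Q-learning on a 1-D chain MDP is simulated under two fixed state-to-agent mappings, and a message is counted whenever a transition crosses a partition boundary, i.e., whenever the acting agent would need Q-values owned by another agent. The chain environment, the mappings, and all names are our own assumptions for illustration.

```python
import random
from collections import defaultdict

# Hypothetical sketch (not the paper's algorithm): count cross-partition
# "messages" in Q-learning on a 1-D chain, under two state-to-agent mappings.

N_STATES, N_AGENTS, GOAL = 24, 4, 23
ACTIONS = (-1, +1)  # move left / right along the chain

def step(s, a):
    s2 = min(max(s + a, 0), N_STATES - 1)
    return s2, (1.0 if s2 == GOAL else 0.0)

def run(owner, episodes=500, alpha=0.5, gamma=0.95, eps=0.2):
    """Q-learning; returns the number of cross-partition transitions."""
    Q = defaultdict(float)
    messages = 0
    for _ in range(episodes):
        s = random.randrange(N_STATES)
        for _ in range(50):
            if random.random() < eps:
                a = random.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda x: Q[(s, x)])
            s2, r = step(s, a)
            if owner(s) != owner(s2):  # Q-values of s2 live on another agent
                messages += 1
            Q[(s, a)] += alpha * (r + gamma * max(Q[(s2, x)] for x in ACTIONS)
                                  - Q[(s, a)])
            if s2 == GOAL:
                break
            s = s2
    return messages

def interleaved(s):  # mapping that ignores transition locality
    return s % N_AGENTS

def blocks(s):       # contiguous partitions that match chain locality
    return s // (N_STATES // N_AGENTS)

random.seed(0)
print("interleaved mapping:", run(interleaved), "cross-partition messages")
print("block mapping      :", run(blocks), "cross-partition messages")
```

Running the sketch, the interleaved mapping crosses partitions on almost every step, while the block mapping communicates only at the few block boundaries. The paper's dynamic state-partitioning strategy would, in effect, discover such locality-respecting partitions online from the agents' exploration statistics rather than assuming them in advance, which is what drives communication toward zero at the converged state.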
