An Actor-Critic Deep Reinforcement Learning Approach for Transmission Scheduling in Cognitive Internet of Things Systems

Yang Helin; Xie Xianzhong

Abstract

The cognitive Internet of Things (CIoT) has attracted much interest recently in wireless networks due to its wide applications in smart cities, intelligent transportation systems, and smart metering networks. However, how to smartly schedule the packet transmission in CIoT systems is still a key challenge, that is, how to design a smart agent to realize the intelligent decision making and effective interoperability. In this paper, we model the system state transformation as a Markov decision process, and an actor-critic deep reinforcement learning algorithm based on a fuzzy normalized radial basis function neural network (called AC-FNRBF) is proposed to efficiently solve the intelligent transmission scheduling problem in CIoT systems under high-dimensional variables. The proposed AC-FNRBF algorithm can better approximate both the action function of the actor and the state-action value function of the critic without requiring the system prior knowledge, and a new reward function is established to maximize the system benefit, which jointly takes the transmission packet rate, the system throughput, the power consumption, and the transmission delay into account. Moreover, the AC-FNRBF has the ability to adjust its learning structure and parameters in dynamic environments. Simulation results verify that the proposed algorithm achieves higher transmission packet rate and system throughput with lower power consumption and transmission delay, compared with other existing reinforcement learning algorithms.
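The ingredients named in the abstract (normalized radial basis function features, a reward that trades packet rate and throughput against power and delay, and an actor-critic update) can be illustrated with a minimal sketch. This is not the AC-FNRBF algorithm itself: the fuzzy layer and the adaptive structure are omitted, the critic here estimates a state value rather than the state-action value the abstract describes, and the weighted-sum reward, the Gaussian policy, all function names, and all default parameters are assumptions made purely for illustration.

```python
import math


def normalized_rbf(state, centers, width=1.0):
    """Gaussian RBF activations, normalized so they sum to 1."""
    raw = [
        math.exp(-sum((s - c) ** 2 for s, c in zip(state, center)) / (2.0 * width ** 2))
        for center in centers
    ]
    total = sum(raw)
    if total == 0.0:  # state is far from every center; fall back to uniform
        return [1.0 / len(centers)] * len(centers)
    return [r / total for r in raw]


def reward(packet_rate, throughput, power, delay, weights=(1.0, 1.0, 1.0, 1.0)):
    """Hypothetical reward: weighted benefits minus weighted costs."""
    w_rate, w_thr, w_pow, w_delay = weights
    return w_rate * packet_rate + w_thr * throughput - w_pow * power - w_delay * delay


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def actor_critic_step(actor_w, critic_w, phi, phi_next, action, r,
                      gamma=0.9, sigma=0.5, lr_actor=0.01, lr_critic=0.1):
    """One TD(0) actor-critic update with linear function approximation.

    Critic: V(s) = critic_w . phi(s).
    Actor: Gaussian policy with mean actor_w . phi(s) and fixed std sigma.
    """
    td_error = r + gamma * dot(critic_w, phi_next) - dot(critic_w, phi)
    new_critic = [w + lr_critic * td_error * f for w, f in zip(critic_w, phi)]
    mean = dot(actor_w, phi)
    score = (action - mean) / sigma ** 2  # gradient of log-density w.r.t. the mean
    new_actor = [w + lr_actor * td_error * score * f for w, f in zip(actor_w, phi)]
    return new_actor, new_critic, td_error


if __name__ == "__main__":
    centers = [(0.0, 0.0), (1.0, 1.0)]          # assumed RBF centers
    phi = normalized_rbf((0.0, 0.0), centers)
    r = reward(packet_rate=2.0, throughput=3.0, power=1.0, delay=0.5)
    actor, critic, delta = actor_critic_step([0.0, 0.0], [0.0, 0.0], phi, phi, 1.0, r)
    print(phi, r, delta)
```

In this sketch a positive TD error raises the critic's value estimate for the visited state and moves the actor's mean toward the action that was taken; the weights in `reward` would need tuning to reflect how a real system prioritizes the four metrics.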

Bibliographic details

Source
IEEE Systems Journal | 2020, Issue 1 | pp. 51-60 | 10 pages
Authors
Yang Helin; Xie Xianzhong
Affiliations

Nanyang Technol Univ Sch Elect & Elect Engn Singapore 639798 Singapore;

Chongqing Univ Posts & Telecommun Chongqing Key Lab Comp Network & Commun Technol D Chongqing 400065 Peoples R China

Original format: PDF
Language: English
Keywords
Actor-critic (AC); adaptive fuzzy neural network; cognitive Internet of Things (CIoT); deep reinforcement learning (DRL); transmission scheduling

Similar literature

1. An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand [J]. Ying Cheng-shuo, Chow Andy H. F., Chin Kwai-Sang. Transportation Research Part B: Methodological. 2020, Issue Octa
2. A New Deep-Q-Learning-Based Transmission Scheduling Mechanism for the Cognitive Internet of Things [J]. Jiang Zhu, Yonghui Song, Dingde Jiang. Internet of Things Journal, IEEE. 2018, Issue 4
3. Uplink NOMA-based long-term throughput maximization scheme for cognitive radio networks: an actor-critic reinforcement learning approach [J]. Giang Hoang Thi Huong, Hoan Tran Nhut Khai, Koo Insoo. Wireless Networks. 2021, Issue 2
4. Joint Transaction Transmission and Channel Selection in Cognitive Radio Based Blockchain Networks: A Deep Reinforcement Learning Approach [C]. Nguyen Cong Luong, Tran The Anh, Huynh Thi Thanh Binh. IEEE International Conference on Acoustics, Speech and Signal Processing. 2019
5. Mars: Multi-Scalable Actor-Critic Reinforcement Learning Scheduler [D]. Baheri, Betis. 2020
6. A Graph Convolutional Network-Based Deep Reinforcement Learning Approach for Resource Allocation in a Cognitive Radio Network [O]. Di Zhao, Hao Qin, Bin Song. 2020
7. An Actor-Critic Reinforcement Learning Approach to Minimum Age of Information Scheduling in Energy Harvesting Networks [O]. Shiyang Leng, Aylin Yener. 2021
8. Multi-Objective Reinforcement Learning-Based Deep Neural Networks for Cognitive Space Communications [R]. Ferreira, P. V. R., Paffenroth, R., Wyglinski, A. M. 2017