首页> 外文期刊>Wireless Networks >Uplink NOMA-based long-term throughput maximization scheme for cognitive radio networks: an actor-critic reinforcement learning approach
【24h】

Uplink NOMA-based long-term throughput maximization scheme for cognitive radio networks: an actor-critic reinforcement learning approach

机译:基于上行的基于NOMA的长期吞吐量最大化方案,用于认知无线电网络:演员 - 评论家强化学习方法

获取原文
获取原文并翻译 | 示例
       

摘要

Non-orthogonal multiple access (NOMA) is one of the promising techniques for spectrum efficiency in wireless networks. In this paper, we consider an uplink NOMA cognitive system, where the secondary users (SUs) can jointly transmit data to the cognitive base station (CBS) over the same spectrum resources. Thereafter, successive interference cancellation is applied at the CBS to retrieve signals transmitted by the SUs. In addition, the energy-constrained problem in wireless networks is taken into account. Therefore, we assume that the SUs are powered by a wireless energy harvester to prolong their operations; meanwhile, the CBS is equipped with a traditional electrical supply. Herein, we propose an actor-critic reinforcement learning approach to maximize the long-term throughput of the cognitive network. In particular, by interacting and learning directly from the environment over several time slots, the CBS can optimally assign the amount of transmission energy for each SU according to the remaining energy of the SUs and the availability of the primary channel. As a consequence, the simulation results verify that the proposed scheme outperforms other conventional approaches (such as Myopic NOMA and OMA), so the system reward is always maximized in the current time slot, in terms of overall throughput and energy efficiency.
机译:非正交多次访问(NOMA)是无线网络中的频谱效率的有希望的技术之一。在本文中,我们考虑上行链路NOMA认知系统,其中二次用户(SUS)可以在相同的频谱资源上将数据与认知基站(CBS)共同传输到认知基站(CBS)。此后,在CBS处施加连续的干扰消除以检索由SUS传输的信号。此外,考虑了无线网络中的能量受限问题。因此,我们假设SUS由无线能源收割机提供动力以延长其运营;同时,CBS配备了传统的电源。在此,我们提出了演员 - 评论家强化学习方法,以最大限度地提高认知网络的长期吞吐量。特别地,通过在几个时隙直接从环境中进行交互和学习,CBS可以根据SUS的剩余能量和主要信道的可用性来最佳地分配每个SU的传输能量。因此,仿真结果验证了所提出的方案优于其他传统方法(例如近视NOMA和OMA),因此在整个吞吐量和能效方面,系统奖励总是在当前时隙中最大化。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号