首页> 外国专利> Link Change Decision-Making using Reinforcement Learning based on Tracked Rewards and Outcomes in a Wireless Communication System

Link Change Decision-Making using Reinforcement Learning based on Tracked Rewards and Outcomes in a Wireless Communication System

机译:基于无线通信系统中的跟踪奖励和结果的钢筋学习链路改变决策

摘要

Decision-making equipment (22) is configured for link change decision-making using reinforcement learning. The decision-making equipment (22) is configured to track rewards (30-1, . . . 30-M) earned for, and outcomes (28-1, . . . 28-M) of, respective link change decisions (26-1, . . . 26-M). In some embodiments, possible outcomes of a link change decision to change a serving link of a wireless device to a target link include at least: a change of the serving link of the wireless device from the target link to another link; and a network-initiated disconnect of the wireless device from the target link. Regardless, the decision-making equipment (22) is also configured to make a link change decision (28-(M+1)) based on the tracked rewards (30-1, . . . 30-M) and outcomes (28-1, . . . 28-M).
机译:决策设备(22)配置用于使用加强学习的链路改变决策。 决策设备(22)被配置为跟踪奖励(30-1,...... 30-M),并结果(28-1,。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。 -1,。。。26-m)。 在一些实施例中,链路改变决定将无线设备的服务链路改变为目标链路的可能结果至少包括:从目标链路到另一个链路的无线设备的服务链路的变化; 和网络启动从目标链路的无线设备断开连接。 无论如何,决策设备(22)还被配置为基于跟踪的奖励(30-1,... 30-1,32-1.。。。30-m)和结果进行链路改变决定(28-(m + 1))(28-。 1,。。。28米)。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号