首页> 外文OA文献 >An LSPI based reinforcement learning approach to enable network cooperation in cognitive wireless sensor networks
【2h】

An LSPI based reinforcement learning approach to enable network cooperation in cognitive wireless sensor networks

机译:基于LspI的强化学习方法,用于在认知无线传感器网络中实现网络协作

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The number of wirelessly communicating devices increases every day, along with the number of communication standards and technologies that they use to exchange data. A relatively new form of research is trying to find a way to make all these co-located devices not only capable of detecting each other's presence, but to go one step further - to make them cooperate. One recently proposed way to tackle this problem is to engage into cooperation by activating 'network services' (such as internet sharing, interference avoidance, etc.) that offer benefits for other co-located networks. This approach reduces the problem to the following research topic: how to determine which network services would be beneficial for all the cooperating networks. In this paper we analyze and propose a conceptual solution for this problem using the reinforcement learning technique known as the Least Square Policy Iteration (LSPI). The proposes solution uses a self-learning entity that negotiates between different independent and co-located networks. First, the reasoning entity uses self-learning techniques to determine which service configuration should be used to optimize the network performance of each single network. Afterwards, this performance is used as a reference point and LSPI is used to deduce if cooperating with other co-located networks can lead to even further performance improvements.
机译:每天,无线通信设备的数量以及用于交换数据的通信标准和技术的数量都在增加。相对较新的研究形式正在尝试寻找一种方法,以使所有这些位于同一地点的设备不仅能够检测彼此的存在,而且还可以进一步向前发展-使它们相互协作。解决该问题的一种新方法是通过激活“网络服务”(例如互联网共享,避免干扰等)来参与合作,这将为其他位于同一地点的网络带来好处。这种方法将问题减少到以下研究主题:如何确定哪些网络服务将对所有协作网络都有利。在本文中,我们使用称为最小二乘策略迭代(LSPI)的强化学习技术分析并提出了针对此问题的概念性解决方案。提出的解决方案使用了一个自学习实体,该实体在不同的独立和共置网络之间进行协商。首先,推理实体使用自学习技术来确定应使用哪种服务配置来优化每个单个网络的网络性能。然后,将该性能用作参考点,并使用LSPI推断是否与其他位于同一地点的网络合作可以进一步提高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号