Evolving Systems

Adaptive maximum-lifetime routing in mobile ad-hoc networks using temporal difference reinforcement learning

Abstract

Mobile Ad-hoc NETworks (MANETs) are highly dynamic environments, so a routing protocol for MANETs should be adaptive in order to operate correctly in the presence of variable network conditions. Reinforcement learning (RL) is a technique recently applied to achieve adaptive routing in MANETs. Compared with other machine learning and computational intelligence techniques, RL achieves optimal results at low processing cost and medium memory cost. To address the adaptive energy-aware routing problem in MANETs, an RL-based maximum-lifetime routing strategy is proposed. Each mobile node learns how to adjust the forwarding rate of its route-request packets according to its energy profile. Four temporal-difference RL algorithms are used: Q-learning, SARSA, Q(λ), and SARSA(λ). The proposed RL model is implemented on top of the AODV routing protocol. Simulation results show that the RL-based AODV performs well compared with time-delay-based and probability-based AODV. In particular, the Q-learning-based AODV achieves the best overall performance in terms of energy efficiency and end-to-end delay.
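To make the temporal-difference updates named above concrete, the sketch below shows how a node might use tabular Q-learning or SARSA to choose a route-request (RREQ) forwarding rate from its discretized residual-energy level. This is a minimal illustration, not the paper's actual design: the state and action discretizations, the reward signal, and all parameter values are assumptions made here for clarity.

```python
import random

# Hypothetical sketch: a MANET node learns an RREQ forwarding rate
# from its residual-energy level via tabular temporal-difference RL.
# All constants below are illustrative assumptions, not the paper's values.

ENERGY_LEVELS = 5                        # discretized residual-energy states (assumed)
FORWARD_RATES = [0.25, 0.5, 0.75, 1.0]   # candidate RREQ forwarding rates (assumed)
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.1        # learning rate, discount factor, exploration rate

# Q-table indexed as Q[state][action]
Q = [[0.0] * len(FORWARD_RATES) for _ in range(ENERGY_LEVELS)]

def energy_state(residual, capacity):
    """Map residual battery energy to a discrete state index in [0, ENERGY_LEVELS)."""
    frac = max(0.0, min(residual / capacity, 0.999))
    return int(frac * ENERGY_LEVELS)

def choose_action(state):
    """Epsilon-greedy selection of a forwarding-rate index."""
    if random.random() < EPS:
        return random.randrange(len(FORWARD_RATES))
    row = Q[state]
    return row.index(max(row))

def q_learning_update(s, a, reward, s_next):
    """Off-policy TD(0): Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    td_target = reward + GAMMA * max(Q[s_next])
    Q[s][a] += ALPHA * (td_target - Q[s][a])

def sarsa_update(s, a, reward, s_next, a_next):
    """On-policy TD(0): Q(s,a) += alpha * (r + gamma * Q(s',a') - Q(s,a))."""
    td_target = reward + GAMMA * Q[s_next][a_next]
    Q[s][a] += ALPHA * (td_target - Q[s][a])
```

The two update rules differ only in the bootstrap term: Q-learning bootstraps from the value of the greedy next action, while SARSA bootstraps from the action the node actually takes next. The Q(λ) and SARSA(λ) variants used in the paper extend these updates by propagating each TD error backwards along an eligibility trace over recently visited state-action pairs.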