Journal of Supercomputing

Availability-aware and energy-aware dynamic SFC placement using reinforcement learning



Abstract

Software-defined networking and network functions virtualisation are making networks programmable and, consequently, much more flexible and agile. To meet service-level agreements, achieve greater utilisation of legacy networks and faster service deployment, and reduce expenditure, telecommunications operators are deploying increasingly complex service function chains (SFCs). Notwithstanding the benefits of SFCs, increasing heterogeneity and dynamism from the cloud to the edge introduce significant SFC placement challenges, not least adding or removing network functions while maintaining availability and quality of service and minimising cost. In this paper, an availability- and energy-aware solution based on reinforcement learning (RL) is proposed for dynamic SFC placement. Two policy-aware RL algorithms, Advantage Actor-Critic (A2C) and Proximal Policy Optimisation (PPO), are compared using simulations of a ground-truth network topology based on the Rede Nacional de Ensino e Pesquisa network, Brazil's National Teaching and Research Network backbone. The simulation results show that PPO generally outperformed both A2C and a greedy approach in terms of acceptance rate and energy consumption. The biggest difference between PPO and the other algorithms arises at the SFC availability requirement of 99.965%, where the median acceptance rate of PPO is 67.34% better than that of A2C. A2C outperforms PPO only in the scenario where network servers have a greater number of computing resources; in this case, A2C is 1% better than PPO.
