Implementation and Evaluation of Reinforcement Learning-based Lightpath Establishment in WDM Networks

Izumi KOYANAGI; Takuji TACHIBANA; Sugang XU

首页> 外文期刊>電子情報通信学会技術研究報告 >Implementation and Evaluation of Reinforcement Learning-based Lightpath Establishment in WDM Networks

【24h】

Implementation and Evaluation of Reinforcement Learning-based Lightpath Establishment in WDM Networks

机译：WDM网络中基于增强学习的光路径建立的实现和评估

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we implement the reinforcement-based lightpath establishment with Generalized Multi--Protocol Label Switching (GMPLS). In this implementation, at first, we modify Path message, which is used in Resource reSerVation Protocol-Traffic Engineering (RSVP-TE), so that the message can support the new processing method on service class. Then, we implement a reinforcement learning component into the control nodes. We also modify the processing of some messages. Each node decides the acceptance or rejection of the lightpath establishment by current state and priority information that is in the received Path message. When the node decides whether the request is accepted or not, the node can learn the optimal policy based on the experience. We build a 3-node tandem network with our implementation. Over this experimental network, we establish and release lightpaths by using the reinforcement-based lightpath establishment, and we evaluate the performance of this implementation. From experimental results, we indicate that we can cut the learning time with simulation results.%本稿では，既存のGMPLSを拡張して，強化学習を用いた光パス設定方式を実装する．本実装では，GMPLSで用いられるRSVP-TEプロトコルに対して，Pathメッセージへの優先度情報の追加を行う．さらに，強化学習コンポーネントの実装と強化学習で得られた最適行動に基づいた送受信処理を実装する．ノードが光パス設定要求を受け取ると，出力リンクの状態とサービスクラスを基に学習した最適行動に従って，設定要求の許可/棄却を決定する．光パスの設定が成功/失敗すると，各ノードは学習を行い，光パス設定が行われるにつれて最適な行動が学習されていく．本実装を適用した3ノードタンデムネットワークを構築し，光パス設定実験を行う．実験結果から，本実装システムを用いてサービスの差別化が行えることを示す．さらに，シミュレーション結果を用いた強化学習時間の短縮法についても調査する．

机译：在本文中，我们使用通用多协议标签交换（GMPLS）实现基于增强的光路建立。在此实现中，首先，我们修改资源消息预留协议流量工程（RSVP-TE）中使用的Path消息，以便该消息可以支持服务类上的新处理方法。然后，我们在控制节点中实现强化学习组件。我们还修改了某些消息的处理。每个节点通过接收到的Path消息中的当前状态和优先级信息来决定是否接受光路。当节点决定是否接受该请求时，该节点可以基于经验来学习最佳策略。我们在实施过程中建立了一个三节点串联网络。在这个实验网络上，我们使用基于增强的光路建立来建立和释放光路，并评估该实现的性能。从实验结果可以看出，通过模拟结果可以缩短学习时间。％本稿では，既存のGMPLSを拡张して，强化学习を用いた光パス设定方式を実装する。 RSVP-TEプロトコルに対して，Pathメッセージへの优先度情报の追加を行う。さらに，强化学习コンポーネントの実装と强化学习で得られた最适作用に基づいた送受信处理を実装する。要求を受け取ると，出力リンクの状态とサービスクラスを基を学习した最适行动に従って，设定要求の许可/弃却を决定する。光パスの设定が成功/失败すると，各ノードは学习を行い，光パス本実装を适用した3ノードタンドタットワークを构筑し，光パス设定実験を行う。実験结果から，本実装シスな用いてサーうスの分解化実行える。さらにとを示す。さらに，シミュレーション结果を用いた强化学习时间の短缩法についても调查する。

著录项

来源
《電子情報通信学会技術研究報告》 |2010年第448期|p.491-496|共6页
作者
Izumi KOYANAGI; Takuji TACHIBANA; Sugang XU;
展开▼
作者单位

Graduate School of Information Science, Nara Institute of Science and Technology 8916-5 Takayama, Ikoma, Nara 630-0192, Japan,New Generation Network Research Center National Institute of Information and Communications Technology 4-2-1 Nukui-kita, Koganei, Tokyo 184-8795, Japan;

Graduate School of Information Science, Nara Institute of Science and Technology 8916-5 Takayama, Ikoma, Nara 630-0192, Japan;

New Generation Network Research Center National Institute of Information and Communications Technology 4-2-1 Nukui-kita, Koganei, Tokyo 184-8795, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
reinforcement learning; Q-learning; dynamic lightpath establishment; service differentiation; GMPLS;

机译：强化学习;Q学习动态光路建立;服务差异化;通用MPLS;

相似文献

外文文献
中文文献
专利

1. Implementation and Evaluation of Reinforcement Learning-based Lightpath Establishment in WDM Networks [J] . Izumi KOYANAGI, Takuji TACHIBANA, Sugang XU 電子情報通信学会技術研究報告. ネットワ-クシステム. Network Systems . 2009,第448期

机译：WDM网络中基于增强学习的光路径建立的实现和评估
2. Distributed Lightpath Establishment based on Reinforcement Learning in WDM Networks - On Providing Service Differentiation and Effective Wavelength Utilization [J] . Izumi KOYANAGI, Takuji TACHIBANA, Kenji SUGIMOTO 電子情報通信学会技術研究報告. ネットワ-クシステム. Network Systems . 2009,第129期

机译：WDM网络中基于强化学习的分布式光路建立-关于提供服务区分和有效的波长利用。
3. Distributed Lightpath Establishment based on Reinforcement Learning in WDM Networks: On Providing Service Differentiation and Effective Wavelength Utilization [J] . Izumi KOYANAGI, Takuji TACHIBANA, Kenji SUGIMOTO 電子情報通信学会技術研究報告 . 2009,第129期

机译：WDM网络中基于强化学习的分布式光路建立：提供服务差异化和有效的波长利用
4. A Reinforcement Learning-Based Lightpath Establishment for Service Differentiation in All-Optical WDM Networks [C] . Koyanagi I., Tachibana T., Sugimoto K. Global Telecommunications Conference, 2009. GLOBECOM 2009 . 2009

机译：基于增强学习的光路径建立，用于全光WDM网络中的服务区分
5. Lightpath reliability in WDM networks. [D] . Yuan, Shengli. 2004

机译：WDM网络中的光路可靠性。
6. Reinforcement Learning-Based Data Forwarding in Underwater Wireless Sensor Networks with Passive Mobility [O] . Haotian Chang, Jing Feng, Chaofan Duan 2019

机译：具有被动移动能力的水下无线传感器网络中基于增强学习的数据转发
7. Virtual Fiber Configuration Method for Dynamic Lightpath Establishment in Large–Scaled WDM Networks [O] . Shinya Ishida, Masayuki Murata 2013

机译：大规模WDm网络中动态光路建立的虚拟光纤配置方法

Implementation and Evaluation of Reinforcement Learning-based Lightpath Establishment in WDM Networks

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅