Online Layered Learning for Cross-layer Optimization of Dynamic Multimedia Systems

机译：在线分层学习用于动态多媒体系统的跨层优化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In our recent work, we proposed a systematic cross-layer framework for dynamic multimedia systems, which allows each layer to make autonomous and foresighted decisions that maximize the system's long-term performance, while meeting the application's real-time delay constraints. The proposed solution solved the cross-layer optimization offline, under the assumption that the multimedia system's probabilistic dynamics (e.g. the application's rate-distortion-complexity behavior) were known a priori, by modeling the system as a layered Markov decision process. In practice, however, these dynamics are unknown a priori and therefore must be learned online. In this paper, we address this problem by allowing the multimedia system layers to learn, through repeated interactions with each other, to autonomously optimize the system's long-term performance at run-time. We propose two reinforcement learning algorithms for optimizing the system under different design constraints: the first algorithm solves the cross-layer optimization in a centralized manner, and the second solves it in a decentralized manner. We analyze both algorithms in terms of their required computation, memory, and inter-layer communication overheads. In our experiments, we demonstrate that decentralized learning can perform equally as well as centralized learning, while enabling the layers to act autonomously. Additionally, we show that existing myopic learning algorithms deployed in multimedia systems perform significantly worse than our proposed foresighted learning methods.

机译：在我们最近的工作中，我们为动态多媒体系统提出了一个系统的跨层框架，该框架允许每一层做出自主和有远见的决策，以最大化系统的长期性能，同时满足应用程序的实时延迟约束。假设多媒体系统的概率动力学（例如应用程序的速率失真复杂性行为）是先验的，则该解决方案可以通过将系统建模为分层的马尔可夫决策过程来离线解决跨层优化问题。然而，实际上，这些动态是先验未知的，因此必须在线学习。在本文中，我们通过允许多媒体系统层通过反复交互来学习，以在运行时自主优化系统的长期性能，从而解决了这一问题。我们提出了两种强化学习算法，用于在不同设计约束下优化系统：第一种算法以集中方式解决跨层优化，第二种算法以分散方式解决。我们根据所需的计算，内存和层间通信开销来分析这两种算法。在我们的实验中，我们证明了分散式学习可以和集中式学习一样出色地执行，同时使各层能够自主地行动。此外，我们表明，部署在多媒体系统中的现有近视学习算法的性能明显比我们提出的有远见的学习方法差。

著录项

来源
《ACM SIGMM conference on multimedia systems 2010》|2010年|P.47-58|共12页
会议地点
作者
Nicholas Mastronarde; Mihaela van der Schaar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类多媒体技术与多媒体计算机;
关键词
cross-layer multimedia system design; layered reinforcement learning; foresighted decision making; cross-layer adaptation to support real-time requirements;

机译：跨层多媒体系统设计;分层强化学习;有远见的决策;跨层适配以支持实时需求;

相似文献

外文文献
中文文献
专利

1. Decomposition Principles and Online Learning in Cross-Layer Optimization for Delay-Sensitive Applications [J] . Fangwen Fu, van der Schaar M. Signal Processing, IEEE Transactions on . 2010,第3期

机译：延迟敏感应用的跨层优化中的分解原理和在线学习
2. A Cross-Layer Delay Differentiation Packet Scheduling Scheme for Multimedia Content Delivery in 3G Satellite Multimedia Systems [J] . Fan L., Du H., Mudugamuwa U., IEEE Transactions on Broadcasting . 2008,第4期

机译：3G卫星多媒体系统中多媒体内容传递的跨层延迟差分分组调度方案
3. Dynamic resource allocation in OFDM systems: an overview of cross-layer optimization principles and techniques [J] . Mathias Bohge, James Gross, Adam Wolisz, IEEE Network . 2007,第1期

机译：OFDM系统中的动态资源分配：跨层优化原理和技术概述
4. Cross-layer optimization of a multimedia streaming system via dynamic programming [C] . Combernoux, Alice, Delestre, Cyrile, Changuel, Nesrine, IEEE International Conference on Image Processing;ICIP 2012 . 2012

机译：通过动态编程对多媒体流系统进行跨层优化
5. Cross-layer optimized wireless multimedia networking. [D] . Wu, Dalei. 2010

机译：跨层优化的无线多媒体网络。
6. A Survey on Multimedia-Based Cross-Layer Optimization in Visual Sensor Networks [O] . Daniel G. Costa, Luiz Affonso Guedes 2011

机译：视觉传感器网络中基于多媒体的跨层优化研究
7. CROSS-LAYER OPTIMIZATION OF A MULTIMEDIA STREAMING SYSTEM VIA DYNAMIC PROGRAMMING [O] . Alice Combernoux, Cyrile Delestre, Nesrine Changuel, 2016

机译：通过动态规划实现多媒体流媒体系统的跨层优化

Online Layered Learning for Cross-layer Optimization of Dynamic Multimedia Systems

摘要

著录项

相似文献

相关主题

期刊订阅