首页> 外文会议>ACM SIGMM conference on multimedia systems 2010 >Online Layered Learning for Cross-layer Optimization of Dynamic Multimedia Systems
【24h】

Online Layered Learning for Cross-layer Optimization of Dynamic Multimedia Systems

机译:在线分层学习用于动态多媒体系统的跨层优化

获取原文

摘要

In our recent work, we proposed a systematic cross-layer framework for dynamic multimedia systems, which allows each layer to make autonomous and foresighted decisions that maximize the system's long-term performance, while meeting the application's real-time delay constraints. The proposed solution solved the cross-layer optimization offline, under the assumption that the multimedia system's probabilistic dynamics (e.g. the application's rate-distortion-complexity behavior) were known a priori, by modeling the system as a layered Markov decision process. In practice, however, these dynamics are unknown a priori and therefore must be learned online. In this paper, we address this problem by allowing the multimedia system layers to learn, through repeated interactions with each other, to autonomously optimize the system's long-term performance at run-time. We propose two reinforcement learning algorithms for optimizing the system under different design constraints: the first algorithm solves the cross-layer optimization in a centralized manner, and the second solves it in a decentralized manner. We analyze both algorithms in terms of their required computation, memory, and inter-layer communication overheads. In our experiments, we demonstrate that decentralized learning can perform equally as well as centralized learning, while enabling the layers to act autonomously. Additionally, we show that existing myopic learning algorithms deployed in multimedia systems perform significantly worse than our proposed foresighted learning methods.
机译:在我们最近的工作中,我们为动态多媒体系统提出了一个系统的跨层框架,该框架允许每一层做出自主和有远见的决策,以最大化系统的长期性能,同时满足应用程序的实时延迟约束。假设多媒体系统的概率动力学(例如应用程序的速率失真复杂性行为)是先验的,则该解决方案可以通过将系统建模为分层的马尔可夫决策过程来离线解决跨层优化问题。然而,实际上,这些动态是先验未知的,因此必须在线学习。在本文中,我们通过允许多媒体系统层通过反复交互来学习,以在运行时自主优化系统的长期性能,从而解决了这一问题。我们提出了两种强化学习算法,用于在不同设计约束下优化系统:第一种算法以集中方式解决跨层优化,第二种算法以分散方式解决。我们根据所需的计算,内存和层间通信开销来分析这两种算法。在我们的实验中,我们证明了分散式学习可以和集中式学习一样出色地执行,同时使各层能够自主地行动。此外,我们表明,部署在多媒体系统中的现有近视学习算法的性能明显比我们提出的有远见的学习方法差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号