Conference: International Symposium on Parallel Distributed Processing

High Performance MPEG-2 Software Decoder on the Cell Broadband Engine


Abstract

The Sony-Toshiba-IBM Cell Broadband Engine (Cell/B.E.) is a heterogeneous multicore architecture that consists of a traditional microprocessor (PPE) with eight SIMD co-processing units (SPEs) integrated on-chip. Although the Cell/B.E. processor was designed with multimedia applications in mind, there are currently no open-source, optimized implementations of such applications available. In this paper, we present the design and implementation of an optimized MPEG-2 software decoder for this unique parallel architecture and demonstrate its performance through an experimental study. This is the first parallelization of an MPEG-2 decoder for a commodity heterogeneous multicore processor such as the IBM Cell/B.E. While Drake et al. have recently parallelized MPEG-2 using StreamIt for a streaming architecture, our algorithm is quite different and is the first to address the new challenges related to the optimization and tuning of a multicore algorithm with DMA transfers and local store memory. Our design and efficient implementation target the architectural features provided by the heterogeneous multicore processor. We give an experimental study on Sony PlayStation 3 and IBM QS20 dual-Cell Blade platforms. For instance, using 16 SPEs on the IBM QS20, our decoder runs 3.088 times faster than a 3.2 GHz Intel Xeon and achieves a speedup of over 10.545 compared with a PPE-only implementation. Our source code is freely available through SourceForge under the CellBuzz project.
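
As a reading aid only, the two speedup figures quoted in the abstract can be combined into a pair of derived ratios. The sketch below is not from the paper: the function name `implied_ratios` is hypothetical, and it assumes that both speedups were measured on the same input stream and that "N times faster" means a ratio of decode times. Under those assumptions it estimates how the PPE-only baseline compares with the Xeon, and how much of the reported speedup each of the 16 SPEs accounts for on average.

```python
# Figures quoted in the abstract (IBM QS20, 16 SPEs).
SPEEDUP_VS_XEON = 3.088   # decoder vs. a 3.2 GHz Intel Xeon
SPEEDUP_VS_PPE = 10.545   # decoder vs. PPE-only (reported as "over" this value)
NUM_SPES = 16


def implied_ratios(vs_xeon, vs_ppe, num_spes):
    """Derive two illustrative ratios from the reported speedups.

    Assumption (not stated in the abstract): both speedups refer to the
    same workload, so t_ppe / t_cell and t_xeon / t_cell can be divided.
    """
    # t_ppe / t_xeon: how much slower the PPE-only run would be than the Xeon.
    ppe_vs_xeon = vs_ppe / vs_xeon
    # Speedup over the PPE-only baseline divided by the SPE count.
    # This is NOT classical parallel efficiency, because the baseline
    # is the PPE rather than a single SPE.
    per_spe = vs_ppe / num_spes
    return ppe_vs_xeon, per_spe


if __name__ == "__main__":
    ppe_vs_xeon, per_spe = implied_ratios(SPEEDUP_VS_XEON, SPEEDUP_VS_PPE, NUM_SPES)
    print(f"PPE-only vs. Xeon time ratio: {ppe_vs_xeon:.3f}")
    print(f"Speedup over PPE-only per SPE: {per_spe:.3f}")
```

Because the abstract reports the PPE-only speedup as a lower bound ("over 10.545"), both derived values should also be read as lower bounds rather than exact measurements.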
