首页> 外文会议>IEEE International Conference on Application-specific Systems, Architectures, and Processors >An Efficient SIMD Architecture with Parallel Memory for 2D Cosine Transforms of Video Coding
【24h】

An Efficient SIMD Architecture with Parallel Memory for 2D Cosine Transforms of Video Coding

机译:一种高效的SIMD架构,具有用于2D余弦变换的并联存储器的视频编码

获取原文

摘要

This paper proposes an efficient SIMD architecture with parallel memory for 2D cosine transforms of multiple video standards. A novel parallel memory scheme is employed to provide conflict-free parallel access in both horizontal and vertical directions with the successive or even/odd mode, as well as to eliminate data permutation and matrix transposition. Furthermore, application specific instructions are presented to accelerate the transform kernels, such as butterfly and rotate operations with scaling, rounding and clipping. The simulation results show that proposed architecture achieves significant performance improvement with low hardware cost of 3.2K equivalent gate count for parallel memory subsystem (not including SRAMs) and 19.8K for arithmetic units@250MHz in 0.18 μm process.
机译:本文提出了一种高效的SIMD架构,具有用于多视频标准的2D余弦变换的并联存储器。采用新颖的并行存储器方案在连续或偶数/奇数模式下提供水平和垂直方向的无冲突并行访问,以及消除数据置换和矩阵转换。此外,提出了应用特定指令以加速变换核,例如蝴蝶,并通过缩放,舍入和剪切旋转操作。仿真结果表明,该建筑以3.2K等效栅极计数的低硬件成本实现了显着的性能改善,用于平行存储器子系统(不包括SRAM)和19.8K在0.18μm过程中250MHz的算术单元。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号