首页> 外文期刊>Journal of electronic imaging >Real-time high-resolution downsampling algorithm on many-core processor for spatially scalable video coding
【24h】

Real-time high-resolution downsampling algorithm on many-core processor for spatially scalable video coding

机译:用于空间可伸缩视频编码的多核处理器上的实时高分辨率下采样算法

获取原文
获取原文并翻译 | 示例
       

摘要

The progression toward spatially scalable video coding (SVC) solutions for ubiquitous endpoint systems introduces challenges to sustain real-time frame rates in downsampling high-resolution videos into multiple layers. In addressing these challenges, we put forward a hardware accelerated downsampling algorithm on a parallel computing platform. First, we investigate the principal architecture of a serial downsampling algorithm in the Joint-Scalable-Video-Model reference software to identify the performance limitations for spatially SVC. Then, a parallel multicore-based downsampling algorithm is studied as a benchmark. Experimental results for this algorithm using an 8-core processor exhibit performance speedup of 5.25x against the serial algorithm in downsampling a quantum extended graphics array at 1536p video resolution into three lower resolution layers (i.e., Full-HD at 1080p, HD at 720p, and Quarter-HD at 540p). However, the achieved speedup here does not translate into the minimum required frame rate of 15 frames per second (fps) for real-time video processing. To improve the speedup, a many-core based downsampling algorithm using the compute unified device architecture parallel computing platform is proposed. The proposed algorithm increases the performance speedup to 26.14x against the serial algorithm. Crucially, the proposed algorithm exceeds the target frame rate of 15 fps, which in turn is advantageous to the overall performance of the video encoding process. (C) 2015 SPIE and IS&T
机译:面向无处不在的端点系统的空间可伸缩视频编码(SVC)解决方案的发展带来了在将高分辨率视频下采样到多层时维持实时帧速率的挑战。为了应对这些挑战,我们在并行计算平台上提出了一种硬件加速下采样算法。首先,我们研究了联合可伸缩视频模型参考软件中串行下采样算法的主要架构,以确定空间SVC的性能限制。然后,研究了基于并行多核的下采样算法作为基准。该算法使用8核处理器的实验结果显示,在将1536p视频分辨率的量子扩展图形阵列降采样为三个较低分辨率的层(即1080p的Full-HD,720p的HD,和540p分辨率的四分之一高清)。但是,此处实现的提速并未转化为实时视频处理所需的最小每秒15帧(fps)的帧速率。为了提高速度,提出了一种使用计算统一设备架构并行计算平台的基于多核的下采样算法。与串行算法相比,所提出的算法将性能提速提高到26.14倍。至关重要的是,提出的算法超过了15帧/秒的目标帧速率,这又有利于视频编码过程的整体性能。 (C)2015 SPIE和IS&T

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号