首页> 外文会议>2011 18th IEEE International Conference on Electronics, Circuits and Systems >An efficient VLSI implementation of H.264/AVC intra-frame transcoder
【24h】

An efficient VLSI implementation of H.264/AVC intra-frame transcoder

机译:H.264 / AVC帧内转码器的高效VLSI实现

获取原文
获取原文并翻译 | 示例

摘要

The number of different display terminals increased steadily, from HD TV to mobile phone TV and transcoding has become an indispensable operation in video processing. In the most cases, transcoding has to be done in real time but H.264/AVC intra-frame decoding and encoding contain a set of computation-intensive coding tools forming a loop in which the data are strongly dependant. Parallelization of each function isn''t though effortless. In this paper, we present an optimized transcoding chain for AVC intra-frame stream. The transcoding chain is characterized by several operators based on loop iterations and working on 4×4 luma or 2×2 chroma blocs. This generates heavy latency. Ours approaches uses loop unrolling and data parallelization. A tradeoff is done between critical path and number of cycles in order to improve global latency. The architecture described in this paper includes a powerful CAVLC coder and decoder, an optimized transform-quantization and a frequency selection function for, respectively, requantization and quick decimation of the high frequency values in a quantified coefficient block. This whole system performs an efficient transcoding operation. Our design, thanks to a high parallelization, can decode then recode a video stream in a 1080p format at 30 frames per second (fps) in real time at the frequency of 47Mhz. This design has been implemented in a Virtex 5 FPGA. Each block is fully described giving the surface occupied and the timing diagram.
机译:从高清电视到手机电视,不同显示终端的数量稳步增长,而转码已成为视频处理中必不可少的操作。在大多数情况下,必须实时进行转码,但是H.264 / AVC帧内解码和编码包含一组计算密集型编码工具,这些工具形成一个循环,数据在其中非常依赖。每个函数的并行化并非易事。在本文中,我们提出了一种针对AVC帧内流的优化转码链。转码链的特征是基于循环迭代并在4×4亮度或2×2色度块上工作的多个运算符。这会产生沉重的延迟。我们的方法使用循环展开和数据并行化。在关键路径和周期数之间进行权衡以改善全局延迟。本文描述的架构包括功能强大的CAVLC编码器和解码器,优化的变换量化和频率选择功能,分别用于量化系数块中的高频值的重新量化和快速抽取。整个系统执行有效的代码转换操作。由于高度并行化,我们的设计可以以47Mhz的频率实时解码和重新编码1080p格式的视频流,速度为每秒30帧(fps)。该设计已在Virtex 5 FPGA中实现。充分描述了每个模块,并给出了占用的面积和时序图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号