首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Architecture and applications of the HiPAR video signal processor
【24h】

Architecture and applications of the HiPAR video signal processor

机译:HiPAR视频信号处理器的架构和应用

获取原文
获取原文并翻译 | 示例
           

摘要

We propose the architecture of a highly parallel DSP (HiPAR-DSP) as a flexible and programmable processor for image and video processing. The design is based on an analysis of image processing algorithms in terms of available parallelization resources, demands on program control, and required data access mechanisms. This led to a very long instruction word (VLIW)-controlled ASIMD RISC-architecture with four or sixteen data paths, employing data-level parallelism, parallel instructions, micro-instruction pipelining, and data transfer concurrently to data processing. Common data access patterns for image processing algorithms are supported by use of a shared on-chip memory with parallel matrix type access patterns and a separate data-cache per data path. By properly balancing processing and controlling capabilities as internal and external memory bandwidth, this approach is optimized to make the best use of currently available silicon resources. A high clock frequency is achieved by implementation of classic RISC features. The architecture fully supports high level language programming. With the 16 data path version and a 100 MHz clock, a sustained performance of more than 2 billion arithmetic operations per second (GOPS) is achieved for a wide range of algorithms. The examples show the parallel implementation of image processing algorithms like histogramming, Hough transform, or search in a sorted list with efficient use of the processor resources. A prototype of the architecture with four parallel data paths is available, using a 0.6 μm CMOS technology
机译:我们提出了高度并行DSP(HiPAR-DSP)的体系结构,作为用于图像和视频处理的灵活且可编程的处理器。该设计基于对图像处理算法的分析,包括可用的并行化资源,对程序控制的要求以及所需的数据访问机制。这导致了一个非常长的指令字(VLIW)控制的ASIMD RISC体系结构,具有四个或十六个数据路径,采用了数据级并行性,并行指令,微指令流水线技术,并同时将数据传输到数据处理。通过使用具有并行矩阵类型访问模式的共享片上存储器以及每个数据路径独立的数据缓存,可以支持图像处理算法的通用数据访问模式。通过适当地平衡处理和控制内部和外部存储器带宽的能力,此方法已得到优化,以充分利用当前可用的硅资源。通过实现经典的RISC功能可以实现较高的时钟频率。该体系结构完全支持高级语言编程。凭借16个数据路径版本和100 MHz时钟,对于各种算法,都可实现每秒20亿次算术运算(GOPS)的持续性能。这些示例显示了图像处理算法的并行实现,例如直方图,霍夫变换或在有效使用处理器资源的情况下在排序列表中进行搜索。采用0.6μmCMOS技术的架构原型具有四个并行数据路径

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号