【24h】

A fast parallel Reed-Solomon decoder on a reconfigurable architecture

机译:可重配置架构上的快速并行Reed-Solomon解码器

获取原文

摘要

This paper presents a software implementation of a very fast parallel Reed-Solomon decoder on the second generation of MorphoSys reconfigurable computation platform, which is targeting on streamed applications such as multimedia and DSP. Numerous modifications of the first-generation of the architecture have made a scalable computation and communication intensive architecture capable of extracting parallelisms of fine grain in instruction level. Many algorithms and the whole digital video broadcasting base-band receiver as well, have been mapped onto the second architecture with impressive performance. The mapping of a Reed-Solomon decoder proposed in the paper highly parallelizes all of its sub-algorithms, including Syndrome Computation, Berlekamp Algorithm, Chein Search, and Error Value Computation, in a SIMD fashion. The mapping is tested on a cycle-accurate simulator, "Mulate", and the performance is encouragingly better than other architectures. The decoding speed of the RS (255,239,16) decoder using two different methods of GF multiplication can be 1.319 Gbps and 2.534 Gbps, respectively. Furthermore, since there is no functionality specifically tailored to Reed-Solomon decoder, the result has demonstrated the capability of MorphoSys architecture to extracting instruction level parallelism from streamed applications.
机译:本文介绍了第二代MorphoSys可重构计算平台上非常快速的并行Reed-Solomon解码器的软件实现,该平台针对流媒体应用,例如多媒体和DSP。对第一代体系结构的许多修改使得可扩展的计算和通信密集型体系结构能够提取指令级细粒度的并行性。许多算法以及整个数字视频广播基带接收器,都以令人印象深刻的性能被映射到第二种架构。本文中提出的Reed-Solomon解码器的映射以SIMD方式高度并行化了其所有子算法,包括综合计算,Berlekamp算法,Chein搜索和错误值计算。该映射在周期精确的模拟器“ Mulate”上进行了测试,其性能令人鼓舞地优于其他体系结构。使用两种不同的GF乘法方法的RS(255,239,16)解码器的解码速度可以分别为1.319 Gbps和2.534 Gbps。此外,由于没有专门针对Reed-Solomon解码器的功能,因此结果证明了MorphoSys体系结构具有从流式应用程序中提取指令级并行性的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号