...
首页> 外文期刊>Concurrency and computation: practice and experience >Efficient GPU-based implementation for decoding non-binary LDPC codes with layered and flooding schedules
【24h】

Efficient GPU-based implementation for decoding non-binary LDPC codes with layered and flooding schedules

机译:基于GPU的高效实现,用于使用分层和泛洪调度来解码非二进制LDPC码

获取原文
获取原文并翻译 | 示例

摘要

Nonbinary low-density parity-check (NB-LDPC) codes are excellent error correcting codes andrnoutperform their binary counterparts under the same code length. NB-LDPC decoders arernbased on Belief Propagation Algorithm, which demands intensive message-passing computation.rnRecently, to achieve both flexibility and good throughput performance, NB-LDPC decoders havernbeen ported from dedicated hardware solutions to multi/many-core systems. In this paper, wernpropose an FFT-based q-ary Sum-Product Algorithm (QSPA) decoding architecture forNB-LDPCrncodes with layered and flooding schedules on a graphics processing unit (GPU). To improve thernthroughput performance of the proposed decoder, four optimization methods are presented tornnot only accelerate the decoding kernel execution but also improve the data transfer efficiency.rnThe experiments are mainly accomplished on NVIDIA GTX580 and GTX Titan X. Throughputsrnup to 63 Mbps over GF(16) and 7.65 Mbps over GF(256) are achieved on GTX580 when executingrn5 layered decoding iterations. Throughputs can reach up to 139 Mbps over GF(16) andrn17 Mbps over GF(256) on GTX Titan X. Experimental results show that the speedups of therndecoding throughputs range from×1.7 to×16.8 by comparison with the existing FFT-basedQSPArndecoders on GPU.
机译:非二进制低密度奇偶校验(NB-LDPC)码是出色的纠错码,并且在相同码长下的性能优于其二进制对应码。 NB-LDPC解码器基于信仰传播算法,需要大量的消息传递计算。最近,为了实现灵活性和良好的吞吐量性能,NB-LDPC解码器已从专用硬件解决方案移植到多/多核系统。本文针对图形处理单元(GPU)上具有分层和泛洪调度的NB-LDPCrncode,提出了一种基于FFT的q元求和算法(QSPA)解码架构。为提高解码器的吞吐性能,提出了四种优化方法,不仅可以加快解码内核的执行速度,而且可以提高数据传输效率。实验主要在NVIDIA GTX580和GTX Titan X上完成。吞吐速率在GF上达到63 Mbps(16)当执行rn5分层解码迭代时,GTX580可获得GF(256)上的7.65 Mbps。在GTX Titan X上,吞吐量可达到GF(16)上高达139 Mbps的速度,而GF(256)上则高达17 Mbps。实验结果表明,与GPU上现有的基于FFT的QSPArn解码器相比,解码吞吐量的提升范围为×1.7至×16.8。 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号