Efficient GPU-based implementation for decoding non-binary LDPC codes with layered and flooding schedules

Zhanxian Liu; Rongke Liu; Yi Hou; Hao Peng; Ling Zhao


Abstract

Nonbinary low-density parity-check (NB-LDPC) codes are excellent error-correcting codes and outperform their binary counterparts at the same code length. NB-LDPC decoders are based on the belief propagation algorithm, which demands intensive message-passing computation. Recently, to achieve both flexibility and good throughput, NB-LDPC decoders have been ported from dedicated hardware solutions to multi/many-core systems. In this paper, we propose an FFT-based q-ary Sum-Product Algorithm (QSPA) decoding architecture for NB-LDPC codes with layered and flooding schedules on a graphics processing unit (GPU). To improve the throughput of the proposed decoder, four optimization methods are presented that not only accelerate the decoding kernel execution but also improve the data transfer efficiency. The experiments are mainly carried out on the NVIDIA GTX580 and GTX Titan X. Throughputs of up to 63 Mbps over GF(16) and 7.65 Mbps over GF(256) are achieved on the GTX580 when executing 5 layered decoding iterations. Throughputs reach up to 139 Mbps over GF(16) and 17 Mbps over GF(256) on the GTX Titan X. Experimental results show speedups in decoding throughput ranging from 1.7× to 16.8× compared with existing FFT-based QSPA decoders on GPU.
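The abstract does not spell out the decoder's internals, so the following is only a minimal sketch of the general idea behind FFT-based QSPA, not the authors' GPU implementation. Over GF(2^m), symbol addition is a bitwise XOR of the symbols' binary representations, so the check-node combination of message distributions is an XOR-convolution, which the Walsh-Hadamard transform turns into a pointwise product. The function names (`wht`, `combine_fft`, `combine_direct`) are hypothetical, and the sketch assumes all parity-check coefficients equal 1 (no edge permutations) and combines every message passed to it rather than excluding the target edge.

```python
def wht(values):
    """Unnormalized fast Walsh-Hadamard transform of a length-2^m list."""
    a = list(values)
    n = len(a)
    h = 1
    while h < n:
        for start in range(0, n, 2 * h):
            for i in range(start, start + h):
                x, y = a[i], a[i + h]
                a[i], a[i + h] = x + y, x - y  # butterfly step
        h *= 2
    return a


def combine_fft(pmfs):
    """Distribution of the GF(2^m) sum of independent symbols (transform domain).

    Assumption: every parity-check coefficient is 1, so no permutation
    of the message entries is applied before or after the transform.
    """
    q = len(pmfs[0])
    prod = [1.0] * q
    for pmf in pmfs:
        transformed = wht(pmf)
        prod = [u * v for u, v in zip(prod, transformed)]
    # The inverse transform is the same butterfly network scaled by 1/q.
    return [v / q for v in wht(prod)]


def combine_direct(pmfs):
    """Reference O(q^2) XOR-convolution, used only to check combine_fft."""
    q = len(pmfs[0])
    out = [1.0] + [0.0] * (q - 1)  # point mass on symbol 0
    for pmf in pmfs:
        nxt = [0.0] * q
        for i, u in enumerate(out):
            for j, v in enumerate(pmf):
                nxt[i ^ j] += u * v  # GF(2^m) addition is XOR
        out = nxt
    return out


if __name__ == "__main__":
    # Three illustrative messages over GF(4); the values are made up.
    msgs = [
        [0.7, 0.1, 0.1, 0.1],
        [0.4, 0.3, 0.2, 0.1],
        [0.25, 0.25, 0.4, 0.1],
    ]
    print(combine_fft(msgs))
    print(combine_direct(msgs))
```

The transform-domain version costs O(q log q) per message instead of O(q^2) per pairwise convolution, and each butterfly stage is data-parallel, which is one reason this formulation is commonly mapped to GPU threads.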


Bibliographic details

Source
Concurrency and Computation: Practice and Experience | 2018, Issue 16 | e4442.1-e4442.13 | 13 pages
Authors
Zhanxian Liu; Rongke Liu; Yi Hou; Hao Peng; Ling Zhao
Author affiliations

School of Electronic and Information Engineering, Beihang University, Beijing, China (all five authors)

Original format: PDF
Language: English
Keywords
CUDA; flooding; GPU; layered; nonbinary LDPC; synchronization;


