首页> 外文OA文献 >Implementation of a fully-parallel turbo decoder on a general-purpose graphics processing unit
【2h】

Implementation of a fully-parallel turbo decoder on a general-purpose graphics processing unit

机译:在通用图形处理单元上实现全并行turbo解码器

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Turbo codes comprising a parallel concatenation of upper and lower convolutional codes are widely employed in state-of-the-art wireless communication standards, since they facilitate transmission throughputs that closely approach the channel capacity. However, this necessitates high processing throughputs in order for the turbo code to support real-time communications. In stateof- the-art turbo code implementations, the processing throughput is typically limited by the data dependencies that occur within the forward and backward recursions of the Log-BCJR algorithm, which is employed during turbo decoding. In contrast to the highly-serial Log-BCJR turbo decoder, we have recently proposed a novel Fully Parallel Turbo Decoder (FPTD) algorithm, which can eliminate the data dependencies and perform fully parallel processing. In this paper, we propose an optimized FPTD algorithm, which reformulates the operation of the FPTD algorithm so that the upper and lower decoders have identical operation, in order to support Single Instruction Multiple Data (SIMD) operation. This allows us to develop a novel General Purpose Graphics Processing Unit (GPGPU) implementation of the FPTD, which has application in Software-Defined Radios (SDRs) and virtualized Cloud- Radio Access Networks (C-RANs). As a benefit of its higher degree of parallelism, we show that our FPTD improves the higher processing throughput of the Log-BCJR turbo decoder by between 2.3 and 9.2 times, when employing a high-specification GPGPU. However, this is achieved at the cost of a moderate increase of the overall complexity by between 1.7 and 3.3 times.
机译:在现有技术的无线通信标准中,包括上卷积码和下卷积码的并行级联的Turbo码被广泛采用,因为它们促进了接近信道容量的传输吞吐量。但是,这需要高处理吞吐量才能使Turbo代码支持实时通信。在最新的turbo码实现中,处理吞吐量通常受到在turbo解码期间采用的Log-BCJR算法的前向和后向递归内出现的数据依赖性的限制。与高度串行的Log-BCJR Turbo解码器相反,我们最近提出了一种新颖的全并行Turbo解码器(FPTD)算法,该算法可以消除数据依赖性并执行完全并行处理。在本文中,我们提出了一种优化的FPTD算法,重新设计了FPTD算法的操作,以使上下解码器具有相同的操作,从而支持单指令多数据(SIMD)操作。这使我们能够开发FPTD的新颖的通用图形处理单元(GPGPU)实现,该实现已在软件定义无线电(SDR)和虚拟化云无线电接入网络(C-RAN)中应用。作为其较高并行度的一项优势,我们证明了当采用高规格GPGPU时,FPTD将Log-BCJR Turbo解码器的较高处理吞吐量提高了2.3到9.2倍。但是,这是以总体复杂性适度增加1.7到3.3倍为代价的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号