HSPA;/LTE-A turbo decoder on GPU and multicore CPU

Michael Wu, Guohui Wang, Bei Yin, Christoph Studer, Joseph R. Cavallaro
2013 2013 Asilomar Conference on Signals, Systems and Computers  
This paper compares two implementations of reconfigurable and high-throughput turbo decoders. The first implementation is optimized for an NVIDIA Kepler graphics processing unit (GPU), whereas the second implementation is for an Intel Ivy Bridge processor. Both implementations support max-log-MAP and log-MAP turbo decoding algorithms, various code rates, different interleaver types, and all block-lengths, as specified by HSPA+ and LTE-Advanced. In order to ensure a fair comparison between both
more » ... mplementations, we perform device-specific optimizations to improve the decoding throughput and error-rate performance. Our results show that the Intel Ivy Bridge processor implementation achieves up to 2× higher decoding throughput than our GPU implementation. In addition our CPU implementation requires roughly 4× fewer codewords to be processed in parallel to achieve its peak throughput.
doi:10.1109/acssc.2013.6810402 dblp:conf/acssc/WuWYSC13 fatcat:whqyeveyu5fsfdrixn6tl2zhsu