A Comparison of Two Schemes, Based upon Multi-Level LUTs and Second-Order Recursion, for Parallel Computation of FFT Twiddle Factors

K. Jones*

Retired Consultant Mathematician, Weymouth, Dorset, UK

Submitted on 24 May 2025; Accepted on 12 July 2025; Published on 23 July 2025

To cite this article: K. Jones, “A Comparison of Two Schemes, Based upon Multi-Level LUTs and Second-Order Recursion, for Parallel Computation of FFT Twiddle Factors,” Trans. Appl. Sci. Eng. Technol., vol. 1, no. 1, pp. 1-11, 2025.

Abstract

This paper describes two schemes, together with supporting architectures, for the resource-efficient parallel computation of twiddle factors for the fixed-radix version of the fast Fourier transform (FFT) algorithm. Assuming a silicon-based hardware implementation with suitably chosen parallel computing equipment, the two schemes allow the arithmetic component of the resource requirements, expressed as the numbers of multipliers and adders, to be traded off against the memory component, expressed as the amount of memory required for the look-up tables (LUTs) used for twiddle-factor storage. With a separate processing element (PE) assigned to the computation of each twiddle factor, the first scheme adopts the single instruction multiple data (SIMD) technique, applied in the ‘spatial’ domain, whereby the PEs operate independently upon their own individual LUTs and may thus execute simultaneously. The second scheme adopts the pipelining technique, applied in the ‘temporal’ domain, whereby every LUT-based PE but the first operates by second-order recursion upon previously computed PE outputs. Although the FFT radix and the number of LUT levels (the LUT may be of either single-level or multi-level type) may each take arbitrary integer values, attention is focused here on the radix-4 FFT combined with the two-level LUT, as these choices ease illustration and offer the potential for flexible, computationally efficient FFT designs. A brief comparison of the resource requirements of the two schemes is provided for various parameter sets, catering in particular for big-data, memory-intensive applications involving long (of order one million points) to ultra-long (of order one billion points) FFTs.

Keywords: butterfly; FFT; LUT; parallel; recursion; twiddle factor

Abbreviations: FFT: fast Fourier transform; LUTs: look-up tables; PE: processing element; SIMD: single instruction multiple data; DFT: discrete Fourier transform; DIT: decimation-in-time; DR: digit-reverse; NAT: naturally; DIF: decimation-in-frequency; FPGA: field-programmable gate array; RAM: random access memory; FHT: fast Hartley transform
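To make the two techniques named in the abstract concrete, the following Python sketch (illustrative only; the function names and the perfect-square assumption on N are ours, not the paper's) contrasts a two-level LUT decomposition of the twiddle factor W_N^k = e^{-j2πk/N} with a second-order (Chebyshev-style) recursion seeded from two stored values:

```python
import cmath
import math

def twiddle_two_level_lut(N):
    """Two-level LUT: write the index k = k1*M + k2 with M = sqrt(N), so that
    W_N^k = W_N^(k1*M) * W_N^(k2). Two tables of M entries each replace a
    single table of N entries, at the cost of one complex multiply per access.
    This sketch assumes N is a perfect square."""
    M = math.isqrt(N)
    assert M * M == N, "sketch assumes N is a perfect square"
    coarse = [cmath.exp(-2j * math.pi * k1 * M / N) for k1 in range(M)]  # W_N^(k1*M)
    fine = [cmath.exp(-2j * math.pi * k2 / N) for k2 in range(M)]        # W_N^(k2)

    def w(k):
        k1, k2 = divmod(k % N, M)
        return coarse[k1] * fine[k2]

    return w

def twiddle_recursive(N, count):
    """Second-order recursion: since w[n] = e^{-j*2*pi*n/N} satisfies
    w[n] = 2*cos(2*pi/N)*w[n-1] - w[n-2], all later twiddle factors follow
    from two seed values (here W_N^0 and W_N^1) with one real-by-complex
    multiply and one complex subtract per factor."""
    c = 2.0 * math.cos(2.0 * math.pi / N)
    w_prev2 = complex(1.0, 0.0)              # seed: W_N^0
    w_prev1 = cmath.exp(-2j * math.pi / N)   # seed: W_N^1
    out = [w_prev2, w_prev1]
    for _ in range(count - 2):
        w_next = c * w_prev1 - w_prev2
        out.append(w_next)
        w_prev2, w_prev1 = w_prev1, w_next
    return out[:count]
```

The two-level LUT trades one extra complex multiply per access for a memory reduction from N entries to 2·sqrt(N); the recursion trades memory for a serial dependence and a gradual growth of rounding error, which in a hardware design would typically be bounded by periodic re-seeding from a LUT.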
