Implementation of 64-Point FFT
Implementation of 64-Point FFT
Research Paper
Fast Fourier Transform (FFT) processing is one of the key procedure in popular Orthogonal
Frequency Division Multiplexing (OFDM) communication systems. Structured pipeline
architectures, low power consumption, high speed and reduced chip area are the main concerns
in this VLSI implementation. In this paper, the efficient implementation of FFT/IFFT processor
for OFDM applications is presented. The processor can be used in various OFDM-based
communication systems, such as Worldwide interoperability for Microwave access (Wi-Max),
Digital Audio Broadcasting (DAB), Digital Video Broadcasting-Terrestrial (DVB-T). We adopt
single-path delay feedback architecture. To eliminate the Read Only Memories (ROM’s) used to
store the twiddle factors, this proposed architecture applies a reconfigurable complex multiplier
to achieve a ROM-less FFT/IFFT processor and to reduce the truncation error we adopt the
fixed width modified booth multiplier. The three Processing Elements (PE’s), Delay-Line (DL)
buffers are used for computing IFFT. Thus we consume the low power, lower hardware cost,
high efficiency and reduced chip size.
Keywords: FFT/IFFT, Single delay feedback path, OFDM, Pipelined architectures, High power
consumption
57
Int. J. Elec&Electr.Eng&Telecoms. 2013 K Venkata Subba Reddy and K Bala, 2013
Delay Commutator (SDC) and Multi-path equation essentially combines two stages of
Delay Commutator. The advantages of Single- a radix-2 FFT into one, so that half as many
path Delay Feedback (SDF) are (1) This SDF stages are required. To calculate 16-point FFT,
architecture is very simple to implement the the radix-2 takes log216 = 4 stages but the
different length FFT. (2) The required registers radix-4 takes only log416 = 2 stages. A 16-
in SDF architecture is less than MDC and point, radix-4 decimation-in-frequency FFT
SDC architectures. (3) The control unit of SDF algorithm is shown in Figure 1. Its input is in
architecture is easier. We implement the normal order and its output is in digit-reversed
processor in SDF architecture with radix-8 order. It has exactly the same computational
algorithm. There are various algorithms to complexity as the decimation-in-time radix-4
implement FFT, such as radix-2, radix-4 and FFT algorithm.
split-radix with arbitrary sizes, radix-2 algorithm
Figure 1: Flow Graph of a 16-Point
is the simplest one, but its calculation of Radix-4 FFT Algorithm
addition and multiplication is more than radix-
4’s. Though being more efficient than radix-2,
radix-8 only can process 8 n-point FFT. The
radix-8 FFT equation essentially combines
three stages of a radix-2 FFT into one, so that
one third as many stages are required. Since
the radix-8 FFT requires fewer stages and
butterflies than the radix 2 FFT, the
computations of FFT can be further improved.
Using radix-8 algorithm, we propose a 64-
point FFT/IFFT processor with ROM
architecture.
58
Int. J. Elec&Electr.Eng&Telecoms. 2013 K Venkata Subba Reddy and K Bala, 2013
59
Int. J. Elec&Electr.Eng&Telecoms. 2013 K Venkata Subba Reddy and K Bala, 2013
three time stage of SFG of radix-8 butterfly. R2SDF at the last time stage can redeem the
When R2SDF performs a N-point DFT, where R22SDF to Implement all time stage. When
N is a power of 2, every PE of R2SDF the R23SDF implements N- point D FT, where
includes a complex multiplier except the PE N is a power of 8, employing PEs of R23SDF
of penultimate time stage employs a “–j” can completely implement all time stage.
complex multiplier. W hen the R2 2 SDF Except N isn’t a power of 8, adding extra
implements N-point DFT, where N is a power R2SDF PE at the last one remainder time
of 4, employing PEs o f R2 2 SDF can stage or R22SDF PE at two remainder time
completely implement all time stage. Except stage can completely implement operation of
N isn’t a power of 4, adding a extra PE of all time stages.
Table 1: Analyze the Number of Complex Multiplications in SDF with Different Radix
R2SDF
FFT point 64 128 256 512 1024 2048 4096 8192
Constant mul 0 0 0 0 0 0 0 0
Complex mul 4 5 6 7 8 9 10 11
Multiplierarea 131459.2 164324 197188.8 230053.6 262918.4 295783.2 328648 361512.8
R22SDF
FFT point 64 128 256 512 1024 2048 4096 8192
Constant mul 0 0 0 0 0 0 0 0
Complex mul 2 3 3 4 4 5 5 6
Multiplierarea 65729.6 98594.4 98594.4 131459.2 131459.2 164324 164324 197188.8
3
R2 SDF
FFT point 64 128 256 512 1024 2048 4096 8192
Constant mul 2 2 2 3 3 3 4 4
Complex mul 1 2 2 2 3 3 3 4
Multiplierarea 53275.6 86140.4 86140.4 96345.8 129210.6 129210.6 139416 172280.8
Note: Constant and Complex Multiplier bit width: 20 bit, process: umc. 18. Complex Multiplier: 32864.8 um . Constant Multiplier: 10205.4 um2.
2
60
Int. J. Elec&Electr.Eng&Telecoms. 2013 K Venkata Subba Reddy and K Bala, 2013
61