Advanced Training Course On FPGA Design and VHDL For Hardware Simulation and Synthesis
Advanced Training Course On FPGA Design and VHDL For Hardware Simulation and Synthesis
Massimiliano Nolich
DEEI
Facolta' di Ingegneria Universita' degli Studi di Trieste
via Valerio, 10, 34127
Trieste
Italy
The Discrete Fourier Transform
The discrete-time Fourier transform (DTFT) of a sequence is a continuous
function of !, and repeats with period 2. In practice we usually want to
obtain the Fourier components using digital computation, and can only
evaluate them for a discrete set of frequencies. The discrete Fourier transform
(DFT) provides a means for achieving this.
The DFT is itself a sequence, and it corresponds roughly to samples, equally
spaced in frequency, of the Fourier transform of the signal. The discrete
Fourier transform of a length N signal xŒn, n D 0; 1; : : : ; N 1 is given by
N
X1
j.2=N /k n
XŒk D xŒne :
nD0
When dealing with the DFT, it is common to define the complex quantity
j.2=N /
WN D e :
An important property of the DFT is that it is cyclic, with period N , both in the
1
discrete-time and discrete-frequency domains. For example, for any integer r,
N
X1 N
X1
XŒk C rN D xŒnWN.kCrN /n D xŒnWNk n .WNN /rn
nD0 nD0
N
X1
D xŒnWNk n D XŒk;
nD0
n mod N D ..n//N D l:
xŒn
n
0 N
xŒn
Q
n
0 N
Similarly, the periodic extension of XŒk is defined to be
Q
XŒk D XŒk mod N D XŒ..k//N :
2
It is sometimes better to reason in terms of these periodic extensions when
dealing with the DFT. Specifically, if XŒk is the DFT of xŒn, then the inverse
DFT of XŒk is xŒn.
Q The signals xŒn and xŒn
Q are identical over the interval 0
to N 1, but may differ outside of this range. Similar statements can be made
regarding the transform XŒk.
D
Linearity: ax1 Œn C bx2 Œn !aX1 Œk C bX2 Œk.
D
Circular time shift: xŒ..n m//N !WNkm XŒk.
Circular convolution:
N
X1 D
x1 Œmx2 Œ..n m//N !X1 ŒkX2 Œk:
mD0
3
Modulation:
N 1
D 1 X
x1 Œnx2 Œn ! X1 ŒlX2 Œ..k l//N :
N
lD0
Some of these properties, such as linearity, are easy to prove. The properties
involving time shifts can be quite confusing notationally, but are otherwise
quite simple. For example, consider the 4-point DFT
3
X
XŒk D xŒnW4k n
nD0
since W44k D W40k . This can be seen to be the DFT of the sequence
xŒ3; xŒ0; xŒ1; xŒ2, which is precisely the sequence xŒn circularly shifted to
the right by one sample. This proves the time-shift property for a shift of
length 1. In general, multiplying the DFT of a sequence by WNkm results in an
N-point circular shift of the sequence by m samples. The convolution
properties can be similarly demonstrated.
It is useful to note that the circularly shifted signal xŒ..n m//N is the same
as the linearly shifted signal xŒn
Q m, where xŒn
Q is the N-point periodic
extension of xŒn.
4
xŒn
n
0 N
xŒn
Q
n
0 N
xŒn
Q m
n
0 N
xŒ..n m//N
n
0 N
x1 Œn x2 Œn
n n
0 N 0 N
5
the circular convolution x3 Œn D x1 Œn
N x2 Œn is the signal xŒn
Q delayed by
two samples, evaluated over the range 0 to N 1:
x3 Œn
n
0 N
x3 Œn D x1 Œn
N x2 Œn D N; 0nN 1:
Suppose now that x1 Œn and x2 Œn are considered to be length 2L sequences by
augmenting with zeros. The N D 2L-point circular convolution is then seen to
be the same as the linear convolution of the finite-duration sequences x1 Œn and
x2 Œn:
6
x1 Œn D x2 Œn
n
0 L N
L
x1 Œn
N x2 Œn
n
0 L N
Consider a sequence x1 Œn with length L points, and x2 Œn with length P
points. The linear convolution of the sequences,
1
X
x3 Œn D x1 Œmx2 Œn m;
mD 1
7
2
x [n]
1
0
0 L
1
x [n]
2
0
0 P
8
x [n]
3
0
0 L+P−1
n
It is easy to see that the circular convolution product will be equal to the linear
convolution product on the interval 0 to N 1 as long as we choose
N L C P 1. The process of augmenting a sequence with zeros to make it
of a required length is called zero padding.
8
h[n]
0 P
x[n]
0 L 2L 3L
n
The solution is to use block convolution, where the signal to be filtered is
segmented into sections of length L. The input signal xŒn, here assumed to be
causal, can be decomposed into blocks of length L as follows:
1
X
xŒn D xr Œn rL;
rD0
where 8
<xŒn C rL 0nL 1
xr Œn D
:0 otherwise:
x0[n]
0 L
n
x1[n]
L 2L
n
x2[n]
2L 3L
n
9
The convolution product can therefore be written as
1
X
yŒn D xŒn hŒn D yr Œn rL;
rD0
0 L+P−1
n
y1[n]
L
n
y2[n]
2L
n
Since the sequences xr Œn have only L nonzero points and hŒn is of length P ,
each response term yr Œn has length L C P 1. Thus linear convolution can
be obtained using N-point DFTs with N L C P 1. Since the final result is
obtained by summing the overlapping output regions, this is called the
overlap-add method.
y[n]
0 L 2L 3L
10
impulse response hŒn with an L-point segment xr Œn. The portion of the
output that corresponds to linear convolution is then identified (consisting of
L .P 1/ points), and the resulting segments patched together to form the
output.
0.5
r
0
0 N−1
n
The windowed signal is then xw Œn D xŒnwr Œn. The DTFT of this windowed
signal is given by
1
X N
X1
Xw .e j! / D xw Œne j!n
D xw Œne j!n
:
nD 1 nD0
11
it is evident that
Xw Œk D Xw .e j! /ˇ!D2k=N :
ˇ
The values of the DFT Xw Œk of the signal xw Œn are therefore periodic
samples of the DTFT Xw .e j! /, where the spacing between the samples is
2=N . Since the relationship between the discrete-time frequency variable and
the continuous-time frequency variable is ! D T , the DFT frequencies
correspond to continuous-time frequencies
2k
:
k D
NT
The DFT can therefore only be used to find points on the DTFT of the
windowed signal xw Œn of xŒn.
The operation of windowing involves multiplication in the discrete time
domain, which corresponds to continuous-time periodic convolution in the
DTFT frequency domain. The DTFT of the windowed signal is therefore
Z
j! 1
Xw .e / D X.e j /W .e j.! / /d;
2
where W .e j! / is the frequency response of the window function. For a simple
rectangular window, the frequency response is as follows:
1
wr Œn
0.5
0
0 8
jWr .e j! /j
0
0
!
The DFT therefore effectively samples the DTFT of the signal convolved with
the frequency response of the window.
12
Example: Spectrum analysis of sinusoidal signals Suppose we have the
sinusoidal signal combination
1
xŒn
0
−1
0 8
1
wr Œn
0.5
0
0 8
1
xw Œn
0
−1
0 8
n
13
jX.e j! /j
0.5
0 2 2
3 3
0 3 3
jWr .e j! /j
0 2 2
3 3
0 3 3
jXw .e j! /j
0 2 2
3 3
0 3 3
!
The operation of windowing therefore limits the ability of the Fourier
transform to resolve closely-spaced frequency components. When the DFT is
used for spectrum estimation, it effectively samples the spectrum of this
modified signal at the locations of the crosses indicated:
6
jXŒkj
0
0 1 2 3 4 5 6 7
k
Note that since k D 0 corresponds to ! D 0, there is a corresponding shift in
the sampled values.
In general, the elements of the N -point DFT of xw Œn contain N evenly-spaced
samples from the DTFT Xw .e j! /. These samples span an entire period of the
DTFT, and therefore correspond to frequencies at spacings of 2=N . We can
obtain samples with a closer spacing by performing more computation.
14
Suppose we form the zero-padded length M signal xM Œn as follows:
8
<xŒn 0nN 1
xM Œn D
:0 N n M 1:
The sample Xp Œk can therefore be seen to correspond to the DTFT of the the
windowed signal xw Œn at frequency !k D 2k=M . Since M is chosen to be
larger than N , the transform values correspond to regular samples of Xw .e j! /
with a closer spacing of 2=M . The following figure shows the magnitude of
the DFT transform values for the 8-point signal shown previously, but
zero-padded to use a 32-point DFT:
1
xM Œn
0
−1
0 5 10 15 20 25 30
n
5
jXŒkj
0
0 5 10 15 20 25 30
k
Note that this process increases the density of the samples, but has no effect on
the resolution of the spectrum.
If W .e j! / is sharply peaked, and approximates a Dirac delta function at the
15
origin, then Xw .e j! / X.e j! /. The values of the DFT then correspond quite
accurately to samples of the DTFT of xŒn. For a rectangular window, the
approximation improves as N increases:
1
wr Œn
0.5
0
0 32
jWr .e j! /j
32
0
0
jXw .e j! /j
0
0
!
The magnitude of the DFT of the windowed signal is
15
jXŒkj
10
5
0
0 5 10 15 20 25 30
k
which is clearly easier to interpret than for the case of the shorter signal. As
the window length tends to 1, the relationship becomes exact.
The rectangular window inherent in the DFT has the disadvantage that the peak
sidelobe of Wr .e j! / is high relative to the mainlobe. This limits the ability of
the DFT to resolve frequencies. Alternative windows may be used which have
preferred behaviour — the only requirement is that in the time domain the
16
window function is of finite duration. For example, the triangular window
1
wr Œn
0.5
0
0 32
jWr .e j! /j
0
0
jXw .e j! /j
0
0
!
0
0 5 10 15 20 25 30
k
Here the sidelobes have been reduced at the cost of diminished resolution —
the mainlobe has become wider.
The method just described forms the basis for the periodogram spectrum
estimate. It is often used in practice on account of its perceived simplicity.
However, it has a poor statistical properties — model-based spectrum
estimates generally have higher resolution and more predictable performance.
Finally, note that the discrete samples of the spectrum are only a complete
17
representation if the sampling criterion is met. The samples therefore have to
be sufficiently closely spaced.
If the factors W8k n have been calculated in advance (and perhaps stored in a
lookup table), then the calculation of XŒk for each value of k requires 8
complex multiplications and 7 complex additions. The 8-point DFT therefore
requires 8 8 multiplications and 8 7 additions. For an N-point DFT these
become N 2 and N.N 1/ respectively. If N D 1024, then approximately one
million complex multiplications and one million complex additions are
required.
The key to reducing the computational complexity lies in the observation that
the same values of xŒnW8k n are effectively calculated many times as the
computation proceeds — particularly if the transform is long.
The conventional decomposition involves decimation-in-time, where at each
stage a N-point transform is decomposed into two N=2-point transforms. That
18
is, XŒk can be written as
N=2 1 N=2 1
.2rC1/k
X X
XŒk D xŒ2rWN2rk C xŒ2r C 1WN
rD0 rD0
N=2 1 N=2 1
X X
D xŒ2r.WN2 /rk C WNk xŒ2r C 1.WN2 /rk :
rD0 rD0
19