Signal Processing
Signal Processing
Seismology
Diego Quiros, Ph.D.
Lecturer in Geophysics
September 2022
Geophysics Module
Basics of Signal Processing
Topics Spectrogram
1. Signal Processing
frequency
1. Introduction
3. Fourier transform
5. Spectrograms
6. Sampling theorem
7. Frequency filtering
For example in seismology we can treat either a continuous (analog) record of ground motion or the
digital data that results from representing the ground motion as being sampled at discrete inter-
vals, providing numbers that can be manipulated using a computer.
In general terms, we can think of f i l t e r i ng a signal, or applying some opperation that modifies the
signal. For example, a seismometer is a filter, in that it yields a record of ground motion that differs
from the actual ground motion.
In this lecture we will extend these ideas by considering mathematical approaches that are common
to such applications and see how these approaches give extraordinary insights into physical
processes.
Fourier Series
In seismology we use an approach based on the idea that any t i m e series can be decomposed into
a sum of sines and cosines (sinusoids). Each sinusoid having a particular amplitude, frequency, and
phase, this is known as Fourier analysis. Fourier Terms
0
Let's first consider the decomposition o f a signal with
finite duration into a Fourier series, or explicitly a sum of
harmonic (sinusoidal) components with different
frequencies. The Fourier series for an a r b i t r a r y function 2
Term number
(e.g., seismogram of an earthquake) of time f(t) defined
over the interval -T/2 < t < T/2 is
∞ 4
(
∞
( Σb
f(t) = a0 + Σan cos 2n π t
(
+ n
sin 2n π t
(
n T n T
We can decompose any time series in this way, and study -T/2 T/2
each sinusoid component. Time
Figure. Successive terms of a Fourier
series. Solid lines are sine function and
dashed lines are cosines.
Fourier Series Example
Let look at an example using the concepts from the previous page. Imagine different sinusoids each
with a specific frequency, peak amplitude, and phase, these are shown in the figure. The motions
represented by these different Fourier terms can be sum in the time domain.
Frequency (Hz)
transform the motion from frequency to
time. This t r a n s f o r m a t i o n is reversible,
since the time-domain signal (trace at the 48
bottom) can be broken d o w n (analyzed)
into its sinusoidal components in the
frequency domain. 32
∫
(going from the time ∞
∫ f(t) = 1
domain to the freq. domain)
F(ω) e iωt dω
F(ω) = f(t) e-iωt dt 2π
-∞ -∞
An important feature of the transform and inverse transform is that their dimensions are different.
For example if f(t) is a seismogram that has dimensions of displacement, its transform F(ω) has di-
mensions of displacement multiplied by time (from the dt term). Thus if f(t) gives the ground motion
in centimeters, F(ω) gives the transform of ground motion in centimeter-seconds.
The Fourier transform, a complex-valued function, can be written in terms of two real-valued func-
tions
F(ω) = ǀF(ω)ǀ eiφ(ω)
where ǀF(ω)ǀ is called the a m p l i t u d e spectrum and φ(ω) is called the phase spectrum.
The Amplitude Spectrum
In many applications only the amplitude spectrum is shown because it indicates how the energy
(amplitude squared) in the time series (e.g., seismogram) depends on frequency.
The figure below shows a seismogram for an earthquake and below two panels that include only
body waves, and surface waves.
Vanuatu earthquake (Ms 6.5)
(p-wave coda)
Amplitude
0.5
0.0
0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14
Frequency (Hz)
surface waves have lower frequencies.
Time domain vs Frequency domain
Looking at the seismogram in the previous page we see that the surface waves contain
longer-period energy than the body waves (i.e., body waves have higher frequencies). The spectra
actually shows this, body waves are dominated by energy with frequencies between 0.08 - 0.1 Hz
whereas the surface wave is dominated by energy with frequencies between 0.05 - 0.07 Hz.
The Fourier transform F(ω) is another way of representing the time series f(t). We speak of f(t) as
being in the t i m e domain and F(ω) as being in the f r e que nc y domain. The two representations are
equivalent because we can easily convert data from one domain to the other without losing any
information.
The transform and its inverse relate a function of time to a function of angular frequency. Similar
relations apply between other pairs of variables, in seismology for example the other commonly used
pair is distance and spatial f r e que nc y (wavenumber).
(lamda) equals 1/k(spatial freq./wavenumber) --> could also write velocity in terms of freg. and spatial freq. -> V equals f/k
Creating a spectrogram is done via the FFT. The data in the time domain is
broken into chunks, which usually overlap, the chunks are Fourier
transformed to calculate the Amplitude Spectra for each. Each chunk then
corresponds to a vertical line in the image, these spectra are then laid side
by side to create the spectrogram graph.
The figure shows an example. The seismogram on top was divided into
very small time chunks and Fourier transformed to get the spectrogram.
At the bottom the amplitude spectra of the entire seismogram (full 500 s)
is shown for comparison. Figure. Seismogram,
spectrogram, and
to make a spectogram -> first break the ampl vs time seismogram into distinct time intervals and create a amplitude amplitude spectra.
sepctra diagram of all time intervals (each time interval will have a amplitude and freq signal to be used in each time interval). Then you put all of these together
and flip the axis to get freq vs. time to get the spectrogram (you represent the amplitudes using colours -> high amplitudes as yellow, low ampl.s as blue)
Analog vs. Digital
A seismic signal is a continuous time function. In digital recording, the continuous (analog) seismic
signal is sampled at a fixed rate in time, called the sampling i nt e r v a l (or sampling rate). The figure
below shows a continuous signal in time. The discrete samples that might actually be recorded are
shown by dots. The bottom curve in the figure is an a t t e m p t e d reconstruction of the original ana-
log signal, which is shown as the curve on top.
Note that the reconstructed signal lacks the details present in the original analog signal. These
details correspond to high-frequency components that were lost by sampling. If a smaller sampling
interval were chosen, then the reconstructed signal would more accurately represent the original
signal.
Nyquist Frequency
Is there a measure of the highest restorable frequency of the digitized data?
f Nyq = 1
2Δt
Note Interpolation does not recover the frequencies lost by sampling; it only generates extra
samples.
Frequency Aliasing
What happens when an analog signal that is digitized with sampling rate Δt, originally had frequen-
cies above the Nyquist for that particular sampling rate?
Let's look at an example. Start with a signal in the time-domain made of a 1 Hz and a 22 Hz
sinusoids on top of a normal-distributed noise floor.
where f(t) can be the part of the signal we seek to identify re-
Cc(L) = lim
T ∞ T
1
-T/2
∫ x(t) f(t + L) dt
peated in the seismogram (e.g., direct P), and the remaining por-
tion of the time series is x(t). You can also think of f(t) as one
signal and x(t) as the other. Cc(L) is the cross-correlation of x(t) and f(t) and it measures the similari-
t y between the two signals by shifting f(t) by different lag times, L, and evaluating the integral of the
product as a function of L. The lag for which Cc(L) is maximum is the time shift that makes the two
functions most similar.
In the case of discrete time series we can write a discrete form of Cc(L). Imagine two time series xm
and fm. In discrete f o r m Ck we simply write N
Cc(k) = ∑
m=-N
xm f m + k
-∞
We can use deconvolution to describe a seismogram as the output
resulting from sending a source signal through a set of linear systems such as the effects of earth
structure on a earthquake signal, and the effect of the seismometer on the signal itself. This is gener-
ally written as u(t) = x(t) * g(t) * i(t), where the * signifies convolution.
An important property of convolution is that is commutative x(t) * g(t) = g(t) * x(t), unlike cross-cor-
relation. In discrete form is written as N
Cv(k) = ∑
m=-N
xm f k - m
very similar to cross-correlation, except you flip one
of the time series
An example of the convolution of two time series xm = [1, -1/2], f m = [1, 0, 1/2].
We start by reversing the time series that is going to be Reflectivity Output
shifted xm , line up the last element of x-m with the first Sequence Response
element of f m . Multiply the elements of both series
1
and shift x -m . We follow the same procedure as in the 1 0 2
cross-correlation example. 1 − 2 1 1
− 1 − 1
2 1 2
1 1
− 2 1 2
− 1 − 1
2 1 4
Convolution is a way of representing a mixture -> could use this to study the earth, or even a soup.
Properties of Convolution
similarly,
A consequence of this property is that frequency filtering can be applied in the frequency domain as
multiplication or in the time domain as convolution.
Cv(t)
When multiple seismic stations are available, seismic processing operations can be defined to operate
on several seismic traces simultaneously. This type of processing is called multichannel processing.
The 2D Fourier transform is an operation that can work on multiple seismic traces at a time. Imagine
having a line of seismometers spaced apart with constant interstation distance. This type of seismic
networ is sampling the earthquake signal in both time and space.
Before we defined frequency, particularly temporal frequency or the number of cycles per unit time.
Temporal frequency is the Fourier dual of the time variable. However, a seismic wavefield is not only a
function of time, but also a function of a space variable (distance). The fourier dual for the space vari-
able is defined as spatial frequency, which is the number of cycles per unit distance or wavenumber
(k).
Just as the temporal Nyquist frequency, the Nyquist wavenumber (k) is defined as
kNyq = 1
2Δx
where Δx is the spatial sampling interval (spatial sampling rate).
How to compute the Wavenumber k?
12 15
The top panels of the figure on the right are seismic sections.
Each vertical trace records a 12 Hz sinusoid. You can check this by
counting the number of peaks on the last trace of each section
numbered 12 and 15. The numbers on the top refer to the dip (slope
time (s)
or 1/v) in the horizontal direction (e.g. 12 ms/trace, or 15 ms/trace).
First we need the total time difference along a peak (or trough)
across the section. On the 15 ms/trace section, let's follow the peak
that starts at t = 0 s, for that peak the time difference is ~ 0.35 s.
frequency (Hz)
The spacing between traces is 25 m, there are 24 traces, thus the
spatial extend of the section is 23 * 25 = 575 m. Therefore the
wavenumber associated with the 15 ms/trace dip and the 12 Hz
frequency is
-20 20
k = 4.2/0.575 = 7.2 cycles / km (wavenumber for the particular signal)
Wavenumber
(cycles/km)
The bottom panels on the figure are known as f-k plots for
frequency vs wavenumber plots. In the figure both f-k plots
show the same frequency since both come from a 12 Hz signal, but the wavenumber is different
because the dip (velocity) for the two seismic sections is not the same. Events that have negative
slopes (dip to the left) will map into the left quadrant.
How does Spatial Aliasing looks?