IT-101
Section 001
Introduction to Information
Technology
Lecture #8
Overview
Chapter 10-11
Background on audio signals
Period, frequency and amplitude
Audio signal components
Sampling rate
Undersampling and Oversampling
Reconstructing audio from samples
Background on Audio
Signals
Sound is created by the motion of air particles in space
A microphone converts this motion into an electrical
signal
Remember the hello wave:
Properties of audio signals
Audio signals have frequency components that are
complex
In other words, most audio signals are made up of
many different frequencies, combining to make the
sound we recognize
The standard for human voice is taken to vary from
about 100 Hz to 3000 Hz
Piano: Concert A above Middle C is 440 Hz
Lowest audible frequency for humans is around 20 Hz
(A low, rumbling bass note)
Highest audible frequency is 20 kHz (beyond the range
of most humans)
Notice the faster rate at which the ear can detect
stimulus changes compared to the eye (ear can detect
rates up to 20,000 per second, eye can detect only 50
times per second)
This needs to be taken into account when converting
audio information into digital form-we need to use far
more samples per second of information for audio
Period, Frequency &
Amplitude
Frequency (f), measured in Hertz (Hz) refers to the rate of repetition of
the signal
If the frequency of a signal is 2Hz, then 2 cycles of the wave are
completed per second
Period, measured in seconds (s) is the time it takes to complete one
cycle of the wave
Frequency and period are Inversely Proportional:
T=1/f
f=1/T
If the frequency of the signal is 2Hz, then the period is 0.5 s (i.e. it takes
0.5 seconds for the wave to complete one cycle)
Amplitude is the magnitude of the signal at a given point in time. This
could be volts, etc..
Amplitude relates to volume
Louder sounds have greater vertical displacement of sound wave
Frequency : f=1/(10x10-3)=100 Hz
One cycle
4V
of the wave
3V Period: T=10 ms
2V
1V Amplitude: A=4 V Amplitude: A=3 V
0V
-1 V
Amplitude: A=-2 V
-2 V
-3 V
-4 V
0 5 10 t (ms)
Signal with twice the frequency
4V
3V
Notice the period
2V
is half the value as before
1 V Period: T=5 ms
0V
Notice the frequency
-1 V
is twice the value as before
-2 V Frequency : f=1/(5x10-3)=200 Hz
-3 V
-4 V
0 5 10 t (ms)
Lets listen to sine waves with
various frequencies:
http://www.mindspring.com/~scottr/zmusic/
Multipliers
The following are the common multipliers used for audio
characteristics such as period (T) and frequency (f):
Giga (G) 109 1,000,000,000.
Mega (M) 106 1,000,000.
Kilo (k) 103 1,000.
milli (m) 10-3 .001
micro () 10-6 .000001
nano (n) 10-9 .000000001
For example, KHz = KiloHertz = 1000 Hz
ms = millisecond = 1/1000 = .001 seconds
Audio Signal
Components
Concert A on a vibraphone
Sound waves are the sum of simple pure
tones
Frequency composition
(spectrum) of a signal
The different frequency
components which are added
together to produce a complex
waveform are called the
frequency spectrum of that
waveform.
Sound Wave Vs. Frequency
Spectrum
Note there is not
much frequency
content above 1
KHz
Fortunately, we do not need to know the specific frequency content
of a signal to digitize it.
We only need to know the highest frequency signal in a sample.
Why?
Digitizing Audio Signals
In the previous lecture, we learned how continuous images
are digitized first by dividing the image into a certain
number of pixels, then determining the brightness level of
each pixel and finally assigning a code of certain length to
each pixel.
A similar procedure is used to digitize audio signals.
The first step is called sampling where the waveform is
sampled at certain intervals.
The second step, called quantization, involves rounding
off the continuous values of the audio samples so they can
be represented by a finite number of bits.
Two step process for digitizing
audio
Continuous function of time
Infinite amount of information
Must choose particular instants of time
STEP 1 STEP 2
Continuous Audio Made Discrete Quantized into a
Signal In Time Series of Binary Digits
Sampling Rate
The sampling rate determines how many values of the signal we choose to
Sampling Interval (T)
Amount of time separating the samples
Also called sampling period
Sampling Rate (f)
Number of samples per second
Also called sampling frequency
Ts = 1/fs or fs = 1/Ts
Sampling Interval Sampling Rate
1 milliseconds 1 kHz = 1000 samples/sec
4 milliseconds 250 Hz = 250 samples/sec
16 milliseconds 62.5 Hz = 62.5 samples/sec
Digital Telephone Example
In a digital telephone system, the speech signal
is sampled 8,000 Hz. What is the sampling
period?
Ts = 1/fs
Ts = 1/8000
Ts = .000125 = 1.25 x 10-4 = 125 s
Nyquist Sampling Frequency
How often do you sample?
The sampling rate depends on the signals highest frequency (for base
Harry Nyquist, working at Bell Labs developed what has
become known as the Nyquist Sampling Theorem:
In order to be perfectly represented by its samples,
a signal must be sampled at a sampling rate (also
called sampling frequency) equal to at least twice
its highest frequency component
Or: fs = 2f
Note that fs here is frequency of sampling, not the
frequency of the sample
Sampling Rate Examples
Take Concert A: 440 Hz
What would be the minimum sampling rate needed
to accurately capture this signal?
f = 2 x 440 Hz = 880 Hz
s
Take your telephone used for voice, mostly
Highest voice component is: 3000 Hz
Minimum sampling rate: f = 2 x 3000 Hz = 6000 Hz
s
Real telephone digitization is done at 8000 Hz
sampling rate (supporting a 4 kHz bandwidth). Why?
Remember that Nyquist said equal to at least
twice
Undersampling and
Oversampling
Undersampling
Sampling at an inadequate frequency rate
Aliased into new form - Aliasing
Loses information in the original signal
Oversampling
Sampling at a rate higher than minimum rate
More values to digitize and process
Increases the amount of storage and transmission
COST $$
Effects of Undersampling
Original waveform
Reconstructed
waveform
Reconstructing Audio
from Samples
After receiving the signal, it is necessary to
reconstruct it in order to hear it.
The signal is reconstructed from its samples.
Exact reconstruction is possible if the sampling rate is
sufficiently high enough.
Reconstructed speech
A short section of a speech
waveform with 1 KHz sampling
waveform (highest frequency
rate (note the resulting
component is 3KHz)
waveform does not resemble
Reconstructed speech Reconstructed speech
waveform with 5 KHz sampling waveform with 10 KHz
rate (the resulting waveform sampling rate (the resulting
starts resembling the original waveform highly resembles
waveform) the original waveform)
Comments for next class
Chapter 12
Digital Audio
Digitization of Audio Samples
Quantization
Reconstruction
Quantization error