Lecture 16
ELEC1200
Time-Frequency Analysis
• For many complex signals (like speech, music and other sounds), short
segments are well described by a sinusoidal representation with a
few important frequency components.
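As a rough illustration (not from the lecture; the sampling rate, frequencies, and amplitudes below are arbitrary), the sketch builds a 30 ms frame from three sinusoids and checks with the FFT that only a few frequency components dominate its spectrum.

```python
# Minimal sketch: a short frame that is just a sum of a few sinusoids,
# and its amplitude spectrum, which has only a few dominant components.
import numpy as np

fs = 8000                     # sampling rate in Hz (assumed)
n = 240                       # 240 samples = 30 ms at 8 kHz
t = np.arange(n) / fs

# frame = sum of three sinusoids (frequencies/amplitudes chosen arbitrarily)
frame = (1.0 * np.sin(2*np.pi*300*t) +
         0.6 * np.sin(2*np.pi*900*t) +
         0.3 * np.sin(2*np.pi*2100*t))

spectrum = np.abs(np.fft.rfft(frame)) / n     # amplitude spectrum of the frame
freqs = np.fft.rfftfreq(n, d=1/fs)            # frequency axis in Hz

top3 = np.sort(freqs[np.argsort(spectrum)[-3:]])
print(top3)                                   # approximately [ 300.  900. 2100.]
```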
Spectrogram Example
[Figure: amplitude spectra of three individual frames (amplitude vs. frequency, 0–4000 Hz) and the resulting spectrogram (frequency, 0–4000 Hz, vs. frame number); red = loud (high amplitude), blue = quiet (low amplitude).]
Computation of the Spectrogram
• Divide the signal into a set of frames, typically about 20-50 ms long.
[Figure: a waveform divided into short frames; one frame is highlighted.]
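A minimal sketch of the computation in Python/NumPy (the function name, frame length, and test signal are my own; the lecture's implementation may differ): cut the signal into frames and stack each frame's amplitude spectrum as one column of the spectrogram.

```python
# Sketch: spectrogram = amplitude spectrum of each frame, stacked column by column.
import numpy as np

def spectrogram(x, fs, frame_ms=20):
    """Return (spectrogram matrix, frequency axis); columns correspond to frames."""
    frame_len = int(fs * frame_ms / 1000)           # samples per frame
    num_frames = len(x) // frame_len                # non-overlapping frames
    cols = []
    for k in range(num_frames):
        frame = x[k*frame_len:(k+1)*frame_len]
        cols.append(np.abs(np.fft.rfft(frame)))     # amplitude spectrum of frame k
    freqs = np.fft.rfftfreq(frame_len, d=1/fs)      # frequency axis in Hz
    return np.column_stack(cols), freqs

# usage: a 1 s test signal whose frequency jumps halfway through
fs = 8000
t = np.arange(fs) / fs
x = np.where(t < 0.5, np.sin(2*np.pi*500*t), np.sin(2*np.pi*1500*t))
S, freqs = spectrogram(x, fs)
print(S.shape)    # (81, 50): 81 frequency bins x 50 frames
```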
Speech Spectrogram
[Figure: two speech spectrograms (frequency, 0–3500 Hz, vs. frame number).]
Train Whistle vs. Bird Chirps
Can you figure out which is which?
[Figure: two spectrograms (frequency, 0–4000 Hz, vs. frame number), one of a train whistle and one of bird chirps.]
Speech Data
[Figure: speech spectrogram; red = high amplitude, blue = low amplitude.]
• Characteristics
– Recent measurements are more informative in predicting the future than those from the distant past.
• Phonemes
– Any distinct unit of sound that distinguishes one word from another (e.g., p, b)
ELEC1200: A System View of
Communications: from Signals to Packets
Lecture 16
• Time-Frequency Analysis
– Analyzing sounds as a sequence of frames
– Spectrogram
Source Coding/Data Compression
We have a way to reliably send bits across a complex communications
network
Source Encoding & Decoding
• Encoding (Compression)
– INPUT information is converted to a bit stream with as few bits as possible.
– Example INPUTs: text, music, images, video
• Decoding (Decompression)
– The bit stream is converted back to OUT, which is similar (lossy) or identical (lossless) to INPUT.
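A tiny lossless round-trip as a sketch (Python's standard zlib module stands in for the source encoder; this is only an illustration, not the audio coding discussed later): the bit stream is much shorter than INPUT, and decoding reproduces INPUT exactly.

```python
# Lossless encode/decode: OUT is identical to INPUT.
import zlib

INPUT = b"to be or not to be, that is the question " * 100
bitstream = zlib.compress(INPUT)       # encoding: far fewer bytes than INPUT
OUT = zlib.decompress(bitstream)       # decoding: recover the data

print(len(INPUT), len(bitstream))      # e.g., 4200 bytes vs. roughly 100 bytes
print(OUT == INPUT)                    # True: lossless, OUT identical to INPUT
```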
Lossless vs. Lossy Compression
[Diagram: INPUT → Source Encoding → Store/Retrieve or Transmit/Receive → Source Decoding → OUT]
– A 3-minute song of uncompressed CD audio will require 1.411 Mbit/s × 180 s ≈ 254 Mbit ≈ 31.75 MB
Poor ways to compress an audio file
• Reduce the total number of bits per sample
– e.g., from 16 bits to 8 bits per sample
– Gives you a factor of 2 in compression
– However, introduces noticeable distortion
• Reduce the sampling rate
– e.g., 44 kHz to 22 kHz
– Again only a gain of a factor of 2 in size
– However, leads to a noticeable loss of high frequency information
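The sketch below mimics both crude approaches on a made-up test signal (all parameters are arbitrary): each roughly halves the amount of data, with the drawbacks listed above.

```python
# Two poor compression methods: fewer bits per sample, and a lower sampling rate.
import numpy as np

fs = 44000
t = np.arange(fs) / fs                                  # 1 second of signal
x = 0.5*np.sin(2*np.pi*440*t) + 0.2*np.sin(2*np.pi*8000*t)

# (1) 16-bit -> 8-bit samples: half the bits, but coarser quantization (distortion)
x16 = np.round(x * 32767).astype(np.int16)
x8 = (x16 >> 8).astype(np.int8)                         # keep only the top 8 bits

# (2) 44 kHz -> 22 kHz: half the samples, but content above 11 kHz is lost or aliased
x22 = x[::2]                                            # naive decimation (no anti-alias filter)

print(x16.nbytes, x8.nbytes)        # 88000 vs. 44000 bytes
print(x.size, x22.size)             # 44000 vs. 22000 samples
```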
MP3
• MPEG = Moving Picture Experts Group
– set up by ISO (the International Organization for Standardization)
– issues a standard every few years
• MPEG-1 (1992)
• MPEG-2 (1994), …
Psycho-acoustics
• Psycho-acoustics ~ principles of human perception of sound
• Human hearing covers frequencies in [20 Hz, 20 kHz]
– Most sensitive at 2 to 4 kHz
• Normal voice range is about 500 Hz to 2 kHz
– Low frequencies are vowels such as A, E, I, O, …
– High frequencies are consonants (which can be combined with a vowel to form a syllable) such as B, C, D, F, G, K, L, …
• Hearing is more sensitive to loudness at mid frequencies than at other frequencies (e.g., two tones of equal power but different frequencies will not be equally loud)
– Intermediate frequencies: [500 Hz, 5000 Hz]
– Sensitivity decreases at low and high frequencies
Perceptual Coding
• What matters is how the consumer (e.g. human ears or eyes)
perceives the input.
– Frequency range, amplitude sensitivity, color response, …
– Masking effects
• Identify information that can be removed from bit stream without
perceived effect, e.g.,
– Sounds outside frequency range
– Masked sounds
• Encode remaining information efficiently
– Use frequency-based transformations
– Quantize coefficients of frequency (loss occurs here)
– Add lossless coding (e.g., Huffman coding)
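A toy sketch of these three steps (my own simplification, not the MP3 algorithm; masking is not modeled here, and zlib stands in for Huffman coding):

```python
# Toy perceptual coder for one frame: transform, drop inaudible frequencies,
# coarsely quantize the coefficients (loss here), then pack losslessly.
import numpy as np, zlib

fs = 44100
n = 882                                      # one 20 ms frame at 44.1 kHz
t = np.arange(n) / fs
frame = np.sin(2*np.pi*440*t) + 0.1*np.random.randn(n)   # made-up audio frame

coeffs = np.fft.rfft(frame)                  # frequency-based transformation
freqs = np.fft.rfftfreq(n, d=1/fs)
coeffs[(freqs < 40) | (freqs > 15000)] = 0   # remove sounds outside the kept range

c = coeffs / n                               # scale to per-sample amplitudes
q = np.round(np.concatenate([c.real, c.imag]) * 64).astype(np.int8)  # coarse quantization

packed = zlib.compress(q.tobytes())          # lossless coding of the quantized values
print(frame.nbytes, len(packed))             # stored frame (float64 bytes) vs. packed bytes
```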
Masking
• Masking: If a dominant tone • Definitions
is present, then sounds at – Auditory threshold – minimum
frequencies near it will be signal level at which a pure tone
harder to hear. can be heard
– Masking threshold – minimum
• Coding consequences signal level if a dominant tone is
– Less precision is required to present
store information about nearby
frequencies
– Less precision = coarser
quantization
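A toy illustration of masking (the numbers and the simple spreading rule below are invented; the real psychoacoustic model in MP3 is more elaborate): a weak tone close in frequency to a loud masker falls below the masking threshold, while an equally weak tone far away remains audible.

```python
# Toy masking threshold around a single dominant tone.
import numpy as np

def masking_threshold_db(f, masker_f=1000.0, masker_db=60.0, slope_db_per_octave=15.0):
    """Made-up rule: threshold falls off linearly (in dB) per octave from the masker."""
    octaves = abs(np.log2(f / masker_f))
    return masker_db - slope_db_per_octave * octaves

for f, level_db in [(1100.0, 40.0), (3000.0, 40.0)]:      # two equally weak tones
    status = "audible" if level_db > masking_threshold_db(f) else "masked"
    print(f, status)                                       # 1100 Hz masked, 3000 Hz audible
```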
Quantization
• Real signals typically assume continuous values, e.g., between 0 and
+1 or -1 and +1.
• However, for digital storage, we use binary numbers, which have a
limited number of values
– An n-bit number has 2^n different values.
• Thus, we divide the expected signal range R into 2^n different levels, and quantize the original signal by recording the closest level.
[Figure: original signal and its quantized version over range R; 3 bits → 8 levels.]
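A small sketch of uniform quantization, assuming a signal confined to [-1, +1] (the function and test signal are my own):

```python
# Uniform quantization: n bits -> 2^n levels; each sample snaps to the closest level.
import numpy as np

def quantize(x, n_bits, lo=-1.0, hi=1.0):
    levels = np.linspace(lo, hi, 2**n_bits)                 # 2^n equally spaced levels
    idx = np.argmin(np.abs(x[:, None] - levels[None, :]), axis=1)
    return levels[idx]                                      # closest level per sample

t = np.arange(100) / 100
x = np.sin(2*np.pi*2*t)                                     # original signal in [-1, +1]
xq = quantize(x, n_bits=3)                                  # 3 bits -> 8 levels
print(np.unique(xq).size)                                   # at most 8 distinct values
print(np.max(np.abs(x - xq)))                               # error is at most half the level spacing
```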
Resolution
• Quantization or rounding error is the difference between the actual
signal and the quantized signal
• We reduce quantization error by using more levels
• Resolution is either measured as
– the number of bits (more bits mean finer resolution)
– the difference between levels (smaller is better)
• Higher resolution requires more storage
[Figure: quantization error between the original and quantized signal; range R, 8 levels, 3-bit resolution.]
resolution = R / (2^n − 1)
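Example: with a signal range R = 1 (values between 0 and +1) and n = 3 bits, resolution = 1 / (2^3 − 1) = 1/7 ≈ 0.14; with n = 8 bits it becomes 1/255 ≈ 0.004, i.e., much finer resolution at the cost of more storage per sample.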
Principles of Auditory Coding
• Time frequency decomposition
– Divide the signal into frames
– Obtain the spectrum of each piece
• Use psycho-acoustic model to determine what information to keep
– Don’t store information outside the hearing range
(40 Hz to 15 kHz)
– Stereo info not stored for low frequencies
– Masking
• A component (at a given frequency) masks components at neighboring frequencies
• Store the information in the most compact way possible
– Minimize the bitrate requirement
– Maximize the audible content that is retained
MP3 schematic
Input: 1.411 Mbit/s (16 bit per channel @44.1 kHz - stereo)
Output: Coded audio signal at ~128 kbit/s
Without compression: about 1.4 Mbit of data per second, 84 Mbit per minute, or ~5 Gbit per hour
MP3: 128 kbit of data per second, 7.68 Mbit per minute, or ~0.46 Gbit per hour
[Schematic annotations: frequency analysis similar to a Fourier series; effects of masking determined here; information encoded here.]
Non-uniform quantization
• MP3 compression quantizes the amplitudes of different frequency components
differently, depending upon masking.
• Frequency components near a dominant masker are quantized with fewer bits.
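A sketch of the idea (the allocation rule and all numbers below are made up, not the actual MP3 bit-allocation algorithm): coefficients near the masker get fewer bits, hence a coarser quantization step; coefficients far away keep a fine step.

```python
# Masking-driven non-uniform quantization: fewer bits near the dominant masker.
import numpy as np

def bits_for_coefficient(f, masker_f, near_bits=4, far_bits=12, width_hz=500.0):
    """Made-up rule: coarse quantization within +/- width_hz of the masker."""
    return near_bits if abs(f - masker_f) < width_hz else far_bits

def quantize_value(value, n_bits, max_abs=1.0):
    step = 2 * max_abs / 2**n_bits              # coarser step when n_bits is small
    return step * np.round(value / step)

masker_f = 1000.0                               # dominant tone (assumed known)
for f, value in [(1100.0, 0.0731), (4000.0, 0.0731)]:
    b = bits_for_coefficient(f, masker_f)
    print(f, b, quantize_value(value, b))       # 4 bits near the masker, 12 bits far away
```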
Summary
• Audio waveforms are typically analyzed as a sequence of frames
– Within each frame, the signal can be well approximated by a few frequency
components
– The spectrogram can be used to visualize changes in the frequency content over
time
• Source coding/data compression
– Recode message stream to remove redundant information, aka compression. The
goal is to match data rate to actual information content.
– Two types of compression: Lossless vs lossy
• MP3 audio lossy compression combines framing and frequency analysis
with a non-uniform quantization based on a perceptual model
– By throwing away “unimportant” (imperceptible) information, we can obtain large
compression ratios