Audio Compression

Audio compression reduces the amount of data in recorded audio for efficient transmission and storage, utilizing techniques such as silence compression, DPCM, and psychoacoustic models. It employs methods like predictive encoding for speech and perceptual encoding for music, which take advantage of human hearing limitations to mask inaudible frequencies, leading to lossy compression. The process involves quantization, which introduces noise but is often imperceptible to listeners due to frequency masking.


AUDIO COMPRESSION

DEFINITION
• In MPEG Audio compression, the following techniques are used:
• Silence compression: detect the "silence" in the audio signal and
apply run-length encoding to the silent periods to achieve
compression.
• Differential Pulse Code Modulation (DPCM): the amplitude
difference between two successive samples can be stored using
fewer bits when the difference in amplitude between
successive samples is small.
• Adaptive Differential Pulse Code Modulation (ADPCM): encode the
difference between two or more consecutive samples; the difference
is then quantized, hence the loss. The loss in lossy compression
is due to the quantization process, which converts a continuous
range of values to discrete ones.
• It is necessary to predict where the waveform is headed. Apple
has a proprietary scheme called ACE/MACE, a lossy scheme that tries
to predict where the wave will go in the next sample. It gives
about 2:1 compression.
• Adaptive Predictive Coding (APC) is used on speech.

Please note: since the samples are quantized, there is loss in these
compression techniques. Quantization leads to lossy compression.
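The DPCM idea described above can be sketched in a few lines of Python. This is a toy illustration, not an actual MPEG coder: the step size QSTEP and the zero-initialised predictor are assumed for the example. Encoding stores quantized differences between successive samples; decoding accumulates them, and the mismatch with the input is the quantization loss.

```python
# Toy DPCM sketch (illustrative only; real codecs use adaptive predictors).
QSTEP = 4  # assumed quantization step for the differences

def dpcm_encode(samples):
    """Return quantized differences between successive samples."""
    diffs = []
    prev = 0  # assume the predictor starts at zero
    for s in samples:
        d = s - prev
        q = round(d / QSTEP)   # quantize: continuous range -> discrete levels
        diffs.append(q)
        prev += q * QSTEP      # track what the decoder will reconstruct
    return diffs

def dpcm_decode(diffs):
    """Rebuild an approximation of the original samples."""
    out, prev = [], 0
    for q in diffs:
        prev += q * QSTEP
        out.append(prev)
    return out

signal = [0, 3, 6, 8, 9, 9, 7, 4]
decoded = dpcm_decode(dpcm_encode(signal))
# decoded differs slightly from signal: that error is the quantization noise
```

Note how the encoder tracks the decoder's reconstruction (`prev`) rather than the true previous sample, so the quantization error does not accumulate.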
• APC, or Adaptive Predictive Coding, is used for
speech compression.
– The input signal is divided into fixed segments or windows.
– For each segment, some sample characteristics are
computed, e.g. pitch, period, loudness.
– These characteristics are used to predict the
signal.
– Speech synthesisers use such methods for computerised
talking, but at low bandwidth.
(What is quantization?)
Quantization is a lossy data compression technique by which
intervals of data are grouped, or binned, into a single value
(a quantum). In mathematics and digital signal processing,
quantization is the process of mapping input values from a large
set (often a continuous set) to output values in a (countable)
smaller set, often with a finite number of elements. Rounding
and truncation are typical examples of quantization processes. In
MPEG audio compression, some bits are allocated for
quantization. Quantization provides compression but introduces
noise, which makes the compression lossy. However, quantization
noise may not be perceived by the human ear if the quantization
noise frequencies are masked by the masking frequencies.
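The definition above (rounding as a typical quantization process) can be made concrete with a minimal sketch: map continuous values onto multiples of a step, then measure the noise this introduces. The sample values and the step of 0.25 are assumed for illustration.

```python
# Minimal uniform quantization sketch: map a continuous range of
# values onto a small discrete set, then measure the noise introduced.

def quantize(x, step):
    """Round x to the nearest multiple of `step` (one quantum)."""
    return round(x / step) * step

samples = [0.12, 0.47, 0.51, 0.88, 0.93]
step = 0.25                       # coarse step -> few output levels
quantized = [quantize(s, step) for s in samples]
noise = [s - q for s, q in zip(samples, quantized)]
# every sample lands on a multiple of 0.25, and the error (noise)
# never exceeds half the step size
```

The original values cannot be recovered from the quantized ones, which is exactly why quantization makes compression lossy.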

Audio compression uses psycho-acoustic models


We use this limited hearing property of the ear to compress audio
• If two frequencies are close and the amplitude of one is less than
that of the other, the softer frequency may not be
heard (it is masked)

THE FREQUENCIES THAT ARE MASKED ARE SUPPRESSED IN AUDIO COMPRESSION


AND ARE NOT TRANSMITTED. IF MORE FREQUENCIES CAN BE MASKED, A HIGHER
COMPRESSION RATIO CAN BE ACHIEVED. However, that degrades audio quality.
• In MPEG Audio compression, all the inaudible
frequencies (frequency masking of the signal in
the frequency domain) and inaudible tones
(audio masking of the signal in the time
domain, i.e. temporal masking) are masked.
• Explain audio compression with a neat
diagram (very important). Give any one
diagram.
1. The input audio signal is sampled and quantized
using PCM (Pulse Code Modulation).
2. The PCM samples are divided into frequency sub-
bands by analysis filters or critical-band filters
(filter banks), which break the signal into equal-width
sub-bands. These filters use the Discrete Cosine
Transform (DCT) to divide the signal into 32 sub-
bands (32 narrow-width frequency bands). The
scaling factors for these sub-bands are computed.
3. Bits for quantization and scaling are allocated by
the psychoacoustic model, which works on the data in
parallel with the subdivision of the input signal into
frequency bands.
4. So the PCM audio signal is converted to the
frequency domain, it is quantized, and scaling factors
are computed.
5. Among the 32 frequency bands, quantized using the
bit allocations from the psychoacoustic model, not
all frequencies are audible to the human ear.
Moreover, since these frequencies are quantized, the
quantization noise in them is also not perceptible
by the ear. So most quantization noise is suppressed.
6. The psychoacoustic modelling is applied in parallel with the
filtering process, i.e. converting the input to narrow frequency
bands. According to this model, there can be frequency masking
or temporal masking. Not all audio frequency bits are
transmitted, as many audio bits are masked; only certain
bits are allocated to be transmitted. The psychoacoustic
modeller uses the FFT (Fast Fourier Transform). The
quantization noise is minimised by minimising its
audibility through masking.
7. Then the quantized output, along with the bits for quantization
and scaling, is formatted. The formatted or encoded
bitstream is transmitted.
8. At the decoder end, the encoded bitstream is decoded back to
PCM audio.
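The subband steps above can be caricatured in Python. This is a drastically simplified, assumed sketch, not the real algorithm: an actual MPEG encoder uses a 32-band polyphase filterbank, an FFT-driven psychoacoustic model, and standardized bit-allocation tables, whereas here each "band" is just a slice of samples, the scale factor is the band's peak, and louder bands get more quantization bits.

```python
# Toy subband coder: split -> scale factor per band -> bit allocation
# -> quantize -> (transmit) -> dequantize. All numbers are illustrative.

NUM_BANDS = 4          # toy value; MPEG audio uses 32 subbands

def encode_bands(samples):
    size = len(samples) // NUM_BANDS
    bands = [samples[i * size:(i + 1) * size] for i in range(NUM_BANDS)]
    coded = []
    for band in bands:
        scale = max(abs(s) for s in band) or 1.0   # "scale factor"
        bits = 4 if scale > 0.5 else 2             # toy "bit allocation"
        half = 2 ** bits // 2
        q = [round(s / scale * half) for s in band]  # quantize per band
        coded.append((scale, bits, q))
    return coded

def decode_bands(coded):
    out = []
    for scale, bits, q in coded:
        half = 2 ** bits // 2
        out.extend(v / half * scale for v in q)    # dequantize
    return out

pcm = [0.9, -0.8, 0.1, -0.1, 0.05, 0.02, 0.01, -0.01]
reconstructed = decode_bands(encode_bands(pcm))
```

The design point this illustrates is that quantization error is controlled per band: bands carrying more energy receive more bits, so their relative error stays small.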
• In speech encoding, predictive encoding is
used. In music encoding, perceptual encoding
is used: low-frequency sounds that are
not heard by the ear are suppressed, and for
low-frequency sound the stereo mode is turned
off and only mono mode is kept. Temporal
masking and frequency masking are used to
mask those frequencies which the human ear
cannot hear.
• Q: What is audio compression? Why is it done?
(2) What techniques does it use for speech and music
compression? (2)
• Q: Explain MPEG Audio compression technique
with a neat diagram (5) (Already discussed)
• Q: What is psychoacoustic model? What features
of psychoacoustic model are used for
compression in MPEG Audio? Which frequencies
and tones are masked? Also give the features of
frequency masking and temporal masking.
(3+3+2+2) (Given in slides)
• Q: How many layers of compression are offered by
MPEG (there are 3 layers of compression in MPEG)?
What are the encoding complexities in each layer? /
Q: What is the compression/encoding complexity in
Layer 1, Layer 2 and Layer 3 of MPEG Audio
compression? (3)
• What are the audio features of MPEG? (3)
• Q: What is predictive encoding and perceptual
encoding? Which gives higher compression ratio?
(Perceptual)(2+1)
• Q: What is quantization? When quantization is applied
to compression, is it lossy or lossless and why?
• Quantization is a lossy data compression
technique by which intervals of data are grouped or
binned into a single value (a quantum). Quantization,
in mathematics and digital signal processing, is the
process of mapping input values from a large set (often
a continuous set) to output values in a (countable)
smaller set, often with a finite number of elements.
Rounding and truncation are typical examples of
quantization processes. In MPEG audio compression,
some bits are allocated for quantization. Quantization
provides compression but introduces noise and makes
the compression lossy. However, quantization noise
may not be perceived by the human ear if the
quantization noise frequencies are masked by the
masking frequencies.
• When quantization is applied, compression is
always lossy. This is because a continuous
range of values is discretised, i.e. mapped to
single values. This introduces noise, and the
original values can no longer be recovered
exactly, which is why compression is lossy
wherever quantization is applied. In audio
compression, most of the noise frequencies are
masked and not transmitted.
• Q: What is audio compression? Why is it done?
(2) What techniques does it use for speech and music
compression in MPEG? (2)
• Audio compression (data) is a type of lossy or
lossless compression in which the amount of
data in a recorded waveform is reduced to
differing extents for transmission, respectively
with or without some loss of quality. It is used
in CD and MP3 encoding, Internet radio, and the like.
• It is done to reduce the total quantity of data
contained in the audio file for transmission, so
that it does not require excess network
bandwidth. Moreover, small compressed files are
easy to store.
• MPEG uses predictive encoding for speech
compression and perceptual encoding for
music compression.
• Predictive encoding transmits the difference in
signal values between successive samples
instead of transmitting the absolute sample
values, and its CR is lower.
• Perceptual encoding takes the psychoacoustic
phenomenon into account. Human ears can
generally hear sound in the frequency range of
20 Hz-20 kHz but are most sensitive to
frequencies of 1-5 kHz, more specifically 2-4 kHz.
• Long silent periods of audio are
suppressed by applying run-length encoding
and are not transmitted.
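The silence suppression with run-length encoding mentioned above can be sketched as follows. The amplitude threshold and the `("SIL", n)` token format are assumed for the example; real codecs make this decision on frames, not individual samples.

```python
# Sketch of silence compression: samples whose magnitude stays below a
# threshold are treated as silence and replaced by a run-length token,
# so long quiet stretches cost almost nothing to store.

SILENCE_THRESHOLD = 2   # toy amplitude threshold (assumed)

def compress_silence(samples):
    """Return a list mixing raw samples and ('SIL', run_length) tokens."""
    out, run = [], 0
    for s in samples:
        if abs(s) < SILENCE_THRESHOLD:
            run += 1                       # extend the current silent run
        else:
            if run:
                out.append(("SIL", run))   # flush the silent run
                run = 0
            out.append(s)                  # keep audible samples as-is
    if run:
        out.append(("SIL", run))
    return out

audio = [9, 7, 0, 1, 0, 0, 1, 8, 5]
compact = compress_silence(audio)          # five quiet samples -> one token
```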
• Perceptual encoding uses frequency and
temporal masking. Two types of masking happen
in our ears: frequency masking and temporal
masking. It is seen that within nearby
frequencies, a louder sound at a lower frequency
can mask a softer sound at a higher frequency, but
the frequencies have to be in a small range. If
the frequencies are far apart, frequency
masking does not take place. For example, a tone of
1 kHz at 60 dB can mask (cover, make inaudible) a
tone of 1.1 kHz at 40 dB. This is called frequency
masking.
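The masking rule just described can be written as a small predicate. The 200 Hz "nearby" window and the plain level comparison are assumed toy values, not the actual MPEG psychoacoustic thresholds, which vary with frequency along the critical-band scale.

```python
# Sketch of the frequency-masking decision: a loud tone can mask a
# softer tone at a nearby frequency, and masked tones need not be
# transmitted. Thresholds here are illustrative assumptions.

MASKING_WINDOW_HZ = 200.0   # assumed "nearby frequency" range

def is_masked(masker, candidate):
    """masker and candidate are (freq_hz, level_db) tuples."""
    freq_close = abs(masker[0] - candidate[0]) <= MASKING_WINDOW_HZ
    louder = masker[1] > candidate[1]
    return freq_close and louder

# the slide's example: a 1 kHz tone at 60 dB masks 1.1 kHz at 40 dB
masked = is_masked((1000, 60), (1100, 40))
# a distant tone is not masked even if it is softer
not_masked = is_masked((1000, 60), (5000, 40))
```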
• THE MASKED FREQUENCIES ARE COMPRESSED; THEY ARE
NOT TRANSMITTED. Since these masked frequencies do
not contribute to the quality of sound (our ears
cannot perceive them), the loss in transmitted audio
quality cannot be noticed by the listener. This
phenomenon is made use of in perceptual encoding.
• Masking of sound in the frequency domain is called frequency
masking, and in the time domain, temporal masking.
When a loud or strong signal of lower frequency is heard
for some time, then when that sound is removed, for some
time we are unable to hear softer sounds at nearby
frequencies. This is called temporal masking. So softer
sounds near loud frequencies can be masked, or
covered, during transmission.
• The masked frequencies are not transmitted and
suppressed resulting in good Compression Ratio (CR)
• Q: What is the compression/encoding
complexity in Layer 1, Layer 2 and Layer 3 of
MPEG Audio compression? (3)
• MPEG compression is done in 3 layers.
• What are the audio features of MPEG? (3)
• Q: What is predictive encoding and perceptual
encoding? Which gives higher compression ratio?
(Perceptual)(2+1)
• (Already explained)
• Perceptual encoding gives a higher CR, as many of
the frequencies which our ear cannot hear or
perceive can be masked. The acoustically
irrelevant frequencies are removed and not
transmitted with the signal. Predictive encoding
transmits the difference in signal values between
successive samples instead of transmitting the
absolute sample values, and its CR is lower.
• Give the advantages and disadvantages of audio
compression.
