Audio Compression
Audio Compression
DEFINITION
AUDIO COMPRESSION
• In MPEG Audio compression, following techniques are used:
• Silence compression — detect the “silence” in the audio signal, and
apply run-length encoding to remove the silence periods to achieve
compression.
• Differential Pulse Coded Modulation (DPCM) is adopted where the
amplitude difference between the two successive samples can be
stored using reduced bits if the difference in amplitude between
successive samples is small.
• Adaptive Differential Pulse Code Modulation (ADPCM)- Encode the
difference between two or more consecutive signals; the difference
is then quantized --> hence the loss. The loss in lossy compressions
is due to the quantization process that converts continuous range of
values to discrete ones.
• It is necessary to predict where the waveform is headed-mApple
has proprietary scheme called ACE/MACE. A Lossy scheme that tries
to predict where wave will go in next sample. Gives about 2:1
compression.
• Adaptive Predictive Coding (APC) is used on Speech
Please note: Since we are quantizing the samples, there is loss in compression
techniques. Quantisation leads to lossy compression.
• APC or Adaptive Predictive Coding is used for
Speech compression.
– Input signal is divided to fixed segments or windows.
– For each segment, some sample characteristics are
computed, e.g. pitch, period, loudness.
- These characteristics are used to predict the
signal.
- Computerised talking (speech synthesisers use
such methods) but at low bandwidth:
(What is quantization?)
Quantization is defined as a lossy data compression technique by
which intervals of data are grouped or binned into a single value
(or quantum). Quantization, in mathematics and digital signal
processing, is the process of mapping input values from a large
set (often a continuous set) to output values in a (countable)
smaller set, often with a finite number of elements. Rounding
and truncation are typical examples of quantization processes. In
MPEG audio compression, some bits are allocated for
quantisation. Quantisation provides compression but introduces
noise and makes the compression lossy. However, quantization
noise may not be perceived by human ear if the quantisation
noise frequencies are masked by the masking frequencies.