TS Lecture7
TS Lecture7
=S2/12
Signal to noise ratio (SQR) is
a good measure of
performance of a PCM system.
SQR=1.76+6.02n dB
12
Companding In linear or uniform quantization,
the magnitude of quantization
noise is absolute for a particular
system and is independent of the
input signal amplitude.
Therefore, comparatively, the
weak and low-level signals suffer
worse from quantization noise
than the loud and strong signals.
The very high percentage error
at low input signal levels actually
represents idle channel noise.
The effect of this is particularly
bothersome during speech
pauses and can be minimized by
ef=(S/2)/|V|
For sinusoidal input, S=2Vm/M,
choosing 0 volt level as a
Hence, ef=[Vm/(M|V|)]×100% quantization level and avoiding
the mid points of the first
intervals on either side of the
zero level as quantization levels.
13
Companding
The scheme which uses the
two first midpoints is known
as mid-riser scheme and the
other as mid-tread scheme.
The mid-tread scheme uses
odd number of quantization
levels, i.e., M=2n-1
In mid-tread scheme, very
low signals are decoded into a
constant, zero-level output.
However, if a d.c. bias exists
in the encoder, idle channel
noise is still a problem with
mid-tread quantisation.
14
Companding A more efficient method of minimizing
large variations in the percentage
quantization error over the signal range is
to use nonlinear or nonuniform
quantization.
It is interesting to note that uniform
quantization intervals result in
nonuniform SQR over the signal range
and nonuniform intervals result in
uniform SQR.
The effect of permitting larger
quantization intervals at higher signal
amplitudes is to compress the input signal
to achieve a uniform quantization level.
The input signal is first compressed by
using a nonlinear functional device and
then a linear quantizer is used. At the
receiving end, the quantized signal is
expanded by a nonuniform device having
an inverse characteristic of the
compression at the sending end.
The process of first compressing and then
expanding is referred to as companding.
15
Companding A variety of nonlinear
compression-expansion
functions can be chosen to
implement a compandor. The
obvious one is a logarithmic
law.
Unfortunately, the function
y=lnx does not pass through
the origin.
So, it is necessary to substitute
a linear portion to the curve for
lower values of x.
Most practical companding
systems are based on a law
For logarithmic section, suggested by K.W. Cattermole.
y=(1+lnAx)/(1+lnA) for 1/A≤x≤1 These equations are collectively
For linear section, known as A-law used by India
y=Ax/(1+lnA) for 0≤x≤1/A and other European countries.
A=compression coefficient
The expansion function is given by, U.S.A & Japan follow a variation
x=ey(1+lnA)-1/A for 1/(1+lnA) ≤y≤1 of A-law known as µ-law.
x=y(1+lnA)/A for 0≤y≤1/(1+lnA) 16
Companding In practice, a piecewise linear
segment approximation is used.
A-law companding consists of eight
linear segments for each polarity.
The slope halves for each segment
except the lowest two segments
which have the same slope.
The lowest two segments of positive
& negative polarities coalesce into
one straight line segment.
As a result, there are 13 effective
segments in the curve and the law is
sometimes referred to as 13-
segment companding law.
In µ-law, the slope halves in the
lowest two segments also, giving rise
to 15 effective segments.
Each segment is divided into 16
linear steps. Eight bits are required
to represent each sample value: 1-
bit sign, 3-bit segment number and a
4-bit linear step number.
There are in all 256 defined signal
levels.
17
Differential coding
PCM is not specifically designed for digitizing speech
waveforms.
Speech waveforms exhibit considerable redundancy which
can be usefully exploited in designing coding schemes.
The following characteristics of speech signals contribute to
the redundancy:
Nonuniform amplitude distributions
Sample-to-sample correlations
Periodicity or cycle-to-cycle correlations
Pitch interval-to-pitch interval correlations
Speech pauses or inactivity factors
A sizeable fraction of the human speech sounds is produced
by the flow of puffs of air from the lungs into the vocal tract.
The interval between these puffs of air is known as the pitch
interval. There may be as many as 20 to 40 pitch intervals in
a single sound.
Typically, a party is active for about 40% of a call duration.
18
Differential coding
Delta or differential coding systems are designed to take
advantage of the sample-to-sample redundancies in speech
waveforms.
Because of the strong correlation between adjacent speech
samples, large abrupt changes in levels do not occur frequently
in speech waveforms.
In such situations, it is more efficient to transmit or encode and
transmit only the signal changes instead of the absolute value
of the samples.
Delta modulation (DM) is a scheme that transmits only the
signal changes and differential pulse code modulation (DPCM)
encodes the differences and transmits them.
A delta modulator may be implemented by simply comparing
each new signal sample with the previous sample and
transmitting the resulting difference signal.
At the receiver end, the difference signals are added up to
construct the absolute signal by using an integrator.
However, such a system, being open loop, suffers from the
possibility of the receiver output diverging from the transmitter
input due to system errors or inaccuracies. 19
Differential coding The system can be converted into
a closed loop system by setting up
a feedback path with an integrator
at the transmitting end.
When the input is constant, the
output of the transmitter is an
alternating positive and negative
pulse train. This constitutes the
quantization noise in delta
modulators and is also known as
granular noise.
If the transmitter input signal
changes too rapidly, the receiver
output is unable to keep up and
this phenomenon is known as
slope overload.
This problem may be overcome by
using a variable slope integrator
whose output slope is increased or
decreased, depending on the rate
of change of the input signal.
20
Vocoders
By considering some of the properties that are more or less
unique to speech, such as pitch interval and cycle
correlations, significant reductions can be achieved in bit
rates.
Coding systems that are so specifically designed for voice
signal are known as voice coders or vocoders & operate
typically at bit rates in the range 1.2-2.4 kbps.
Vocoders take into account the physiology of the vocal cords,
the larynx, the throat, the mouth, the nasal passages and the
ear in their design.
The basic purpose of the vocoders is to encode only the
perceptually important aspects of speech and thereby reduce
the bit rate significantly.
As a result, the reproduced voice is synthetic sounding and
unnatural with artificial quality.
Main applications include recorded message announcements,
encrypted voice transmission, voice mail etc.
21
Vocoders Human speech is generated in
two basic ways:
Voiced sounds generated as