Multimedia Systems Chapter 4
Multimedia Systems Chapter 4
Multimedia System 1
Human Vision
The eye works like a camera, with the lens focusing an image onto the retina
(upside-down and left-right reversed).
The retina consists of an array of rods and three kinds of cones. The rods come
into play when light levels are low and produce a image in shades of gray (\all
cats are gray at night!").
For higher light levels, the cones each produce a signal. Because of their
differing pigments, the three kinds of cones are most sensitive to red (R), green
(G), and blue (B) light. It seems likely that the brain makes use of differences
R-G, G-B, and B-R, as well as combining all of R, G, and B into a high-light-
level achromatic channel.
Spectral Sensitivity of the Eye
The eye is most sensitive to light in the middle of the visible spectrum.
The sensitivity of our receptors is also a function of wavelength (Fig.
4.2 below).
The Blue receptor sensitivity is not shown to scale because
it is much smaller than the curves for Red or Green | Blue is a late addition,
in evolution.
Statistically, Blue is the favorite color of humans, regardless of nationality |
perhaps for this reason: Blue is a latecomer and thus is
a bit surprising!
Fig. 4.2 shows the overall sensitivity as a dashed line | this important curve
is called the luminous-efficiency function.
It is usually denoted V (λ) and is formed as the sum of the response
curves for Red, Green, and Blue.
The rod sensitivity curve looks like the luminous-efficiency function V (λ)
but is shifted to the red end of the spectrum.
The achromatic channel produced by the cones is approximately
proportional to 2R + G + B/20
Multimedia System 2
.
Fig. 4.2: R,G, and B cones, and Luminous Efficiency curve V(λ).
These spectral sensitivity functions are usually denoted by letters other than
\R; G; B"; here let’s use a vector function q (λ), with components
The response in each color channel in the eye is proportional to the number
of neurons firing.
A laser light at wavelength λ would result in a certain number of neurons
firing. An SPD is a combination of single frequency lights (like \lasers"),
so we add up the cone responses for all wavelengths, weighted by the eye’s
relative response at that wavelength.
We can succinctly write down this idea in the form of an integral:
Image Formation
Surfaces reflect different amounts of light at different wavelengths, and dark
surfaces reflect less energy than light surfaces.
Fig. 4.3 shows the surface spectral reflectance from (1) orange sneakers and (2)
faded bluejeans. The reflectance function is denoted S(λ).
Multimedia System 3
Fig. 4.3: Surface spectral reflectance functions S(λ) for objects.
Image formation is thus:
Light from the illuminant with SPD E(λ) impinges on a
surface, with surface spectral reflectance function S(λ), is
reflected, and then is filtered by the eye’s cone functions
q (λ).
Reflection is shown in Fig. 4. 4 below.
The function C(λ) is called the color signal and consists of the product
of E(λ), the illuminant, times S(λ), the reflectance:
C(λ) = E(λ) S(λ).
Multimedia System 4
Figure 4.4 Image formation model
The equations that take into account the image formation model are:
Camera Systems
Camera systems are made in a similar fashion; a studio quality camera has
three signals produced at each pixel location (corresponding to a retinal
position).
Analog signals are converted to digital, truncated to integers, and stored. If
the precision used is 8-bit, then the maximum value for any of R; G; B is
255, and the minimum is 0.
However, the light entering the eye of the computer user is that which is
emitted by the screen the screen is essentially a self-luminous source.
Therefore we need to know the light E(λ) entering the eye.
Gamma Correction
The light emitted is in fact roughly proportional to the voltage raised to a
power; this power is called gamma, with symbol γ.
(a) Thus, if the file value in the red channel is R, the screen emits light
Multimedia System 5
proportional to Rγ, with SPD equal to that of the red phosphor paint on the
screen that is the target of the red channel electron gun. The value of gamma is
around 2.2.
(b) It is customary to append a prime to signals that are gamma-corrected by
raising to the power (1/γ) before transmission. Thus we arrive at linear signals:
Multimedia System 6
Fig. RGB and CMY color cubes.
Multimedia System 7
Transformation from RGB to CMY
Simplest model we can invent to specify what ink density to lay down on paper,
to make a certain desired RGB color:
Fig.: color combinations that result from combining primary colors available in
the two situations, additive color and subtractive color.
Fig. : Additive and subtractive color. (a): RGB is used to specify additive color.
(b): CMY is used to specify subtractive color
4.3 Color Models in Video
Video Color Transforms
Largely derive from older analog methods of coding color for TV.
Luminance is separated from color information.
For example, a matrix transform method called YIQ is used to transmit
TV signals in North America and Japan.
This coding also makes its way into VHS video tape coding in these
countries since video tape technologies also use YIQ.
Multimedia System 8
In Europe, video tape uses the PAL or SECAM codings, which are
based on TV that uses a matrix transform called YUV.
Finally, digital video mostly uses a matrix transform called YCbCr
that is closely related to YUV
YUV Color Model
YUV codes a luminance signal (for gamma-corrected signals).
Chrominance refers to the difference between a color and a reference
white at the same luminance. use color differences U, V :
Fig. 4.18: decomposition of color image. Top image (a) is original color
image;
Multimedia System 9
YCbCr Color Model
The Rec. 601 standard for digital video uses another color space, YCbCr,
often simply written YCbCr | closely related to the YUV transform.
YUV is changed by scaling such that Cb is U, but with a coefficient of 0.5
multiplying B’. In some software systems, Cb and Cr are also shifted such
that values are between 0 and 1.
This makes the equations as follows:
Fig: 4.19
The above figure 4.19 shows the color image and its Y′, CB and
CR components. The Y′ image is essentially a grey scale copy of the main
image.
Multimedia System 10