Video Compression MPEG
Video Compression MPEG
COMPRESSION
INTRODUCTION
■ Film
– Invented in late 18th
century, still widely
used today
■ DVD
– Released in 1996,
dominant for over a
decade
■ Hard Disk
– Around for many
years, only recently
widely used for
storing video (helped
by explosion of
Internet)
Transition from analog
mediums to digital mediums
■ The “N word”
– Analog signals are prone to corruption by noise
■ Economics
– Optical media is cheaper to produce than
magnetic media
■ Creates need to convert analog video to
digital format
Video digitization
■ Interlaced scan
– Odd and even lines displayed on alternate frames
– Initially used to save bandwidth on TV transmission
– When displaying interlaced video on a progressive scan
display, can see “comb effect”
■ Progressive scan
– Display all lines on each frame
– New “fixed-resolution” displays (such as LCD,
Plasma) all use progressive scan
– Deinterlacing is not a trivial task
MPEG (Moving Pictures
Expert Group)
■ Committee of experts that develops video
encoding standards
■ Until recently, was the only game in town (still
the most popular, by far)
■ Suitable for wide range of videos
– Low resolution to high resolution
– Slow movement to fast action
■ Can be implemented either in software or
hardware
Evolution of MPEG
■ MPEG-1
– Initial audio/video compression standard
– Used by VCD’s
– MP3 = MPEG-1 audio layer 3
– Target of 1.5 Mb/s bitrate at 352x240 resolution
– Only supports progressive pictures
■ MPEG-2
– Current de facto standard, widely used in DVD and Digital
TV
– Ubiquity in hardware implies that it will be here for a long
time
■ Transition to HDTV has taken over 10 years and is not
finished yet
– Different profiles and levels allow for quality control
Evolution of MPEG
■ MPEG-3
– Originally developed for HDTV, but abandoned when
MPEG-2 was determined to be sufficient
■ MPEG-4
– Includes support for AV “objects”, 3D content, low bitrate
encoding, and DRM
– In practice, provides equal quality to MPEG-2 at a lower
bitrate, but often fails to deliver outright better quality
– MPEG-4 Part 10 is H.264, which is used in HD-DVD and
Blu-Ray
MPEG technical specification
■ Part 1 - Systems - describes synchronization and multiplexing of video and audio.
■ Part 2 - Video - compression codec for interlaced and non-interlaced video signals.
■ Part 3 - Audio - compression codec for perceptual coding of audio signals. A
multichannel-enabled extension of MPEG-1 audio.
■ Part 4 - Describes procedures for testing compliance.
■ Part 5 - Describes systems for Software simulation.
■ Part 6 - Describes extensions for DSM-CC (Digital Storage Media Command and
Control.)
■ Part 7 - Advanced Audio Coding (AAC)
■ Part 8 - Deleted
■ Part 9 - Extension for real time interfaces.
■ Part 10 - Conformance extensions for DSM-CC.
MPEG video spatial domain
processing
■ Spatial domain handled very similarly to JPEG
– Convert RGB values to YUV colorspace
– Split frame into 8x8 blocks
– 2-D DCT on each block
– Quantization of DCT coefficients
– Run length and entropy coding
MPEG video time domain
processing
■ General idea – Use motion vectors to specify how a 16x16
macroblock translates between reference frames and
current frame, then code difference between reference and
actual block
Types of frames
■ I frame (intra-coded)
– Coded without reference to other frames
■ P frame (predictive-coded)
– Coded with reference to a previous reference frame
(either I or P)
– Size is usually about 1/3rd of an I frame
■ B frame (bi-directional predictive-coded)
– Coded with reference to both previous and future
reference frames (either I or P)
– Size is usually about 1/6th of an I frame
GOP (Group of Pictures)
■ GOP is a set of consecutive frames that can be decoded without
any other reference frames
■ Usually 12 or 15 frames
■ Transmitted sequence is not the same as displayed sequence
■ Random access to middle of stream – Start with I frame
Things about prediction