1.2, 1.3 Text and Image Storage & Compression
1.2, 1.3 Text and Image Storage & Compression
Unicode
Universal standard that covers all languages and all writing
systems
More efficient coding system than ASCII
Uniform encoding – each character is either 16 or 32 bits
Each 16- or 32-bit value always represents the same
character
Parts of the code are reserved for the user to add their own
characters and symbols
Sound Representation
Sound waves (analogue) have Frequency, Amplitude and
Wavelength
Sound waves are sampled at regular time intervals
Sampling Resolution: number of bits per sample (aka Bit
Depth)
Sampling Rate: number of sound samples taken per second
(Hz)
Greater sampling rate/sampling resolution -> more accurate
sound
Benefits and Drawbacks of larger sound sampling resolution:
Benefits Drawbacks
Larger dynamic range Larger file size
Better sound quality Longer to transmit files
Less sound distortion Needs greater processing
power
Lossy: Examples
1. MP3:
Audio files
Reduces file size by 90%
Removes sounds outside of human ear range
If multiple sounds are played together, then only the
louder sound is played. Softer sound is eliminated
2. MP4:
Works similar to MP3, but can store multimedia – videos
with audio
3. JPEG:
Compresses images
Reduces colour shades which the human eye cannot
differentiate between properly
Splits images into 8x8 pixel blocks, allowing for certain
info to be removed when compressing
Lossless: Examples
Run Length Encoding (RLE):
Repeated strings of adjacent, identical data are compressed
A repeating string is encoded into:
o First value – represents number of identical data items
o Second value – represents ASCII or keyboard code of the
character
A ‘flag’ is used preceding data to indicate that what follows is
repeating units