
Huffman Encoding

Farhad Muhammad Riaz


Entropy
• Entropy is a measure of information content: the
number of bits actually required to store data.
• Entropy is sometimes called a measure of surprise
– A highly predictable sequence contains little actual
information
• Example: 11011011011011011011011011 (what’s next?)
• Example: I didn’t win the lottery this week
– A completely unpredictable sequence of n bits contains
n bits of information
• Example: 01000001110110011010010000 (what’s next?)
• Example: I just won $10 million in the lottery!!!!
– Note that nothing says the information has to have any
“meaning” (whatever that is)
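In symbols (the standard Shannon formula, not spelled out on the slide): if a source emits symbol i with probability p(i), its entropy is H = −Σ p(i) log₂ p(i) bits per symbol.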
Actual information content
• A partially predictable sequence of n bits carries
less than n bits of information
– Example #1: 111110101111111100101111101100
– Blocks of 3: 111 110 101 111 111 100 101 111 101 100
– Example #2: 101111011111110111111011111100
– Unequal probabilities: p(1) = 0.75, p(0) = 0.25
– Example #3: "We, the people, in order to form a..."
– Unequal character probabilities: e and t are common, j
and q are uncommon
– Example #4: {we, the, people, in, order, to, ...}
– Unequal word probabilities: the is very common
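A minimal Python sketch (my illustration, not code from the slides) that puts a number on Example #2: with p(1) = 0.75 and p(0) = 0.25, each bit carries only about 0.81 bits of information. Note that this frequency-only estimate cannot detect repeating patterns, so it would not capture the block structure of Example #1.

from math import log2

def entropy(probs):
    # Shannon entropy: average bits of information per symbol
    return -sum(p * log2(p) for p in probs if p > 0)

print(entropy([0.75, 0.25]))  # about 0.811 bits per symbol: less than 1
print(entropy([0.5, 0.5]))    # 1.0 bit per symbol: completely unpredictable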
Fixed and variable bit widths
• To encode English text, we need 26 lower case
letters, 26 upper case letters, and a handful of
punctuation
• We can get by with 64 characters (6 bits) in all
• Each character is therefore 6 bits wide
• We can do better, provided:
– Some characters are more frequent than others
– Characters may have different bit widths, so that, for
example, e uses only one or two bits while x uses several
– We have a way of decoding the bit stream
• Must tell where each character begins and ends
Example Huffman encoding
• A=0
B = 100
C = 1010
D = 1011
R = 11
• ABRACADABRA = 01001101010010110100110
• This is eleven letters in 23 bits
• A fixed-width encoding would require 3 bits for
five different letters, or 33 bits for 11 letters
• Notice that the encoded bit string can be decoded!
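As a quick check that the bit string really is decodable, here is a small Python sketch (my illustration, not from the slides). It scans the bits left to right; the prefix property guarantees that the first codeword match is the only possible one.

CODE = {"A": "0", "B": "100", "C": "1010", "D": "1011", "R": "11"}
DECODE = {bits: ch for ch, bits in CODE.items()}

def decode(bitstring):
    out, buf = [], ""
    for bit in bitstring:
        buf += bit
        if buf in DECODE:  # prefix property: no shorter match was possible
            out.append(DECODE[buf])
            buf = ""
    return "".join(out)

print(decode("01001101010010110100110"))  # prints ABRACADABRA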
Why it works
• In this example, A was the most common letter
• In ABRACADABRA:
– 5 As: the code for A is 1 bit long
– 2 Rs: the code for R is 2 bits long
– 2 Bs: the code for B is 3 bits long
– 1 C: the code for C is 4 bits long
– 1 D: the code for D is 4 bits long
– Total: 5(1) + 2(2) + 2(3) + 1(4) + 1(4) = 23 bits
Creating a Huffman encoding
• For each encoding unit (letter, in this example),
associate a frequency (number of times it occurs)
– You can also use a percentage or a probability
• Create a binary tree node whose two children are the
encoding units with the smallest frequencies
– The frequency of the root is the sum of the frequencies
of the leaves
• Repeat this procedure until all the encoding units
are in the binary tree
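A minimal Python sketch of this procedure (my own illustration, not code from the slides), using a heap to repeatedly merge the two smallest frequencies. Ties may be broken differently than in the slides, so the exact bit patterns can vary, but the code lengths remain optimal.

import heapq

def huffman_codes(freqs):
    # Heap entries are (frequency, tie_breaker, tree); a leaf is a symbol,
    # an internal node is a (left, right) pair.
    heap = [(f, i, sym) for i, (sym, f) in enumerate(freqs.items())]
    heapq.heapify(heap)
    counter = len(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)   # two smallest frequencies
        f2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, counter, (left, right)))  # root freq = sum of leaves
        counter += 1
    codes = {}
    def walk(node, path):
        if isinstance(node, tuple):
            walk(node[0], path + "0")  # 0 on left branches
            walk(node[1], path + "1")  # 1 on right branches
        else:
            codes[node] = path or "0"  # single-symbol edge case
    walk(heap[0][2], "")
    return codes

print(huffman_codes({"A": 40, "B": 20, "C": 10, "D": 10, "R": 20}))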
Example, step I
• Assume that relative frequencies are:
– A: 40
– B: 20
– C: 10
– D: 10
– R: 20
• (I chose simpler numbers than the real frequencies)
• The smallest numbers are 10 and 10 (C and D), so connect those
Example, step II
• C and D have already been used, and the new node
above them (call it C+D) has value 20
• The smallest values are B, C+D, and R, all of
which have value 20
– Connect any two of these
Example, step III
• The smallest value is R (at 20), while A and B+C+D
both have value 40
• Connect R to either of the others
Example, step IV
• Connect the final two nodes
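Putting the four steps together, one consistent merge order (my summary, not a slide) is: C(10) + D(10) → 20; B(20) + C+D(20) → 40; R(20) + B+C+D(40) → 60; A(40) + 60 → 100 at the root.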
Example, step V
• Assign 0 to left branches, 1 to right branches
• Each encoding is a path from the root
• A=0
B = 100
C = 1010
D = 1011
R = 11
• Each path terminates at a leaf
• Do you see why encoded strings are decodable?
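A quick check (my arithmetic, not on the slides): treating the frequencies as counts per 100 characters, the encoded length is 40·1 + 20·3 + 10·4 + 10·4 + 20·2 = 220 bits, about 2.2 bits per character on average, versus 3 bits per character for a fixed-width code over five letters.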
Unique prefix property
• A=0
B = 100
C = 1010
D = 1011
R = 11
• No bit string is a prefix of any other bit string
• For example, if we added E=01, then A (0) would
be a prefix of E
• Similarly, if we added F=10, then it would be a
prefix of three other encodings (B=100, C=1010,
and D=1011)
• The unique prefix property holds because, in a
binary tree, a leaf is not on a path to any other node
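The property is also easy to test mechanically. A small Python sketch (my illustration, not from the slides) that flags the F = 10 problem described above:

def is_prefix_free(codes):
    words = list(codes.values())
    # the code fails if any codeword is a proper prefix of another
    return not any(a != b and b.startswith(a) for a in words for b in words)

print(is_prefix_free({"A": "0", "B": "100", "C": "1010", "D": "1011", "R": "11"}))  # True
print(is_prefix_free({"A": "0", "B": "100", "C": "1010", "D": "1011", "R": "11", "F": "10"}))  # False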
Practical considerations
• It is not practical to create a Huffman encoding for
a single short string, such as ABRACADABRA
– To decode it, you would need the code table
– If you include the code table with the message, the
whole thing is bigger than the plain ASCII message
• Huffman encoding is practical if:
– The encoded string is large relative to the code table, OR
– We agree on the code table beforehand
• For example, it’s easy to find a table of letter frequencies for
English (or any other alphabet-based language)
About the example
• My example gave a nice, good-looking binary
tree, with no lines crossing other lines
– That’s because I chose my example and numbers
carefully
– If you do this for real data, you can expect your
drawing will be a lot messier—that’s OK
Data compression
• Huffman encoding is a simple example of data
compression: representing data in fewer bits than
it would otherwise need
• A more sophisticated method is GIF (Graphics
Interchange Format) compression, for .gif files
• Another is JPEG (Joint Photographic Experts
Group), for .jpg files
– Unlike the others, JPEG is lossy—it loses information
– Generally OK for photographs (if you don’t compress
them too much), because decompression adds “fake”
data very similar to the original
The End
