Entropy
1 Entropy
Let X be a discrete random variable with alphabet X = {1, 2, . . . , m}. Assume there is a probability mass function p(x) over X. How many binary questions, on average, does it take to determine the outcome?
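The answer, developed below, turns out to be roughly the entropy H(X) = \sum_x p(x) \log \frac{1}{p(x)}. As a quick illustration (a minimal sketch; the function name and example distributions are ours):

import math

def entropy(p):
    """Shannon entropy H(X) = sum over x of p(x) * log2(1/p(x)), in bits."""
    return sum(px * math.log2(1.0 / px) for px in p if px > 0)

# A fair 8-sided die takes 3 yes/no questions (binary search): H = log2 8 = 3.
print(entropy([1 / 8] * 8))   # 3.0
# A biased coin has H < 1 bit: it is cheaper to describe than a fair one.
print(entropy([0.9, 0.1]))    # about 0.469 bits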
2 Source Coding
Definition 2.1: A (binary) source code C for a random variable X is a mapping from X to finite binary strings. Let C(x) be the codeword corresponding to x and let l(x) denote the length of C(x).
Definition 2.2: A prefix code is a code in which no codeword is a prefix of any other codeword.
The nice property of a prefix code is that one can transmit multiple outcomes x1 , x2 , . . . xn by just concatenating the codewords into C(x1 )C(x2 ) . . . C(xn ), and the receiver can decode each xi as soon as the last bit of C(xi ) arrives, with no lookahead. In this sense, prefix codes are “self-punctuating”.
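For instance, here is a minimal sketch of instantaneous decoding for a small hypothetical prefix code over X = {1, 2, 3}; the code table and function name are illustrative:

# Hypothetical prefix code for X = {1, 2, 3}; any prefix-free table works.
code = {1: '0', 2: '10', 3: '11'}

def decode(bits, code):
    """Decode a concatenated stream instantly: emit x as soon as a codeword matches."""
    inverse = {w: x for x, w in code.items()}
    out, buf = [], ''
    for b in bits:
        buf += b
        if buf in inverse:          # prefix property: at most one match, no lookahead
            out.append(inverse[buf])
            buf = ''
    return out

print(decode('0101100', code))  # [1, 2, 3, 1, 1]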
Let the expected length of C be:

L(C) = \sum_{x \in \mathcal{X}} p(x) l(x)
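Continuing the hypothetical three-symbol code from above, under an assumed distribution p:

# Expected length of the code {1: '0', 2: '10', 3: '11'} under an assumed p.
p = {1: 0.5, 2: 0.25, 3: 0.25}
code = {1: '0', 2: '10', 3: '11'}
L = sum(p[x] * len(w) for x, w in code.items())
print(L)  # 1.5 bits; here L equals H(X) because every p(x) is a power of 2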
Theorem 2.3: The expected length of any prefix code is at least the entropy, i.e.

L(C) \ge H(X)

and there exists a prefix code whose expected length is within one bit of the entropy:

L(C) \le H(X) + 1
2.1 The Kraft inequality

Theorem 2.4 (Kraft inequality): The codeword lengths of any (binary) prefix code satisfy

\sum_{x \in \mathcal{X}} 2^{-l(x)} \le 1

This theorem is actually more general and applies to uniquely decodable codes. Conversely, given a set of codeword lengths which satisfy this inequality, there exists a prefix code with these lengths.
For the forward direction, generate an infinite sequence of independent fair coin flips and, for each x, let E_x be the event that the stream begins with the codeword C(x), so that \Pr(E_x) = 2^{-l(x)}. By the prefix condition, no codeword begins with another, so the events E_x are disjoint and

\sum_{x \in \mathcal{X}} 2^{-l(x)} = \sum_{x \in \mathcal{X}} \Pr(E_x) = \Pr\Big( \bigcup_{x \in \mathcal{X}} E_x \Big) \le 1

where the last step follows since probabilities are bounded by 1. This proves the first statement. We proved the forward direction with a technique known as the “probabilistic method”.
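A quick simulation of this argument (a sketch; the code table is the same hypothetical one as above):

import random

# Empirical check of the disjoint-events argument for the prefix code
# {'0', '10', '11'}: a random bit stream begins with at most one codeword,
# and the probabilities 2^{-l(x)} sum to at most 1.
random.seed(0)
words = ['0', '10', '11']
trials, hits = 100_000, 0
for _ in range(trials):
    stream = ''.join(random.choice('01') for _ in range(2))  # 2 = max length
    matches = sum(stream.startswith(w) for w in words)
    assert matches <= 1  # disjointness, from the prefix condition
    hits += matches
print(hits / trials)  # close to sum of 2^{-l(x)} = 1.0 (this code is complete)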
For the converse, order the lengths in ascending order l1 ≤ . . . ≤ lm . Pick codewords in this order, subject to the constraint that no previously chosen codeword is a prefix of the new one. To prove that this works, consider a full binary tree of depth lm . Associate each codeword with a path on the tree, from the root to some node at depth li (the end node of the codeword). The prefix condition states that the path of each codeword must not pass through the end node of another codeword’s path. With each leaf node, associate a probability mass of 2^{-lm}; the end node of codeword i then has 2^{lm - li} leaves below it, for a total mass of 2^{-li}. Note that each chosen codeword removes 2^{-li} from the remaining mass to be allocated. Furthermore, by the prefix condition, an allocation is always possible as long as there is enough remaining mass. As the lengths satisfy the Kraft inequality, the initial mass of 1 is enough to assign valid codewords to all the items.
2.2 The proof of the source coding theorem
We first show that there exists a code within one bit of the entropy. Choose the lengths
as:
l(x) = \left\lceil \log \frac{1}{p(x)} \right\rceil
This choice is integer-valued and satisfies the Kraft inequality, since 2^{-l(x)} \le 2^{-\log(1/p(x))} = p(x) and the p(x) sum to 1; hence a prefix code with these lengths exists. Also, we can upper bound the average code length as follows:
\sum_{x \in \mathcal{X}} p(x) l(x) = \sum_{x \in \mathcal{X}} p(x) \left\lceil \log \frac{1}{p(x)} \right\rceil
\le \sum_{x \in \mathcal{X}} p(x) \left( \log \frac{1}{p(x)} + 1 \right)
= H(X) + 1
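A numeric check of both steps, with an assumed example distribution:

import math

# Shannon code lengths l(x) = ceil(log2(1/p(x))) for an assumed distribution p.
p = [0.5, 0.25, 0.15, 0.1]
lengths = [math.ceil(math.log2(1.0 / px)) for px in p]
H = sum(px * math.log2(1.0 / px) for px in p)
L = sum(px * l for px, l in zip(p, lengths))
assert sum(2.0 ** -l for l in lengths) <= 1   # Kraft holds, so a prefix code exists
assert H <= L <= H + 1                        # the bound just derived
print(lengths, round(H, 3), round(L, 3))      # [1, 2, 3, 4] 1.743 1.85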
Now, let us prove the lower bound on L(C). Consider the optimization problem

\min_{l(\cdot)} \sum_{x \in \mathcal{X}} p(x) l(x) \quad \text{such that} \quad \sum_{x \in \mathcal{X}} 2^{-l(x)} \le 1

The above finds the shortest possible expected code length subject to satisfying the Kraft inequality, which every prefix code must obey. If we relax the code lengths to be non-integer, the optimal value can only decrease, so its solution is a lower bound on L(C).
To do this, the Lagrangian is:

\mathcal{L} = \sum_{x \in \mathcal{X}} p(x) l(x) + \lambda \left( \sum_{x \in \mathcal{X}} 2^{-l(x)} - 1 \right)
Taking derivatives with respect to l(x) and λ and setting them to 0 leads to:

p(x) - \lambda \ln 2 \cdot 2^{-l(x)} = 0

\sum_{x \in \mathcal{X}} 2^{-l(x)} - 1 = 0
Solving this gives \lambda = 1/\ln 2 and

l(x) = \log \frac{1}{p(x)}

which can be verified by direct substitution. Substituting these lengths into the objective yields \sum_{x} p(x) \log \frac{1}{p(x)} = H(X), so any prefix code has L(C) \ge H(X). This proves the lower bound.
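To sanity-check the relaxed solution (a sketch; the distribution is an assumed example):

import math

# At the relaxed optimum l(x) = log2(1/p(x)), the Kraft constraint is tight
# and the objective equals the entropy H(X).
p = [0.4, 0.3, 0.2, 0.1]
l = [math.log2(1.0 / px) for px in p]
print(sum(2.0 ** -li for li in l))              # 1.0: constraint met with equality
print(sum(px * li for px, li in zip(p, l)))     # objective value, about 1.846 ...
print(-sum(px * math.log2(px) for px in p))     # ... equals H(X), same value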