
Information Theory

Lecture 6: Differential Entropy and the Gaussian Channel

Basak Guler

1 / 17
Differential Entropy
• So far we have only worked with discrete random variables.

• We will now extend our information measures to continuous random variables.

• Recall that continuous random variables are characterized by a PDF (probability density function) f(x) instead of the PMF p(x). The PDF satisfies the properties f(x) ≥ 0 and \int_{-\infty}^{\infty} f(x)\,dx = 1.

• Definition 38 (Differential Entropy). Differential entropy of a continuous random variable X with a PDF f(x) is defined as:

    h(X) = -\int_S f(x) \log f(x)\,dx = \int_S f(x) \log \frac{1}{f(x)}\,dx = E[-\log f(X)]

where S is the set of all x for which f(x) > 0 (support set of X).

2 / 17
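The following is a small numerical sketch (not part of the original slides) showing how Definition 38 can be checked by direct integration; it assumes numpy and scipy are available, and the unit-rate exponential density is only an illustrative choice.

    # Hedged sketch: approximate h(X) = E[-log2 f(X)] by numerical integration.
    import numpy as np
    from scipy import integrate, stats

    def differential_entropy_bits(pdf, lower, upper):
        # h(X) = -\int_S f(x) log2 f(x) dx, approximated over [lower, upper].
        integrand = lambda x: -pdf(x) * np.log2(pdf(x)) if pdf(x) > 0 else 0.0
        value, _ = integrate.quad(integrand, lower, upper)
        return value

    # Example: the unit-rate exponential density has h(X) = 1 nat = log2(e) ≈ 1.4427 bits.
    print(differential_entropy_bits(stats.expon.pdf, 0, np.inf))
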
Example 26
• Example 26 (Uniform Random Variable). Let X ∼ U[a, b] be a uniform random variable with PDF

    f(x) = \begin{cases} \frac{1}{b-a} & \text{for } x \in [a, b] \\ 0 & \text{otherwise} \end{cases}

for some b > a. Find its differential entropy.

• Solution.

    h(X) = -\int_S f(x) \log f(x)\,dx = -\int_a^b \frac{1}{b-a} \log \frac{1}{b-a}\,dx = \log(b - a)

• Unlike regular entropy, which is always ≥ 0, differential entropy can be positive or negative.

• For example, in the above example, if b − a < 1, then h(X) < 0!

3 / 17
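A quick numerical illustration of this sign behavior (added here, not from the lecture; it assumes numpy):

    # h(X) = log2(b - a) bits for X ~ U[a, b]; negative once b - a < 1.
    import numpy as np

    for a, b in [(0.0, 2.0), (0.0, 1.0), (0.0, 0.5)]:
        print(f"U[{a}, {b}]: h(X) = {np.log2(b - a):.4f} bits")
    # Prints 1.0000, 0.0000, and -1.0000 bits, respectively.
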
Example 27
• Example 27 (Gaussian Random Variable). Let X ∼ N(0, σ²) be a Gaussian random variable with mean 0 and variance σ². The corresponding PDF is:

    f(x) = \frac{1}{\sqrt{2\pi\sigma^2}} e^{-\frac{x^2}{2\sigma^2}}

Find its differential entropy.

4 / 17
Example 27
• Solution.

    h(X) = -\int_S f(x) \log f(x)\,dx
         = -\int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi\sigma^2}} e^{-\frac{x^2}{2\sigma^2}} \log\left( \frac{1}{\sqrt{2\pi\sigma^2}} e^{-\frac{x^2}{2\sigma^2}} \right) dx
         = -\int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi\sigma^2}} e^{-\frac{x^2}{2\sigma^2}} \left( \log \frac{1}{\sqrt{2\pi\sigma^2}} - \frac{x^2}{2\sigma^2} \log e \right) dx
         = \log\sqrt{2\pi\sigma^2} \underbrace{\int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi\sigma^2}} e^{-\frac{x^2}{2\sigma^2}}\,dx}_{=1} + \frac{\log e}{2\sigma^2} \underbrace{\int_{-\infty}^{\infty} x^2 \frac{1}{\sqrt{2\pi\sigma^2}} e^{-\frac{x^2}{2\sigma^2}}\,dx}_{=E[X^2]=\sigma^2+0^2=\sigma^2}
         = \frac{1}{2} \log(2\pi\sigma^2) + \frac{1}{2} \log e
         = \frac{1}{2} \log(2\pi e \sigma^2) \text{ bits}

5 / 17
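As a hedged check of Example 27 (not in the original slides; numpy and scipy assumed), the Monte Carlo estimate of E[−log2 f(X)] below should agree with the closed form ½ log2(2πeσ²):

    # Monte Carlo sketch: E[-log2 f(X)] for X ~ N(0, sigma^2) vs the closed form.
    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(0)
    sigma = 2.0
    samples = rng.normal(0.0, sigma, size=1_000_000)

    h_monte_carlo = np.mean(-np.log2(stats.norm.pdf(samples, loc=0.0, scale=sigma)))
    h_closed_form = 0.5 * np.log2(2 * np.pi * np.e * sigma**2)
    print(h_monte_carlo, h_closed_form)  # both close to 3.047 bits for sigma = 2
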
Other properties of Differential Entropy
• Relative entropy (KL-distance) between two PDFs f(x) and g(x) is:

    D(f||g) = \int f(x) \log \frac{f(x)}{g(x)}\,dx \geq 0

• Joint and conditional differential entropy are defined similarly to the discrete case:

    h(X, Y) = E[-\log f(X, Y)]    (joint differential entropy)
    h(X|Y) = E[-\log f(X|Y)]      (conditional differential entropy)

• Mutual information is defined from these quantities as in the discrete case:

    I(X; Y) = h(X) + h(Y) - h(X, Y)
            = h(X) - h(X|Y)
            = h(Y) - h(Y|X)
            \geq 0

  Mutual information is always non-negative!

6 / 17
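The non-negativity of D(f||g) can be illustrated numerically; the sketch below (an addition, not from the slides; scipy assumed) integrates the KL divergence between two example Gaussian densities and shows it is positive, and zero when the two densities coincide.

    # Numerical sketch: D(f||g) = \int f(x) log2(f(x)/g(x)) dx for example Gaussians.
    import numpy as np
    from scipy import integrate, stats

    def kl_bits(f_pdf, g_pdf):
        integrand = lambda x: f_pdf(x) * np.log2(f_pdf(x) / g_pdf(x))
        value, _ = integrate.quad(integrand, -20, 20)  # tails beyond +/-20 are negligible here
        return value

    f = lambda x: stats.norm.pdf(x, loc=0.0, scale=1.0)
    g = lambda x: stats.norm.pdf(x, loc=1.0, scale=2.0)
    print(kl_bits(f, g))  # > 0
    print(kl_bits(f, f))  # = 0, consistent with equality iff f = g
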
Other properties of Differential Entropy
• We also have the following properties, for any constant c:

    h(X + c) = h(X)                  (translation does not change entropy)
    h(X · c) = h(X) + \log|c|        for c ≠ 0

7 / 17
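A short check of the scaling rule (added; not from the lecture; numpy assumed), using the Gaussian closed form from Example 27: if X ∼ N(0, σ²) then cX ∼ N(0, c²σ²), so h(cX) and h(X) + log2|c| can be compared directly.

    # Verify h(cX) = h(X) + log2|c| via the Gaussian closed form.
    import numpy as np

    def gaussian_h_bits(variance):
        return 0.5 * np.log2(2 * np.pi * np.e * variance)

    sigma2, c = 1.5, -3.0
    lhs = gaussian_h_bits(c**2 * sigma2)               # h(cX), since cX ~ N(0, c^2 sigma^2)
    rhs = gaussian_h_bits(sigma2) + np.log2(abs(c))    # h(X) + log2|c|
    print(lhs, rhs)  # equal up to floating-point error
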
Gaussian Distribution Maximizes Differential Entropy
• Recall that entropy was maximized by a uniform distribution.

• Differential entropy is maximized by a Gaussian distribution.

• Theorem 36 (Gaussian Maximizes Differential Entropy): Let X be a random variable with PDF f(x) and second moment E[X²] ≤ σ². Then,

    h(X) \leq \frac{1}{2} \log(2\pi e \sigma^2)

with equality if and only if X ∼ N(0, σ²).

• In other words, the differential entropy h(X) is maximized when X is a Gaussian random variable.

8 / 17
Gaussian Distribution Maximizes Differential Entropy
• Proof. Let g(x) = \frac{1}{\sqrt{2\pi\sigma^2}} e^{-\frac{x^2}{2\sigma^2}} be the Gaussian PDF for N(0, σ²).

• Then, \log g(x) = -\log\sqrt{2\pi\sigma^2} - \frac{x^2}{2\sigma^2} \log e and therefore,

    0 \leq D(f||g)                                                                                      (1)
      = \int_{-\infty}^{\infty} f(x) \log \frac{f(x)}{g(x)}\,dx
      = \int_{-\infty}^{\infty} f(x) \log f(x)\,dx - \int_{-\infty}^{\infty} f(x) \log g(x)\,dx
      = \int_{-\infty}^{\infty} f(x) \log f(x)\,dx - \int_{-\infty}^{\infty} f(x) \left( -\log\sqrt{2\pi\sigma^2} - \frac{x^2}{2\sigma^2} \log e \right) dx
      = -h(X) + \log\sqrt{2\pi\sigma^2} + \frac{\log e}{2\sigma^2} \int_{-\infty}^{\infty} x^2 f(x)\,dx
      = -h(X) + \log\sqrt{2\pi\sigma^2} + \frac{\log e}{2\sigma^2} E[X^2]
      \leq -h(X) + \log\sqrt{2\pi\sigma^2} + \frac{\log e}{2\sigma^2} \sigma^2
      = -h(X) + \frac{1}{2} \log 2\pi e \sigma^2
9 / 17
Gaussian Distribution Maximizes Differential Entropy
• Hence

    h(X) \leq \frac{1}{2} \log 2\pi e \sigma^2

• For this to become an equality, we must have in equation (1),

    D(f||g) = 0                                                                                         (2)

which occurs if and only if f(x) = g(x), which means f(x) should be a Gaussian distribution N(0, σ²).

10 / 17
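To make Theorem 36 concrete, here is a small added comparison (not in the original slides; numpy assumed): among the two densities below with the same second moment σ², the Gaussian one has the larger differential entropy.

    # Uniform vs Gaussian with the same second moment sigma^2.
    import numpy as np

    sigma2 = 1.0
    a = np.sqrt(3 * sigma2)                # U[-a, a] has E[X^2] = a^2 / 3 = sigma^2
    h_uniform = np.log2(2 * a)             # log2(b - a) from Example 26
    h_gaussian = 0.5 * np.log2(2 * np.pi * np.e * sigma2)
    print(h_uniform, h_gaussian)  # ~1.79 bits < ~2.05 bits, as Theorem 36 predicts
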
The Gaussian Channel
• Definition 39 (Gaussian Channel). The Gaussian channel is defined as:

    Y_i = X_i + Z_i

where X_i is the channel input (at time i), Y_i is the channel output, and Z_i ∼ N(0, N) is the noise, drawn i.i.d. from a Gaussian distribution. Z_i is independent of X_i.

  [Block diagram: channel input X_i and noise Z_i are added to produce channel output Y_i.]

• Most widely known continuous channel model.

• Channel is memoryless & discrete-time, but has a continuous alphabet.

• Simple and tractable model for “real-life” communication channels.

11 / 17
The Gaussian Channel
Intuition behind “Gaussian”:

• The intuition for using a Gaussian random variable to model the channel noise is based on the central limit theorem.

• Communication channels (wireless or wired) are subject to a variety of “random noise events”, such as thermal noise from electrical circuits, signals originating from unknown devices, etc.

• The central limit theorem says that the aggregate effect of a large number of these random events will have a Gaussian distribution.

• So we represent this “aggregate effect” as a Gaussian random variable.

12 / 17
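As an optional illustration of this intuition (added, not part of the lecture; numpy assumed), summing many small independent, non-Gaussian noise contributions yields an aggregate that behaves like Gaussian noise:

    # CLT sketch: aggregate of many small independent uniform noise events.
    import numpy as np

    rng = np.random.default_rng(1)
    n_events, n_samples = 200, 100_000
    aggregate = rng.uniform(-0.1, 0.1, size=(n_samples, n_events)).sum(axis=1)

    # For a Gaussian, about 95.4% of samples lie within two standard deviations.
    z = (aggregate - aggregate.mean()) / aggregate.std()
    print(np.mean(np.abs(z) < 2))  # close to 0.954
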
Gaussian Channel Capacity
• Recall the channel capacity for the discrete alphabet case:

    C = \max_{p(x)} I(X; Y)                                                                             (3)

• For the discrete case, we know that C ≤ min{log |X|, log |Y|}.

• For the continuous case, with no restrictions on X, (3) does not have an upper bound; X can take any value!

• For example, using very large powers we could send an infinite number of inputs by placing the X’s far away from each other, so that each signal is distinguishable with arbitrarily low error probability.

• However, this requires transmissions with infinite power, which is not physically meaningful.

• A common technique is to have an average power constraint E[X²] ≤ P. For a codeword {x_1, . . . , x_n} of length n → ∞ this would mean \frac{1}{n} \sum_{i=1}^{n} x_i^2 → E[X²] ≤ P.
13 / 17
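A brief added sketch of the average power constraint (not from the slides; numpy assumed): for a long codeword drawn i.i.d. from N(0, P), the empirical power (1/n)∑ x_i² concentrates around P.

    # Empirical average power of a length-n codeword drawn i.i.d. from N(0, P).
    import numpy as np

    rng = np.random.default_rng(2)
    P, n = 4.0, 1_000_000
    codeword = rng.normal(0.0, np.sqrt(P), size=n)
    print(np.mean(codeword**2))  # close to P = 4.0, so E[X^2] <= P holds on average
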
Gaussian Channel Capacity
• Definition 40. The capacity of the Gaussian channel with average power constraint P:

    C = \max_{p(x): E[X^2] \leq P} I(X; Y)                                                              (4)

  [Block diagram: channel input X and noise Z are added to produce channel output Y.]

14 / 17
Gaussian Channel Capacity
• Theorem 37 (Gaussian Channel Capacity). The capacity of the Gaussian channel with average power constraint P and noise variance N is equal to:

    C = \frac{1}{2} \log\left(1 + \frac{P}{N}\right)                                                    (5)

which is attained by a Gaussian input distribution X ∼ N(0, P).

15 / 17
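As a numerical companion to Theorem 37 (an addition, not from the lecture; numpy assumed), the snippet evaluates C = ½ log2(1 + P/N) in bits per channel use for a few signal-to-noise ratios.

    # Gaussian channel capacity C = 0.5 * log2(1 + P/N) bits per channel use.
    import numpy as np

    def gaussian_capacity_bits(P, N):
        return 0.5 * np.log2(1.0 + P / N)

    for P, N in [(1.0, 1.0), (10.0, 1.0), (100.0, 1.0)]:
        print(f"P/N = {P / N:5.1f}  ->  C = {gaussian_capacity_bits(P, N):.3f} bits/use")
    # Prints 0.500, 1.730, and 3.329 bits per channel use.
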
Gaussian Channel Capacity
• Proof. Let Y = X + Z, where X and Z are independent of each other, X ∼ p(x), and Z ∼ N(0, N).

• Then,

    I(X; Y) = h(Y) - h(Y|X)
            = h(Y) - h(X + Z|X)
            = h(Y) - h(Z|X)
            = h(Y) - h(Z)                                  (as X and Z are independent)
            = h(Y) - \frac{1}{2} \log 2\pi e N             (since Z ∼ N(0, N))

16 / 17
Gaussian Channel Capacity
• Now, note that the second moment of Y is:

    E[Y^2] = E[(X + Z)^2] = \underbrace{E[X^2]}_{\leq P} + 2\underbrace{E[XZ]}_{=E[X]E[Z]=0} + \underbrace{E[Z^2]}_{=N} \leq P + N    (6)

where E[XZ] = E[X]E[Z] = 0 as X and Z are independent and E[Z] = 0.

• We know that for a given second moment E[Y²], the Gaussian random variable maximizes the differential entropy.

• So h(Y) is maximized if Y is a Gaussian random variable with variance P + N, in which case:

    h(Y) = \frac{1}{2} \log 2\pi e (P + N)                                                              (7)

and therefore:

    I(X; Y) \leq \frac{1}{2} \log 2\pi e (P + N) - \frac{1}{2} \log 2\pi e N = \frac{1}{2} \log\left(1 + \frac{P}{N}\right)           (8)

where the upper bound can be achieved by choosing X ∼ N(0, P).
17 / 17
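Finally, a hedged simulation sketch (added; not in the original slides; numpy assumed): with the capacity-achieving input X ∼ N(0, P), the empirical second moment of Y = X + Z is close to P + N, and plugging the Gaussian closed forms for h(Y) and h(Z) into I(X; Y) = h(Y) − h(Z) recovers ½ log2(1 + P/N).

    # Simulation sketch of the capacity-achieving input for the Gaussian channel.
    import numpy as np

    rng = np.random.default_rng(3)
    P, N, n = 4.0, 1.0, 1_000_000
    X = rng.normal(0.0, np.sqrt(P), size=n)   # capacity-achieving input X ~ N(0, P)
    Z = rng.normal(0.0, np.sqrt(N), size=n)   # channel noise Z ~ N(0, N)
    Y = X + Z

    print(np.mean(Y**2))                                   # close to P + N = 5.0, as in (6)
    h_Y = 0.5 * np.log2(2 * np.pi * np.e * (P + N))        # (7)
    h_Z = 0.5 * np.log2(2 * np.pi * np.e * N)
    print(h_Y - h_Z, 0.5 * np.log2(1 + P / N))             # both ~1.161 bits, matching (8)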
