Stat520 Ch.4
3 Proof by Topic
A
• Accuracy and complexity for Bayesian linear regression, 497
• Accuracy and complexity for Bayesian linear regression with known covariance, 521
• Accuracy and complexity for the univariate Gaussian, 358
• Accuracy and complexity for the univariate Gaussian with known variance, 370
• Addition law of probability, 13
• Addition of the differential entropy upon multiplication with a constant, 92
• Addition of the differential entropy upon multiplication with invertible matrix, 94
• Additivity of the Kullback-Leibler divergence for independent distributions, 109
• Additivity of the variance for independent random variables, 59
• Akaike information criterion for multiple linear regression, 489
• Application of Cochran’s theorem to two-way analysis of variance, 397
B
• Bayes’ rule, 133
• Bayes’ theorem, 132
• Bayesian information criterion for multiple linear regression, 490
• Best linear unbiased estimator for the inverse general linear model, 537
• Binomial test, 550
• Brier scoring rule is strictly proper scoring rule, 140
C
• Characteristic function of a function of a random variable, 35
• Chi-squared distribution is a special case of gamma distribution, 252
• Combined posterior distribution for Bayesian linear regression when analyzing conditionally independent data sets, 509
• Combined posterior distributions in terms of individual posterior distributions obtained from conditionally independent data, 127
• Concavity of the Shannon entropy, 86
• Conditional distributions of the multivariate normal distribution, 299
• Conditional distributions of the normal-gamma distribution, 318
• Conjugate prior distribution for Bayesian linear regression, 492
• Conjugate prior distribution for Bayesian linear regression with known covariance, 515
• Conjugate prior distribution for binomial observations, 554
• Conjugate prior distribution for multinomial observations, 564
• Conjugate prior distribution for multivariate Bayesian linear regression, 543
• Conjugate prior distribution for Poisson-distributed data, 573
• Conjugate prior distribution for the Poisson distribution with exposure values, 580
• Conjugate prior distribution for the univariate Gaussian, 351
• Conjugate prior distribution for the univariate Gaussian with known variance, 364
• Construction of confidence intervals using Wilks’ theorem, 114
• Construction of unbiased estimator for variance, 602
• Construction of unbiased estimator for variance in multiple linear regression, 603
• Continuous uniform distribution maximizes differential entropy for fixed range, 184
• Convexity of the cross-entropy, 88
• Convexity of the Kullback-Leibler divergence, 108
$$U_i = \frac{X_i - \mu}{\sigma} \; , \tag{5}$$

which follows a standard normal distribution (→ II/3.2.4):

$$U_i \sim \mathcal{N}(0, 1) \; . \tag{6}$$
Then, the sum of squared random variables Ui can be rewritten as
$$\begin{split}
\sum_{i=1}^{n} U_i^2 &= \sum_{i=1}^{n} \left( \frac{X_i - \mu}{\sigma} \right)^2 \\
&= \sum_{i=1}^{n} \left( \frac{(X_i - \bar{X}) + (\bar{X} - \mu)}{\sigma} \right)^2 \\
&= \sum_{i=1}^{n} \frac{(X_i - \bar{X})^2}{\sigma^2} + \sum_{i=1}^{n} \frac{(\bar{X} - \mu)^2}{\sigma^2} + 2 \sum_{i=1}^{n} \frac{(X_i - \bar{X})(\bar{X} - \mu)}{\sigma^2} \\
&= \sum_{i=1}^{n} \left( \frac{X_i - \bar{X}}{\sigma} \right)^2 + \sum_{i=1}^{n} \left( \frac{\bar{X} - \mu}{\sigma} \right)^2 + 2 \, \frac{\bar{X} - \mu}{\sigma^2} \sum_{i=1}^{n} (X_i - \bar{X}) \; .
\end{split} \tag{7}$$
Because the sum of deviations from the sample mean is zero,

$$\begin{split}
\sum_{i=1}^{n} (X_i - \bar{X}) &= \sum_{i=1}^{n} X_i - n \bar{X} \\
&= \sum_{i=1}^{n} X_i - n \cdot \frac{1}{n} \sum_{i=1}^{n} X_i \\
&= \sum_{i=1}^{n} X_i - \sum_{i=1}^{n} X_i \\
&= 0 \; ,
\end{split} \tag{8}$$

the last term in (7) vanishes and the sum of squares reduces to

$$\sum_{i=1}^{n} U_i^2 = \sum_{i=1}^{n} \left( \frac{X_i - \bar{X}}{\sigma} \right)^2 + \sum_{i=1}^{n} \left( \frac{\bar{X} - \mu}{\sigma} \right)^2 \; . \tag{9}$$
Cochran’s theorem states that, if a sum of squared standard normal (→ II/3.2.3) random variables (→ I/1.2.2) can be written as a sum of quadratic forms

$$\sum_{i=1}^{n} U_i^2 = \sum_{j=1}^{m} Q_j \quad \text{where} \quad Q_j = \sum_{k=1}^{n} \sum_{l=1}^{n} U_k B_{kl}^{(j)} U_l \quad \text{with} \quad \sum_{j=1}^{m} B^{(j)} = I_n \; , \tag{10}$$

then, provided the ranks $r_j = \mathrm{rank}(B^{(j)})$ sum to $n$, the $Q_j$ are independent and each $Q_j$ follows a chi-squared distribution with $r_j$ degrees of freedom.
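As an illustrative numerical check of this application (not part of the original proof), the decomposition in (9) corresponds to the quadratic-form matrices $B^{(1)} = I_n - \frac{1}{n} J_n$ (the centering matrix, rank $n-1$) and $B^{(2)} = \frac{1}{n} J_n$ (rank 1), where $J_n$ is the all-ones matrix. A minimal sketch using numpy, with arbitrary example values:

```python
import numpy as np

rng = np.random.default_rng(0)
n, mu, sigma = 8, 2.0, 1.5
x = rng.normal(mu, sigma, size=n)

u = (x - mu) / sigma                      # standardized variables U_i, eq. (5)
J = np.ones((n, n)) / n                   # averaging matrix (1/n) * ones
B1 = np.eye(n) - J                        # centering matrix, rank n-1
B2 = J                                    # rank-1 projection onto the mean

print(np.allclose(B1 + B2, np.eye(n)))    # True: the B^(j) sum to I_n, eq. (10)

Q1 = u @ B1 @ u                           # quadratic form for the within-sample part
Q2 = u @ B2 @ u                           # quadratic form for the sample-mean part

# These match the two sums on the right-hand side of eq. (9):
print(np.isclose(Q1, np.sum(((x - x.mean()) / sigma) ** 2)))   # True
print(np.isclose(Q2, n * ((x.mean() - mu) / sigma) ** 2))      # True
print(np.isclose(Q1 + Q2, np.sum(u ** 2)))                     # True
```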
Proof: We are interested in the probability that Y equals a number m. According to the law of
marginal probability (→ I/1.3.3) or the law of total probability (→ I/1.4.7), this probability can be
expressed as:
$$\Pr(Y = m) = \sum_{k=0}^{\infty} \Pr(Y = m \mid X = k) \cdot \Pr(X = k) \; . \tag{4}$$
Since, by definitions (2) and (1), Pr(X = k) = 0 when k > n and Pr(Y = m|X = k) = 0 when
k < m, we have:
$$\Pr(Y = m) = \sum_{k=m}^{n} \Pr(Y = m \mid X = k) \cdot \Pr(X = k) \; . \tag{5}$$
Now we can take the probability mass function of the binomial distribution (→ II/1.3.2) and plug it
in for the terms in the sum of (5) to get:
$$\Pr(Y = m) = \sum_{k=m}^{n} \binom{k}{m} q^m (1-q)^{k-m} \cdot \binom{n}{k} p^k (1-p)^{n-k} \; . \tag{6}$$
Applying the binomial coefficient identity $\binom{n}{k} \binom{k}{m} = \binom{n}{m} \binom{n-m}{k-m}$ and rearranging the terms, we have:
$$\Pr(Y = m) = \sum_{k=m}^{n} \binom{n}{m} \binom{n-m}{k-m} p^k q^m (1-p)^{n-k} (1-q)^{k-m} \; . \tag{7}$$
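The binomial coefficient identity used here is easy to spot-check; a short Python snippet (standard library only, not part of the original proof) verifies it over a small grid of values:

```python
from math import comb

# Spot-check C(n,k)*C(k,m) == C(n,m)*C(n-m,k-m) for small n, k, m.
for n in range(12):
    for k in range(n + 1):
        for m in range(k + 1):
            assert comb(n, k) * comb(k, m) == comb(n, m) * comb(n - m, k - m)
print("identity holds on the tested grid")
```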
Now we partition $p^k = p^m \cdot p^{k-m}$ and pull all terms not dependent on $k$ out of the sum:

$$\begin{split}
\Pr(Y = m) &= \binom{n}{m} p^m q^m \sum_{k=m}^{n} \binom{n-m}{k-m} p^{k-m} (1-p)^{n-k} (1-q)^{k-m} \\
&= \binom{n}{m} (pq)^m \sum_{k=m}^{n} \binom{n-m}{k-m} \bigl( p(1-q) \bigr)^{k-m} (1-p)^{n-k} \; .
\end{split} \tag{8}$$
Substituting $i = k - m$ into the remaining sum and applying the binomial theorem, we obtain

$$\sum_{i=0}^{n-m} \binom{n-m}{i} (p - pq)^i (1-p)^{n-m-i} = \bigl( (p - pq) + (1 - p) \bigr)^{n-m} \; . \tag{11}$$
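Since $(p - pq) + (1 - p) = 1 - pq$, the sum in (11) equals $(1 - pq)^{n-m}$, so $\Pr(Y = m) = \binom{n}{m} (pq)^m (1 - pq)^{n-m}$, i.e. $Y$ follows a binomial distribution with parameters $n$ and $pq$. A brief numerical sanity check of this conclusion, assuming scipy is available and using hypothetical parameter values:

```python
from math import comb
from scipy.stats import binom

# Hypothetical parameters for a spot check.
n, p, q = 10, 0.6, 0.3

for m in range(n + 1):
    # Marginal of Y via eq. (5)/(6): sum over the intermediate variable X = k.
    direct = sum(
        comb(k, m) * q**m * (1 - q)**(k - m) * comb(n, k) * p**k * (1 - p)**(n - k)
        for k in range(m, n + 1)
    )
    assert abs(direct - binom.pmf(m, n, p * q)) < 1e-12
print("marginal of Y matches Bin(n, p*q)")
```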
$$\begin{split}
\mu_n &= \frac{\lambda_0 \mu_0 + \tau n \bar{y}}{\lambda_0 + \tau n} \\
\lambda_n &= \lambda_0 + \tau n
\end{split} \tag{4}$$

with the sample mean (→ I/1.10.2) $\bar{y}$ and the inverse variance or precision (→ I/1.11.12) $\tau = 1/\sigma^2$.
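Before the proof, a minimal numerical sketch of this update rule, assuming (as the notation above suggests) that $\lambda_0$ and $\lambda_n$ denote the prior and posterior precision of $\mu$; the data and parameter values below are hypothetical and numpy is assumed:

```python
import numpy as np

def posterior_params(y, sigma2, mu0, lambda0):
    """Posterior mean and precision for a normal prior N(mu0, 1/lambda0)
    on the mean of i.i.d. N(mu, sigma2) data, following eq. (4)."""
    n = len(y)
    tau = 1.0 / sigma2                    # observation precision
    ybar = np.mean(y)                     # sample mean
    lambda_n = lambda0 + tau * n          # posterior precision
    mu_n = (lambda0 * mu0 + tau * n * ybar) / lambda_n
    return mu_n, lambda_n

# Hypothetical example values.
rng = np.random.default_rng(1)
y = rng.normal(5.0, 2.0, size=50)
print(posterior_params(y, sigma2=4.0, mu0=0.0, lambda0=0.1))
```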
Proof: According to the law of marginal probability (→ I/1.3.3), the model evidence (→ I/5.1.11)
for this model is:
$$p(y \mid m) = \int p(y \mid \mu) \, p(\mu) \, d\mu \; . \tag{5}$$
According to the law of conditional probability (→ I/1.3.4), the integrand is equivalent to the joint
likelihood (→ I/5.1.5):
$$p(y \mid m) = \int p(y, \mu) \, d\mu \; . \tag{6}$$
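As a sanity check of (5) and (6) that is not part of the original proof, the integral can be evaluated numerically for a single observation, in which case the model evidence is the density of $\mathcal{N}(\mu_0, \sigma^2 + 1/\lambda_0)$ evaluated at $y$ (a standard fact about sums of independent normals). A sketch assuming scipy and hypothetical values:

```python
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

# Hypothetical single observation: y ~ N(mu, sigma2), prior mu ~ N(mu0, 1/lambda0).
y, sigma2, mu0, lambda0 = 1.7, 2.0, 0.0, 0.5

# Model evidence via eq. (5): integrate likelihood times prior over mu.
evidence, _ = quad(
    lambda mu: norm.pdf(y, loc=mu, scale=np.sqrt(sigma2))
    * norm.pdf(mu, loc=mu0, scale=np.sqrt(1.0 / lambda0)),
    -np.inf, np.inf,
)

# For one observation the marginal of y is N(mu0, sigma2 + 1/lambda0).
print(np.isclose(evidence, norm.pdf(y, loc=mu0, scale=np.sqrt(sigma2 + 1.0 / lambda0))))
```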
$$\begin{split}
p(y \mid \mu, \sigma^2) &= \prod_{i=1}^{n} \mathcal{N}(y_i; \mu, \sigma^2) \\
&= \prod_{i=1}^{n} \frac{1}{\sqrt{2\pi\sigma^2}} \cdot \exp\left[ -\frac{1}{2} \left( \frac{y_i - \mu}{\sigma} \right)^2 \right] \\
&= \left( \sqrt{\frac{1}{2\pi\sigma^2}} \right)^n \cdot \exp\left[ -\frac{1}{2\sigma^2} \sum_{i=1}^{n} (y_i - \mu)^2 \right]
\end{split} \tag{7}$$
$$\begin{split}
p(y \mid \mu, \sigma^2) &= \prod_{i=1}^{n} \mathcal{N}(y_i; \mu, \sigma^2) \\
&= \prod_{i=1}^{n} \frac{1}{\sqrt{2\pi\sigma^2}} \cdot \exp\left[ -\frac{1}{2} \left( \frac{y_i - \mu}{\sigma} \right)^2 \right] \\
&= \frac{1}{(\sqrt{2\pi\sigma^2})^n} \cdot \exp\left[ -\frac{1}{2\sigma^2} \sum_{i=1}^{n} (y_i - \mu)^2 \right]
\end{split} \tag{3}$$
$$\begin{split}
p(y \mid \mu, \tau) &= \prod_{i=1}^{n} \mathcal{N}(y_i; \mu, \tau^{-1}) \\
&= \prod_{i=1}^{n} \sqrt{\frac{\tau}{2\pi}} \cdot \exp\left[ -\frac{\tau}{2} (y_i - \mu)^2 \right] \\
&= \left( \sqrt{\frac{\tau}{2\pi}} \right)^n \cdot \exp\left[ -\frac{\tau}{2} \sum_{i=1}^{n} (y_i - \mu)^2 \right]
\end{split} \tag{4}$$
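A quick numerical check, not part of the original derivation, that the closed form in (4) matches the product of normal densities; scipy is assumed and the data are hypothetical:

```python
import numpy as np
from scipy.stats import norm

# Hypothetical data and parameters for a spot check of eq. (4).
rng = np.random.default_rng(2)
y = rng.normal(1.0, 0.8, size=20)
mu, tau = 1.2, 1.0 / 0.8**2
n = len(y)

# Product of normal densities (first line of eq. (4)).
lhs = np.prod(norm.pdf(y, loc=mu, scale=np.sqrt(1.0 / tau)))

# Closed-form expression (last line of eq. (4)).
rhs = (tau / (2 * np.pi)) ** (n / 2) * np.exp(-tau / 2 * np.sum((y - mu) ** 2))

print(np.isclose(lhs, rhs))   # True
```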
$$\begin{split}
p(y \mid \mu, \tau) &= \sqrt{\frac{1}{(2\pi)^n}} \cdot \tau^{n/2} \cdot \exp\left[ -\frac{\tau}{2} \sum_{i=1}^{n} \left( y_i^2 - 2\mu y_i + \mu^2 \right) \right] \\
&= \sqrt{\frac{1}{(2\pi)^n}} \cdot \tau^{n/2} \cdot \exp\left[ -\frac{\tau}{2} \left( \sum_{i=1}^{n} y_i^2 - 2\mu \sum_{i=1}^{n} y_i + n\mu^2 \right) \right] \\
&= \sqrt{\frac{1}{(2\pi)^n}} \cdot \tau^{n/2} \cdot \exp\left[ -\frac{\tau}{2} \left( y^{\mathrm{T}} y - 2\mu n \bar{y} + n\mu^2 \right) \right] \\
&= \sqrt{\frac{1}{(2\pi)^n}} \cdot \tau^{n/2} \cdot \exp\left[ -\frac{\tau n}{2} \left( \frac{1}{n} y^{\mathrm{T}} y - 2\mu \bar{y} + \mu^2 \right) \right]
\end{split} \tag{6}$$

where $\bar{y} = \frac{1}{n} \sum_{i=1}^{n} y_i$ is the mean of the data points and $y^{\mathrm{T}} y = \sum_{i=1}^{n} y_i^2$ is the sum of squared data points.
Completing the square over $\mu$ finally gives

$$p(y \mid \mu, \tau) = \sqrt{\frac{1}{(2\pi)^n}} \cdot \tau^{n/2} \cdot \exp\left[ -\frac{\tau n}{2} \left( (\mu - \bar{y})^2 - \bar{y}^2 + \frac{1}{n} y^{\mathrm{T}} y \right) \right] \; . \tag{7}$$
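To confirm the completed square, a short numerical check (not part of the original proof) that (7) reproduces the likelihood in (4), with hypothetical data and numpy assumed:

```python
import numpy as np

# Hypothetical data for a spot check that eq. (7) equals the likelihood in eq. (4).
rng = np.random.default_rng(3)
y = rng.normal(0.5, 1.5, size=15)
mu, tau = 0.3, 1.0 / 1.5**2
n, ybar = len(y), y.mean()

# Likelihood as in eq. (4).
plain = (tau / (2 * np.pi)) ** (n / 2) * np.exp(-tau / 2 * np.sum((y - mu) ** 2))

# Completed-square form as in eq. (7).
completed = (
    np.sqrt(1 / (2 * np.pi) ** n)
    * tau ** (n / 2)
    * np.exp(-tau * n / 2 * ((mu - ybar) ** 2 - ybar**2 + (y @ y) / n))
)
print(np.isclose(plain, completed))   # True
```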