
Introduction to Data Science

Bayesian statistics II

Applications of Bayes theorem


The concept of probability
in the two frameworks
• Frequentist conception: Relative frequency of the
outcome of interest as a proportion of the whole
sample space, in the long run. (“Objective” probability)

• Bayesian conception: Degree of belief, plausibility.
Beliefs are constantly updated with new information.
(“Subjective” probability)

Probability vs. Likelihood: Probability

p( z > 1.65 | normal distribution with mean = 0, SD = 1) = 0.05


Probability vs. Likelihood: Probability

p( z > 1.96 | normal distribution with mean = 0, SD = 1) = 0.025
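
As a quick check of these two tail probabilities, here is a minimal sketch of my own (not part of the slides) using SciPy's standard normal survival function:

from scipy.stats import norm

# Tail probabilities under a standard normal distribution (mean = 0, SD = 1)
p_165 = norm.sf(1.65)   # P(z > 1.65) ≈ 0.0495, i.e. roughly 0.05
p_196 = norm.sf(1.96)   # P(z > 1.96) ≈ 0.025

print(p_165, p_196)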


Probability vs. Likelihood: Likelihood
[Figure: probability density of a normal distribution (mean = 0, SD = 1.3) over x from -5 to 5]

L(normal distribution with mean = 0, SD = 1.3 | x = 2) = 0.094


Probability vs. Likelihood: Likelihood
[Figure: probability density of a normal distribution (mean = 1, SD = 1.3) over x from -5 to 5]

L(normal distribution with mean = 1, SD = 1.3 | x = 2) = 0.2283
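
These likelihood values can be reproduced with a short sketch (my own illustration, not from the slides): the likelihood of a candidate distribution given a datum is just that distribution's density evaluated at the datum.

from scipy.stats import norm

# Likelihood of each candidate distribution, given the observed datum x = 2
x = 2
L_mean0 = norm.pdf(x, loc=0, scale=1.3)   # ≈ 0.094
L_mean1 = norm.pdf(x, loc=1, scale=1.3)   # ≈ 0.2283

print(L_mean0, L_mean1)   # the mean = 1 distribution makes x = 2 more likely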


Maximum likelihood estimates:
Introducing Bayes Factors
• NHST has a somewhat convoluted (yet solid) logic:
Assume that something makes no difference, show that
the observed results are unlikely given that
assumption, reject the assumption on these grounds,
and conclude that there probably was a difference
after all.
• Also somewhat weak: What is the difference?
• Bayes Factors provide an alternative/complement to
classical null hypothesis significance testing that
addresses both of these issues:
• p(D|H) → p(H|D)
• Bayes' Theorem allows us to invert conditional
probabilities if (and only if) we know the prior
probabilities of the hypotheses.
The Bayes Factor
p(H1 | D) / p(H0 | D)  =  [ p(D | H1) / p(D | H0) ]  ×  [ p(H1) / p(H0) ]

Posterior odds  =  Bayes Factor  ×  Prior odds
So what are Bayes factors?
• In essence, likelihood ratios:

Bayes' theorem (posterior on the left; prior and likelihood on the right):

p(A | B) = p(A) × p(B | A) / p(B)

The Bayes factor compares the likelihood of the data under the two hypotheses:

BF = p(D | H1) / p(D | H0)
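
To make "likelihood ratio" concrete, here is a minimal sketch of my own (with made-up numbers) for two point hypotheses about a binomial success probability; for point hypotheses the Bayes factor reduces to a plain likelihood ratio:

from scipy.stats import binom

# Hypothetical data: 14 successes in 20 trials
k, n = 14, 20

# Two point hypotheses about the success probability
p_H0 = 0.5   # H0: chance performance
p_H1 = 0.7   # H1: a specific alternative (chosen for illustration)

# Bayes factor: likelihood of the data under H1 over its likelihood under H0
BF = binom.pmf(k, n, p_H1) / binom.pmf(k, n, p_H0)
print(BF)   # values > 1 favor H1, values < 1 favor H0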
The history of Bayes factors
• Developed by Jeffreys (1935).
• Called “significance tests”.
• This alternative was not really appreciated until
about the 1990s.
• They are still “catching on”.
What do Bayes Factors give us?
• They quantify the strength of the evidence in favor
of a hypothesis, given the data.
• Can be used for model comparison (which model is
more likely, given the data).
Deriving the Bayes factor
Strength of evidence for H1 or H0, given that they are
equally likely a priori (from Lee & Wagenmakers, 2013).
Example: Determining the most likely effectiveness of
a medication using maximum likelihood estimation
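
The slide's worked example is not reproduced here, but the idea can be sketched as follows (my own illustration; the patient counts are made up): treat effectiveness as a binomial success probability and pick the value that makes the observed data most likely.

import numpy as np
from scipy.stats import binom

# Hypothetical trial: 60 of 100 patients improve on the medication
k, n = 60, 100

# Evaluate the likelihood of the data over a grid of candidate effectiveness values
theta = np.linspace(0.01, 0.99, 99)
likelihood = binom.pmf(k, n, theta)

# The maximum likelihood estimate is the candidate that maximizes the likelihood
theta_mle = theta[np.argmax(likelihood)]
print(theta_mle)   # ≈ 0.6, i.e. k/n, as expected for a binomial model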
Bayes theorem also allows us to assess
the veracity of the published literature
• What is the probability that a published result
is actually true?
• If we set the significance level α to 0.05, does
this mean that the false positive rate in the
field is 5%?
• No!
• This is a common misconception.
• The answer also depends on the prior probability of
a true effect in a given field, as well as on the
statistical power of the study.
Introducing PPV (Ioannidis, 2005):
• PPV = “positive predictive value”: the *post-study*
probability that a result is true.
• R: Ratio of true to false effects in a field.
• PPV links power, alpha and R:

PPV = (1 − β)R / (R − βR + α) = (1 − β)R / ((1 − β)R + α)
Deriving PPV

Probabilities of a significant vs. non-significant result, conditional on
whether the effect is true or false (each column sums to 1):

RESEARCH/REALITY    TRUE        FALSE        TOTAL
SIG                 1 − β       α            1 − β + α
NONSIG              β           1 − α        β + 1 − α
TOTAL               1           1            2

With NT true and NF false effects being tested in a field (c = NT + NF):

RESEARCH/REALITY    TRUE          FALSE          TOTAL
SIG                 NT·(1 − β)    NF·α           NT·(1 − β) + NF·α
NONSIG              NT·β          NF·(1 − α)     NT·β + NF·(1 − α)
TOTAL               NT            NF             NT + NF = c

Since R = NT/NF, we have NT = R·NF. Substituting into NT + NF = c gives
R·NF + NF = c, so NF = c/(R + 1) and NT = c·R/(R + 1).
Deriving PPV

Substituting NT = c·R/(R + 1) and NF = c/(R + 1) into the table of counts:

RESEARCH/REALITY    TRUE                  FALSE               TOTAL
SIG                 c·(1 − β)·R/(R + 1)   c·α/(R + 1)         c·(R·(1 − β) + α)/(R + 1)
NONSIG              c·β·R/(R + 1)         c·(1 − α)/(R + 1)   c·(1 + R·β − α)/(R + 1)
TOTAL               c·R/(R + 1)           c/(R + 1)           c
Deriving PPV

PPV = p(effect true | significant) = p(sig | true)·p(true) / p(sig)

p(true) = NT/c = R/(R + 1)
p(sig | true) = 1 − β
p(sig) = p(sig | true)·p(true) + p(sig | false)·p(false)
       = (1 − β)·R/(R + 1) + α/(R + 1)

PPV = [(1 − β)·R/(R + 1)] / [(1 − β)·R/(R + 1) + α/(R + 1)]
    = (1 − β)·R / ((1 − β)·R + α)
A study converts the prior odds of the
effect being true (R) into the posterior –
PPV, as a function of statistical power:
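
As a rough numerical illustration of that relationship (my own sketch; the α, power, and R values are chosen only for illustration), PPV can be computed directly from the formula derived above:

# PPV = (1 - beta) * R / ((1 - beta) * R + alpha), where power = 1 - beta
def ppv(power, alpha, R):
    return power * R / (power * R + alpha)

alpha = 0.05
for R in (0.1, 0.5, 1.0):            # ratio of true to false effects in the field
    for power in (0.2, 0.5, 0.8):    # statistical power of the study
        print(f"R = {R}, power = {power}: PPV = {ppv(power, alpha, R):.2f}")

Even with α = 0.05, the PPV falls well below 95% when power is low or when true effects are rare in the field (small R).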
Confidence interval vs. credible interval
• Frequentist approach: The value of a parameter 𝜽 is
unknown, but fixed.
• We can estimate it by taking samples from the
population. This yields a (tychenic, i.e. chance-generated)
distribution of sample means, which we can use to
calculate the CI.
• Example: You are an epidemiologist and want to
estimate the prevalence of Herpes Simplex in the
population using a frequentist approach:
𝜽: The prevalence of Herpes simplex in
the population
[Figure: distribution of sample estimates of the prevalence, x-axis from 0.44 to 0.56, y-axis: Proportion]
In a Bayesian framework, 𝜽 is a random variable (RV)
and has a probability distribution
• This probability distribution corresponds to our
degree of belief.
• If this is our prior belief about the value of the
parameter, this is called the prior distribution.
• The prior distribution can take any shape. A
completely flat prior distribution is called an
“uninformative prior”.
• The sharper the prior distribution, the more
informative it is.
• Often used to model the prior distribution of a
proportion: The Beta distribution
Prior distributions of varying degrees of
informativeness:
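
The slide's figure is not reproduced here, but Beta priors of varying informativeness are easy to sketch (my own illustration; the specific α and β values are arbitrary):

import numpy as np
from scipy.stats import beta

theta = np.linspace(0.001, 0.999, 999)

# A flat ("uninformative") prior and two increasingly informative priors
priors = {"Beta(1, 1), flat": (1, 1),
          "Beta(5, 5), mildly informative": (5, 5),
          "Beta(50, 50), sharply peaked near 0.5": (50, 50)}

for label, (a, b) in priors.items():
    density = beta.pdf(theta, a, b)
    # The sharper (more informative) the prior, the higher its peak density
    print(label, "-> peak density:", round(float(density.max()), 2))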
So what?
• In Bayesian analysis, we can use the data from a
study (which yields the likelihood) in combination
with the prior distribution to compute a posterior
distribution:
• Posterior ∝ Prior × Likelihood
• For something like Herpes simplex, we could model
the likelihood with a binomial distribution (how
many people are infected, as a proportion of the
sample size):
• p(𝜽 | y) ∝ p(𝜽) × p(y | 𝜽)
• Data = y
Example: We are epidemiologists and want to
know the likely value of 𝜽 in a certain location.
• We have a somewhat informative prior from the
literature (say a Beta distribution with α = β = 10).
• We take a local sample of 100 people and see how
many are infected with the virus.
• This yields a posterior distribution of 𝜽:
What does our study yield?

[Figure: posterior distribution of 𝜽, axis from 0 to 1]

Posterior ∝ Prior × Likelihood
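
A minimal sketch of this update (my own illustration; the infection count is hypothetical) uses the fact that the Beta prior is conjugate to the binomial likelihood, so the posterior is again a Beta distribution:

# Prior from the literature: Beta(10, 10)
a_prior, b_prior = 10, 10

# Hypothetical local sample: 35 of 100 people are infected
n, k = 100, 35

# Conjugate update: posterior is Beta(a_prior + k, b_prior + n - k)
a_post, b_post = a_prior + k, b_prior + (n - k)

posterior_mean = a_post / (a_post + b_post)
print(f"Posterior: Beta({a_post}, {b_post}), mean = {posterior_mean:.3f}")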


How do we get a new estimate of 𝜽
from the credible interval?
The credible interval spans a specified share (e.g. 95%)
of the area under the posterior distribution:
𝜽 = 0.373, at 95% credibility

[Figure: posterior distribution of 𝜽, axis from 0 to 1, with the 95% credible interval indicated]
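
Continuing the sketch above (same hypothetical counts), a central 95% credible interval can be read off the posterior Beta distribution directly:

from scipy.stats import beta

# Posterior from the sketch above: Beta(45, 75) (hypothetical counts)
a_post, b_post = 45, 75

# Central 95% credible interval: the middle 95% of the posterior's area
lower, upper = beta.interval(0.95, a_post, b_post)
print(f"95% credible interval for theta: [{lower:.3f}, {upper:.3f}]")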


How strongly the prior distribution determines the
location and shape of the posterior distribution, at a
fixed likelihood:

The impact of larger samples (stronger likelihood) on
the posterior distribution, at a fixed prior:
At small sample sizes, the prior matters a lot:
[Figure: 3 × 3 grid of panels, each based on n = 20, 𝜽 axis from 0 to 1]


At large sample sizes, the prior doesn’t
matter much:
[Figure: 3 × 3 grid of panels, each based on n = 500, 𝜽 axis from 0 to 1]
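
A small numerical sketch of this point (my own illustration; the priors and the observed proportion are made up): with the same observed proportion of 40%, the posterior means under quite different Beta priors nearly coincide once n is large.

# Three rather different Beta(a, b) priors (hypothetical choices)
priors = [(1, 1), (10, 10), (2, 8)]

# Same observed proportion (40% "successes") at a small and a large sample size
for n in (20, 500):
    k = int(0.4 * n)
    # Posterior mean of Beta(a + k, b + n - k) for each prior
    means = [round((a + k) / (a + b + n), 3) for a, b in priors]
    print(f"n = {n}: posterior means under the three priors = {means}")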
