Bayes Theorem
Bayes Theorem
THEOREM
Submitted To: Dr. Rachana P G
Submitted By:
Shashti D B 4GM20CS095
Sushma B C 4GM20CS110
Vanamali 4GM20CS121
Sumanth 4GM21CS408
Contents
B AY E S T H E O R E M
INTRODUCTION B AY E S T H E O R E M AND CONCEPT
LEARNING
• B AY E S I A N
• PROBABILISTIC
• C O M B I N AT I O N
• S TA N D A R D
P R A C T I A L D I F F I C U LT Y
• PROBABILITIES
• C O S T E S T I M AT I O N
B AY E S T H E O R E M
Bayes theorem provides a way to calculate the probability of a hypothesis based on its prior probability, the
probabilities of observing various data given the hypothesis, and the observed data itself.
What is the relationship between Bayes theorem and the problem of concept learning?
Since Bayes theorem provides a principled way to calculate the posterior probability of each
hypothesis given the training data, and can use it as the basis for a straightforward learning algorithm
that calculates the probability for each possible hypothesis, then outputs the most probable.
B R U T E - F O R C E B AY E S C O N C E P T L E A R N I N G
We can design a straightforward concept learning algorithm to output the maximum a posteriori hypothesis,
based on Bayes theorem, as follows:
W H AT C H O I C E S H A L L W E M A K E F O R P ( D | H ) ?
To summarize, Bayes theorem implies that the posterior probability P(h|D) under our assumed P(h)
and P(D|h) is
Example:
• FIND-S outputs a consistent hypothesis, it will output a MAP hypothesis under the probability distributions P(h) and
P(D|h) defined above.
• Are there other probability distributions for P(h) and P(D|h) under which FIND-S outputs MAP hypotheses? Yes.
• Because FIND-S outputs a maximally specific hypothesis from the version space, its output hypothesis will be a MAP
hypothesis relative to any prior probability distribution that favours more specific hypotheses.
• Bayesian framework is a way to characterize the behaviour of learning algorithms
• By identifying probability distributions P(h) and P(D|h) under which the output is a optimal hypothesis, implicit
assumptions of the algorithm can be characterized (Inductive Bias)
• Inductive inference is modelled by an equivalent probabilistic reasoning system based on Bayes theorem
THANK YOU