0% found this document useful (0 votes)
4 views

Lecture 7

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

Lecture 7

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Lecture 7:

Topic: Review of Statistics - Regression, Probability, PMF, PDF

As you have already covered these topics in details in the last semesters, today we will do
the revision. Like, Can you compare Correlation and Regression?

Source: https://keydifferences.com/difference-between-correlation-and-regression.html
Probability: The extent to which an event is likely to occur, measured by the ratio of the
favourable cases to the whole number of cases possible.

Probability Distribution: A probability distribution is a way to represent the possible


values and the respective probabilities of a random variable. There are two types of
probability distributions: discrete and continuous probability distribution.

Probability Distribution Table:

- Tossing a coin once

Outcome Probability

H 1/2

T 1/2

- Tossing a coin three times

Outcome HHH HHT HTH HTT TTT THH THT TTH

Probabilit 1/8 1/8 1/8 1/8 1/8 1/8 1/8 1/8


y

- Probability distribution of number of heads when a coin is flipped 3 times.


Sample Space = {0,1,2,3}

No of Heads 0 1 2 3

Probability 1/8 3/8 3/8 1/8


Dist

Probability Mass Function:


A probability mass function (PMF)— also called a frequency function— gives you
probabilities for discrete random variables. “Random variables” are variables from
experiments like dice rolls, choosing a number out of a hat, or getting a high score on a
test. The “discrete” part means that there’s a set number of outcomes. For example, you
can only roll a 1,2,3,4,5, or 6 on a die.
The probability mass function, f(x) = P(X = x), of a discrete random variable X has the
following properties:

- All probabilities are positive: fx(x) ≥ 0.


- Any event in the distribution (e.g. “scoring between 20 and 30”) has a probability of
happening of between 0 and 1 (e.g. 0% and 100%).
- The sum of all probabilities is 100% (i.e. 1 as a decimal): Σfx(x) = 1.

Probability Density Function:


A function of a continuous random variable, whose integral across an interval gives the
probability that the value of the variable lies within the same interval.

Points to remember

● A probability distribution is a way to represent the possible values and the


respective probabilities of a random variable. There are two types of
probability distributions:
○ Discrete probability distribution for discrete random variables.
○ Continuous probability distribution for continuous random
variables.
● We can directly calculate probabilities of a discrete random variable, X = x,
as the proportion of times the x value occurs in the random process.
● Probabilities of a continuous random variable taking on a specific value (e.g.
Y = y) are not directly measurable. Instead, we calculate the probability
as the proportion of times y∈[a,b]
● Probability mass functions (pmf) are used to describe discrete probability
distributions. While probability density functions (pdf) are used to describe
continuous probability distributions.
● By assuming a random variable follows an established probability
distribution, we can use its derived pmf/pdf and established principles to
answer questions we have about the data.
Source: http://tinyheero.github.io/2016/03/17/prob-distr.html

Task:
1. For forecasting which statistical technique can be used - Correlation or
Regression? Kindly explain in detail.
2. Why we need to learn PMF and PDF.

You might also like