Statistics Notes
Statistics Notes
taught first.
Inferential statistics - You can’t learn anything about a
- drawing of conclusions/inferences about population of a sample until the
scientific system analyst learns uncertainty in that
- Scientific judgments based on uncertainty and sample
variation
- Quality of process Sampling Procedures
- No statistics = no variability ❖ Simple random sampling- specified
Samples/observations sample size has the same chance of
- gathered from population (scientific system) being selected as any other sample of
Experimental design the same size (avoid bias)
- factors can be selected ➢ Sample size -number of
Observational study elements in the sample
- factor levels isn’t preselected
Descriptive statistics ➢ aids in the elimination of the
Statistical inference problem of having the sample
Means reflect a different (possibly more
Medians confined) population than the
Standard deviation one about which inferences
Single number statistics need to be made.
- Stem-and-leaf plots, dot plots, box plots ❖ Stratified random sampling - random
Probability selection of a sample within each
- transition between descriptive statistics and stratum
inferential methods ➢ Strata- nonoverlapping groups
P-value ➢ In order not to disregard or
- “bottom line” in data interpretation overrepresent any group
❖ Experimental design
Relationship between probability and statistical ➢ Treatments or treatment
inference combinations
- For a statistical problem the sample ➢ Variability
along with inferential statistics allows us ➢ Experimental unit
to draw conclusions about the ➢ Completely randomized design
population, with inferential statistics
making clear use of elements of Why assign experimental units randomly?
probability. This reasoning is inductive in - Variability (avoid bias) - “wash away”
nature. - Descriptive statistics
- problems in probability allow us to draw
conclusions about the characteristics of Measures of Location/Central Tendency
hypothetical data taken from the
population based on known features of Sample mean - average
the population. This type of reasoning is Sample median - reflect central tendency that is
deductive in nature. uninfluenced by outliers
- The only certainty concerning the x (n+1)÷2 n is odd
pedagogy of the two disciplines lies in 1
n is even
2( x n/2 + x n/2+1 )
the fact that if statistics is to be taught:
at more than merely a "cookbook" level,
Trimmed mean Relative frequency histogram
- removing outliers when averaging
Statistical quality control Quartiles - tails of distribution
Sample standard deviation - Third quartile - separates upper quarter
- measure of variability from the rest
(n-1) - Degrees of freedom associated with the - Second quartile - median
variance estimate; depicts the number of - First quartile - lower quartile from the
independent pieces of information available for rest
computing variability
* Large variability in data set = large variance Parallelism - same
Independent squared deviations Observational study - if factors are not controlled
Average squared deviation Retrospective study - historical data
Variance Disadvantages:
- measure of the average squared (i) Validity and reliability of historical data are
deviation from mean often in doubt.
- measures how far a set of (random)
numbers are spread out from their (ii) If time is an important aspect of the structure
average value. of the data, there may be data missing.
Population parameters (characteristic of
population) (iii) There may be errors in collection of the data
- Population mean that are not known.
- Population variance
(iv) Again, as is the case of observational data,
Discrete - countable as a whole there is no control on the ranges of the
Continuous - measured measured variables (the factors in a study).
Count data Indeed, the ranges found in historical data may
Sample proportion - mean of the ones and not be relevant for current studies.
zeroes
Statistical modelling examples ---------------------
● Postulated model Relative frequency = f/n
● Regression model Stem-and-leaf = * separates 0-4 and 5-9
● Estimation theory
√
N
∑ (x−x) 2
Note: Sample STDev s = i=1
- just a subset
N −1
(1) The type of model used to describe the data
of the population
often depends on the goal of the experiment;
and
Why is it n-1?
(2) the structure of the model should take
- Make the distribution smaller
advantage of nonstatistical scientific input.
- Actual Standard deviation
- When we get a sample, we are tryna get
Fundamental assumption -selection of model
a conclusion about a population
Exploratory data analysis (plots)
- Population std might be wrong because
Violation of assumptions
there are data outside the range which
might affect the std. The data away from
Probability distribution
peak the makes the variance larger
- bell-shaped(symmetric or skewed)
- Variance = measures how far a
Stem-and-leaf plot
set of (random) numbers are
- can be either double or single
spread out from their average
value
√
N
∑ (x−x) 2
- Subtract 1 to make it smaller which gets Population STDev σ = i=1
N
a slightly bigger value now reflects the
real std
- Removes the bias
- everything
HOW TO STEM-AND-LEAF
(sample problem)
2.3, 2.5, 2.5, 2.7, 2.8 3.2, 3.6, 3.6, 4.5, 5.0 WITH *
(plain = 0 to 4 leaf digit; * = 5 to 9 leaf digit)
STEM LEAF
2 35578
3 266
4 5
5 0
Chapter 2 Definition 5
Experiment - generates set of data ➢ Two events A and B are mutually
Definition 1
Sample space(S) - set of all possible outcomes
exclusive, or disjoint, if A ∩ B = ϕ , that
of a statistical experiment is, if A and B have no elements in
- Each outcome is called element/sample common
point
- S = { A, B, C } - Finite Definition 6
- statement/rule - infinite/large sample ➢ The union o nd B,
f the two events A a
points denoted by the symbol AU B, is the
Definition 2 event containing all the elements that
Event - subset of sample space belong to A o r B or both
Definition 3 Definition 7
➢ The complement of an event A with Permutation is an arrangement of all or part of a
respect to S is the subset of all elements set of objects.
of S that are not in A. We denote the EXAMPLE: three letters a, b, and c. The
complement, of A b y the symbol A'. possible permutations are abc, acb, bac, bca,
cab, and cba.
Definition 4
➢ The intersection of two events A and B, Definition 8 - n factorial
denoted by the symbol A D B, is the Definition 9
event containing all elements that are - The probability of an event A is the sum
common to A and B. of the weights of all sample joints in A.
Therefore,
0 < P(A) < 1 (S)=1
P (ϕ) = 0, and P
3,... i s a sequence of
Furthermore if A1, A2. A Rule 1
mutually exclusive events, then - If events A and B come from the same
sample space, the probability that both
P(A1U A2 U A3 U - ••) = P( A
1)+ P(A2) + P(A3) + … A and B occur is equal to the probability
the event A occurs times the probability
Definition 10 that B occurs, given that A has
Conditional probability - “the probability that B occurred.
occurs given that A occurs” or “the probability of Rule 2
B, given A” - If an operation can be performed in n1
ways, and if for each of these a second
operation can be performed in n2 ways,
and for each of the first two a third
operation can be performed in n3 ways,
Definition 10 and so forth, then the sequence of k
operations can be performed in n1n2- •
ays.
-nk w
Rule 3
Definition 11 - If an experiment can result in any one of
N different equally likely outcomes, and
if exactly n of these outcomes
correspond to event A, then the
probability of event A is
Definition 12 P (A) = Nn
Theorem 1
- The number of permutations of n objects is n!.
Theorem 2
n!
nP r = (n−r)! where n objects taken r at a time
n! 6! n! 40!
nP r = (n−r)! = 2! = 360 nP r = (n−r)! = 37! = 59, 280
Theorem 3 (Circular Permutations)
bjects arranged in a circle is (n — 1)!.
- The number of permutations of n o
Theorem 4
- The number of distinct permutations of n things of which n1 are one kind, n2 of a second kind, …,
nk of a kth kind is
n!
n 1 !n 2 !... n k !
Theorem 5
- The number of ways of partitioning a set of n objects into r cells with n1 elements in the first
cell, n2 elements in the second, and so forth is
n!
= n 1 !n 2 !... n k !
Theorem 6
- The number of combinations of n distinct objects taken r at a time is
Theorem 11
Theorem 12
Theory 13
Example:
Sample Problem:
1.
● 1% of women have breast cancer (and therefore 99% do not).
● 80% of mammograms detect breast cancer when it is there (and therefore 20% miss it).
● 9.6% of mammograms detect breast cancer when it’s not there (and therefore 90.4% correctly
return a negative result).
Question: What are the chances you have cancer? (true positive)
(cross multiply)
desired event
P = true positive + f alse positive
(all possibilities = all positives)
true positive
P = true positive + f alse positive
0.008
P = 0.008 + 0.09504
P = 0.0776 = 7.8%
A firm is accustomed to training operators who do certain tasks on a production line. Those operators who
attend the training course are known to be able to meet their production quotas 90% of the time. New
operators who do not take the training course only meet their quotas 65% of the time. Fifty percent of new
operators attend the course. Given that a new operator meets her production quota, what is the
probability that she attended the program?
Corollary 1
Corollary 2
Corollary 3
Chapter 3
PROBABILITY DENSITY FUNCTION
=
=1
● For PMF, you can make use of binomial distribution instead of listing following formula
n is sample space,
x is number of success or binomial random variable,
p is probability of success, and
(1-p) is probability of failure or complement of p
Example:
(WITH REPLACEMENT)
(WITH REPLACEMENT)
(WITHOUT REPLACEMENT)
An urn contains five green balls, two blue balls, (0 red balls ; 3 non-red balls)
and three red balls. You remove three balls at
random without replacement. Let X denote the
number of red balls. Find the probability mass = 7/24
function describing the distribution of X.
(1 red ball; 2 non-red balls)
(WITHOUT REPLACEMENT)
= 21/40
= 1/120
QUESTIONS ANSWERS
Chapter 4 (Expectations?)
σ 2 = E(X 2 ) − μ 2
μ = E (X) = ∑ xf (x ) if x is discrete
x
∞
μ = E (X) = ∫ xf (x )dx if x is continuous
−∞
Chapter 5
Binomial Distribution
-number of successes in Bernoulli experiment is binomial random variable
FORMULA:
SAMPLE PROBLEMS:
Hypergeometric Distribution
-used in situations where no replacement is done upon trial
FORMULA:
Where N is population, n is sample size, k is successes, N-k is failures.
TL;DR add combinations on top per category all over total combinations
SAMPLE PROBLEMS:
Geometric Distribution
Basically only one success out of many attempts (stops after success)
Formula: p × q x−1 where p is success probability and q is complement.
SAMPLE PROBLEMS
Questions Answer
Negative Binomial Distribution
- The experiment consists of x repeated trials. (e.g. 2 successes in 7 trials)
- Each trial can result in just two possible outcomes (success/failure)
- The probability of success, denoted by P, is the same on every trial.
- The trials are independent
- The experiment continues until r successes are observed, where r is specified in advance
Poisson Distribution
-used when population is not known but we have an observation or data such as deaths/yr
-used when no one knows the probability of a success of a single entity
FORMULA:
Where λ is the poisson variable
t is time
x is number of success
SAMPLE PROBLEMS:
b) mean is lambda x t
NORMAL DISTRIBUTION
Basically just the z table lmao
x−μ
Z= σ
The probability is the value on the z-table of the z score.
also
Also
inf inity 2
− x2
∫ 1
√2x
e
−inf inity
Concepts discussed:
1.18 Stem and leaf
Relative Frequency
Histogram
Mean, median, s
2.41. Permutation
2.105. Bayes Theorem
2.126 Bayes Theorem
5.4 Binomial
5.5 Binomial
5.16 Binomial Distribution
5.20 Multinomial
5.27 Binomial
5.33 Hypergeometric distribution
5.47 Hypergeometric
5.50 Negative binomial distribution
5.56 Poisson distribution
5.65 Poisson
5.67 Poisson
5.79 Hypergeometric
5.81 Multinomial
5.85 Binomial
5.92 Negative binomial
5.97 Binomial