B Statistics
B Statistics
Chemistry
Reading –lots to revise and learn
Chapter 3
Chapter 4
Chapter 5-1 and 5-2
Chapter 5-3 will be necessary background
for the AA lab
Chapter 5-4 we will use later
Data Analysis
Most data quantitative - derived from
measurements
Never really know error
With more measurements you get a better
idea what it might be
Don’t spend a lot of time on an answer
-where only 20% accuracy is required
-or where sampling error is big - although
you don’t want to make the error worse
Significant Figure Convention
1200 ????
12 x 102 - 2 significant figures
Rounding Off
Round a 5 to nearest even number
4.55 to 4.6
12.2
00.365
01.04
13.605 goes to 13.6
When errors are known
Rr =(A a) + (B b) + (C c)
where r2 = a2 + b2 + c2
Difficult to know
15
10
5 y = 1.9311x + 1.1127
R 2 = 0.9888
0
0 1 2 3 4 5 6 7
[Ca] (mg/L)
Correlation coefficients
[( xi x)( yi y )] 2
R 2
( xi x) ( yi y )
2 2
LINEST
Calculates slope and intercept
Calculates the uncertainty in the slope and
the intercept
Calculates R2
Calculates s.d. of the population of y values
See page pp 68-72, Harris.
Use these values to determine the number of sig figs
for the slope and intercept
Dealing with Random Errors
Indeterminate Error
Repeating a coarse measurement gives the
same result
eg weighing 50 g object to nearest g - only
error would be determinate - such as there
being a fault in the balance
If same object was weighed to several
decimal places -get random errors
How many eggs in a dozen?
How wide is your desk?
( xi u ) 2
e 2 2
y
2
xi u
Let z
Then
2
z
e 2
y
2
If z is abscissa (x axis)
Same curve is always obtained
as
z expresses the deviation from the mean in
units of standard deviation
Statistics
(x i x) 2
s i 1
N 1
For use N in denominator
For s use N-1 in denominator (we have one
less degree of freedom - don’t know )
At end, round s to 2 sig figs or less if there
are not enough sig figs in data
Confidence Interval
We are doing an analysis to find the true
mean - it is unknown
What we measure is x but it may not be
the same as
Set a confidence limit eg 4.5 ± 0.3 g
The mean of the measurements was 4.5 g
The true mean is in the interval 4.2-4.8
with some specified degree of confidence
Confidence limit
A measure of the reliability (Re)
The reliability of a mean (x ) increases as
more measurements are taken
Re = k(n)1/2
Reliability increases with square root of
number of measurements
Quickly reach a condition of limiting return
Reliability
Would you want a car that is 95% reliable?
( xi u )
t
s
t depends on desired confidence limit
degrees of freedom (N-1)
Degrees of Values of t for Various degrees of
Freedom Probability
80% 90% 95% 99.9%
ts
x
N
Example
Cal Culator obtained the following results for
replicate determinations of calcium in limestone
14.35%, 14.41%, 14.40%, 14.32%, 14.37%
each is xi
Calculate the confidence interval
Answer
Average = 14.37 %
S = 0.037%
Choose a 95 % confidence limit
Degrees of freedom = N-1 = 5-1 =4
From t-table, t = 2.78
14.37% ± ts/N½
14.37 % ± 2.78 x 0.037% / 5 ½
14.37 ± 0.05 %
Significant figures
I say: Use two or less significant figures in a
confidence limit. Then use the same number
of decimal places in both (guided by the CL)
When less than two sig figs in the CL?
When using two would require you to have
more decimal places than were in the actual
data.
The bunny gave up
Pooled standard deviation
(x x ) (x x
i 1
2
i 2 ) 2
sp i 1 i 1
n1 n2 2
Comparison of Means
We analyze several samples and want
to know if they are the same or different
For each sample we take several
measurements and obtain a mean
2 2
s s
If x1 x 2 t 1
2
n1 n2
there is no significant difference
Comparing two means
ts1 ts2
Compare x1 x2 to
n1 n2
If s is a pooled sd
x1 x 2 n1n2
tcalc
s n1 n2
If tcalc ttable then the difference is
not significan t at the chosen CL
Comparing two means
s1 2 s2 2
x t. ( ) ( )
n1 n2
If s is the pooled s
x1 x 2 n1n2
t
s n1 n2
If tcalc ttable then the difference is
not significan t at the chosen CL
Example
Two barrels of wine were analyzed for their
alcohol content to determine whether or not
they were from different sources:
12.61% (6 analyses),
12.53% (4 analyses)
Pooled standard deviation = 0.07 %
12.61% 12.53% 6 * 4
t 1.77
0.07% 64
4 y = 1.9311x + 1.1127
2
R = 0.9888
0
0 1 2 3 4
Ideally, the average blank = b (the
intercept)
However, if b > average blank, then
recalculate LUR using LUR = b + k
s.d.blank
Usually say LUR = b + 3 sd
LOD = 5.2 mg/L (k = 3)
Note the 2 significant figures
Quality Assurance
Begins with sampling
Calibration Check
Run standards every few samples.
Reference standards are of known
concentration. Do you get the right answer?
Include in Table of Results.
SOP’s are very important
SOP (Standard operating
procedure)
Set of written instructions that document a
routine or repetitive activity which is followed
by employees in an organization.
The development and use of SOPs is an
integral part of a successful quality system.
Provides information to perform a job
properly and consistently in order to achieve
pre-determined specifications and quality.
http://people.stfx.ca/tsmithpa/Chem361/
Numerical Criteria for Selecting
Analytical Methods
Precision
Bias
Sensitivity
Detection Limit
Concentration Range
Selectivity
Other characteristics to be
considered
Speed
Ease and convenience
Skill required of operator
Cost and availibility of equipment
Per-sample-cost
Criterion Figure of Merit
Precision Absolute sd, relative sd, coefficient of
variation, variance
Bias Absolute systematic error, relative
systematic error
Sensitivity Calibration sensitivity, analytical sensitivity