CPT Statistical Manual
CPT Statistical Manual
Statistical Manual
Contents
1. INTRODUCTION ............................................................................................. 3
2. SUFFICIENT HOMOGENEITY TESTING ............................................................... 3
2.1. Sample Selection and Measurement ................................................................................................... 3
2.2. Statistical Analysis of Homogeneity Data ........................................................................................... 3
2.2.1. Visual Appraisal for Data Pathologies ............................................................................................. 4
2.2.2. Cochran’s Test ................................................................................................................................. 5
2.2.3. Estimate of Analytical and Sampling Variances .............................................................................. 5
2.2.4. Test for Sufficient Analytical Precision (san < 0.5 σ ) ....................................................................... 6
2.2.5. Test for Acceptable Between Sample Variance .............................................................................. 6
2.3. Uncertainty Due to Inhomogeneity ...................................................................................................... 7
2.4. Alternative Homogeneity Testing Procedure used in NMI CPT ........................................................7
7. REFERENCES.............................................................................................. 14
8 REVISION HISTORY ....................................................................................... 15
1. Introduction
The Chemical Proficiency Testing (CPT) Statistical Manual outlines the statistical methods used by
CPT. These methods are based on the procedures described in ISO 13528:2005 (E) “Statistical
1
methods for use in proficiency testing by interlaboratory comparisons” and “The International
2
Harmonized Protocol for the Proficiency Testing of Analytical Chemistry Laboratories” .
The role of the CPT Statistical Manual is to set out the procedures used in assessing the homogeneity
of the test materials sent to the participants’, the method of establishing the assigned value and the
target standard deviation of a PT study as well as the tools used to assess and compare individual
laboratory performance.
Select a minimum of 7 (but preferably 10) of the packaged units strictly at random from the entire
batch, or by stratified random sampling throughout the fill sequence if fill trend effects are suspected.
This must be done in a formal way, by assigning a sequential number to the units (either by label or by
their position in a linear sequence). The selection is made by use of a random number table or
computer random number generation software. It is not acceptable to select the units in any other
way (eg by “shuffling” or “selection at random”).
Homogenise each selected test unit within its container, then take two appropriately sized test portions
from each. Label the test portions as “1a”, “1b”, “2a”, “2b” etc. Test portions must be sufficiently
large, particularly for solid samples, so as not to compromise the precision of the test results.
Sort the entire set of test portions into a random order, again using a random number table or
computer random number generation software.
Analyse each test portion for each analyte of interest, maintaining this random order throughout. The
testing should be performed under repeatability conditions (in as short a time as is practical, by a
single analyst, preferably in a single sample batch). The analytical method selected must be
sufficiently precise to allow a satisfactory estimation of between-sample variance and therefore should
have a repeatability standard deviation (san) of less than half of the target standard deviation ( σ ) set
for the study.
Include appropriate quality control samples (blanks, recoveries, control samples) with each batch of
test samples.
The data in the Table 1 are taken from AQA 06-02, Sample S1 Endosulfan Sulfate
Table 1 Duplicated results for ten distribution units and intermediate stages of calculation in
Cochran’s test
A B 2 2
Sample D = A-B S = A+B D =(A-B)
(mg/kg) (mg/kg)
6 1.041 1.014 0.027 2.055 0.00070
87 1.034 0.995 0.039 2.029 0.00151
97 1.120 1.033 0.087 2.153 0.00756
159 1.076 1.086 -0.010 2.161 0.00010
174 1.078 1.061 0.017 2.139 0.00028
211 1.023 0.980 0.042 2.003 0.00178
212 1.058 1.072 -0.013 2.130 0.00018
228 1.001 0.998 0.002 1.999 0.00001
232 1.012 1.028 -0.015 2.040 0.00023
246 0.987 0.969 0.019 1.956 0.00035
1.20
1.00
0.80
0.60
0.40
0.20
0.00
6 87 97 159 174 211 212 228 232 246
Vial No
0.00756
=
0.0127
= 0.595
where C = Cochran’s statistic test
Table 2 Critical values for the Cochran test statistic for duplicates
1
m 95%
7 0.727
8 0.680
9 0.638
10 0.602
11 0.570
12 0.541
13 0.515
14 0.492
15 0.471
16 0.452
17 0.434
18 0.418
19 0.403
20 0.389
1
m is the number of samples that have been measured in duplicate.
ANOVA
Source of P-
Variation SS df MS F value F crit
Between
0.0244 9 0.00271 4.27 0.0166 3.020
Groups
Within Groups 0.00635 10 0.000635
2
san = MS within
So
= 0.0006351
2
where san = the analytical variance
and
MSbetween − MS within
2
ssam =
2
0.00271 − 0.000635
=
2
= 0.00104
2
where ssam = the between-sample variance
σ = χ ∗ CV
= 1.03 ∗ 0.15
= 0.155 mg/kg
The analytical standard deviation (san) is the square root of the analytical variance estimated from
ANOVA above.
0.0252
σan / σ =
0.155
= 0.163
This is less than the critical value of 0.5. The method is precise enough to detect significant in-
homogeneity.
σ all
2
= (0.3 * σ ) 2
= (0.3 * 0.155)
2
= 0.00216
where σ = target standard deviation
c = F1s all
2
+ F2 san
2
Note: even though statistically significant differences between the test samples have been detected
using one-way Anova (P value < 0.02), the inhomogeneity is small enough to be of no practical
consequence when compared to the expected between laboratory variability.
If F > 1, sampling variance has been observed, so this can be used to estimate the uncertainty due to
inhomogeneity.
If F < 1, then the sampling variance is smaller than the analytical variance. This means that any
inhomogeneity is so small that the homogeneity testing does not have the power to detect it. The
observed variation is almost all due to analytical variance. However this is not proof that the samples
are perfectly homogeneous. Inhomogeneity is somewhere between zero, and the analytical variance
(estimated as the standard deviation of all results, stotal), and it is likely to be closer to 0 than to stotal.
This approximates a triangular distribution, hence the choice of root 6 as the divisor.
CPT will calculate an assigned value by this method only if there is a minimum of six results to ensure
a reasonable estimate.
The assigned value for the test material used in a proficiency study is the robust average of the results
reported by all the participants in the round. This is a modern approach to the outlier problems in a
proficiency study in which the influence of the outliers and heavy tails is down-weighted and is
calculated using the procedure described in “ISO13258:2015(E), Statistical methods for use in
1
proficiency testing by interlaboratory comparisons – Annex C” .
When the assigned value is derived from robust average the uncertainty is estimated as:
where:
The expanded uncertainty (Urob mean)` is the standard uncertainty multiplied by a coverage factor k = 2
at approximately 95% confidence level.
Concentration
Lab Code
Sample S3
2 71.2
3 57.0
4 55.4
5 58.1
6 55.4
7 58.4
8 60.67
9 55.65
10 57.2
11 55.4
12 59.6
13 45.9
14 57.3
15 56.0
16 55.3
17 61
18 56.5
19 57.7
20 100
21 58.4
22 54.3
NMI uses primary methods such as Isotope Dilution Mass Spectrometry for which the result is
traceable directly to SI and is of the smallest achievable uncertainty. When reference value is used as
the assigned value, performance scores are calculated for any number of participants.
3.4. Formulation
Formulation is the addition of a known amount or concentration of analyte to a base material which is
either free of the analyte or its concentration accurately known. The assigned value is then determined
from the proportions of the materials used and the known concentrations added.
This method is advantageous if pure substances are available to spike the test samples, as the added
amount can be measured extremely accurate by gravimetric or volumetric methods. Consequently,
there is usually no difficulty in establishing the traceability of the assigned value.
The uncertainty is estimated from the uncertainties in analyte concentrations of the materials used and
gravimetric and volumetric uncertainties, through moisture content or any other changes during mixing
if significant. For more details to estimate standard uncertainty follow the approach described in the
5
“Guide to the expression of uncertainty in measurement” .
The between laboratory coefficient of variation is a measure of the between laboratory variation that in
the judgement of the study coordinator would be expected from participants given the analyte
concentration. It is important to note that this is not the coefficient of variation of participants results.
4.1. By Perception
The target standard deviation could be fixed arbitrarily by the study coordinator based on a perception
of how laboratory should perform. The perception is based on practical experience and published
4, 5, 6
models and varies depending on the concentration in the matrix. The values of target standard
deviation for various projects are presented in the CPT Study Protocol.
where c = concentration, (eg. the assigned value Χ expressed as a dimensionless mass ratio 1ppm ≡
10 or % ≡ 10 )
-6 -2
Laboratory performance is assessed by comparing reported test results to the assigned value using
both z-scores and En-scores.
(χ − X )
z=
σ
where:
z = z-score
χ = individual laboratory result
Χ = assigned value
σ = target standard deviation.
z-scores are interpreted as follows:
• ІzІ ≤ 2 satisfactory.
• 2 < ІzІ < 3 questionable
• ІzІ of ≥ 3 unsatisfactory
The En-score is an objective measure of whether or not an individual result is consistent with the
assigned value. Unlike z-scores, En-scores do not require the setting of a target standard deviation.
(χ − X )
En =
U χ2 + U X2
where:
En = En-score
χ = individual laboratory result
Uχ = expanded uncertainty of the individual laboratory result
Χ = assigned value
UΧ = expanded uncertainty of the assigned value
• ІEnІ ≤ 1 satisfactory.
• ІEnІ > 1 questionable.
Uncertainties
reported by
participants.
4
17
2 7
19
1 21
5
10
12 14
S2
0 18
3
-1 6
1116
15
-2
4
-3
-4 22
-4 -3 -2 -1 0 1 2 3 4
S1
The plot has two squares, the inner square corresponding to a z-score of |2|, the outer square
corresponding to a z-score of |3|. Laboratories falling within the centre square have z-scores with |z| <
2 for both samples. Laboratories falling between the inner and outer squares have z-sores with |z|
between 2 and 3 for at least one sample. Laboratories falling outside the outer square have at least
one z-score with |z| > 3.
Within laboratory and between laboratory variability is indicated in the same fashion as for a
conventional Youden Plot. For laboratories plotted in the upper right and lower left quadrants,
between laboratory variability predominates. For laboratories plotted in the upper left and lower right
quadrants, within laboratory variation predominates.
between the upper and lower quartiles. The median can also be indicated by dividing the box into two.
The "whiskers" are straight line extending from the ends of the box to the maximum and minimum
values. Example is presented in Figure 4.
The Kernel density plot is used to identify modes in the distribution of participants’ results. It is also
used to identify outlying results.
7. References
1. ISO13528:2015 (E), Statistical methods for use in proficiency testing by interlaboratory
comparisons, ISO, Geneva, Switzerland.
2. Thompson, M., Ellison, S. L. and Wood, R., The International Harmonised Protocol for the
Proficiency Testing of Analytical Chemistry Laboratories, Pure Appl. Chem., 78 (1), 145-196, 2006.
3. Lawn, R. E., Thompson, M. and Walker, R. F., Proficiency Testing in Analytical Chemistry, LGC,
Teddington, UK, 1997.
4. Horwitz, W., Evaluation of analytical methods used for regulations of food and drugs, Anal. Chem.,
54, 67A-76A.6, 1982.
5. Thompson, M., and Lowthian, P.J., A Horwitz-like function describes precision in a proficiency test,
Analyst, 120, 271-272, 1995.
6. Thompson, M., Recent trends in inter-laboratory precision at ppb and sub-ppb concentrations in
relation to fitness for purpose criteria in proficiency testing, Analyst, 125, 385-386, 2000.
7. Hibbert, D. B. and Gooding J. J., Data Analysis for Chemists – An introductory guide for students
and laboratory scientists, first edition, Unversity Press, New York, 2006.
8. Stephen L. R. E., Barwick V. J. and Farrant T. J. D., Practical Statistics for the Analytical Scientist –
nd
A bench guide, 2 edition, RSC Publishing, Cambridge, 2009.
8 Revision History