PTB Flouda EaP PTs 13528 NABs P - Day - 2
PTB Flouda EaP PTs 13528 NABs P - Day - 2
COOPERATION
• Introduction
• Scope, Normative references, Terms and definitions
• General Principles –basic requirements–statistical
model and design considerations
• Initial review of PT items and results
• Determination an the assigned value
• Determination of evaluation criteria
• Performance scores
2
TECHNICAL
COOPERATION
• Graphical techniques
• Qualitative data
• Normative Annexes
✓ A. Symbols
✓ B. Homogeneity and Stability
✓ C. Robust analysis
• Informative Annexes
✓ D. Additional guidance
✓ E. Illustrative examples
3
TECHNICAL
COOPERATION
5
TECHNICAL
COOPERATION
6
TECHNICAL
COOPERATION
7
TECHNICAL
COOPERATION
SDPA
• Measure of dispersion used in the evaluation of results of
proficiency testing
• NOTE 1 This can be interpreted as the population standard
deviation of results from a hypothetical population of
laboratories performing exactly in accordance with
requirements.
• NOTE 2 The standard deviation for proficiency assessment
applies only to ratio and interval scale results.
• NOTE 3 Not all proficiency testing schemes evaluate
performance based on the dispersion of results. [revised from
ISO/IEC 17043]
8
TECHNICAL
COOPERATION
Section 3: Terms and Definitions
Assigned Value
Consensus Value
• value derived from a collection of results in an interlaboratory
comparison
• NOTE The phrase ‘consensus value’ is typically used to
describe estimates of location and dispersion derived from
participant results in a proficiency test round, but may also be
used to refer to values derived from results of a specified
subset of such results or, for example, from a number of
expert laboratories
9
TECHNICAL
COOPERATION
Outlier
10
TECHNICAL
COOPERATION
Action Signal
proficiency test
item
• sample, product, artefact, reference material, piece of
equipment, measurement standard, data set or other
information used to assess participant performance in
proficiency testing
11
TECHNICAL
4 General Principles COOPERATION
12
TECHNICAL
13
TECHNICAL
COOPERATION
4 General Principles
14
TECHNICAL
COOPERATION
16
TECHNICAL
COOPERATION
5.1 Introduction to the Statistical Design
17
TECHNICAL
COOPERATION
5.1 Introduction to the Statistical Design
18
TECHNICAL
COOPERATION
5.2 Basis of Statistical Design
19
TECHNICAL
5.2 Basis of Statistical Design COOPERATION
21
TECHNICAL
5.5 Guidelines for Report Format COOPERATION
22
TECHNICAL
COOPERATION
6 Initial review of proficiency testing items and results
24
TECHNICAL
COOPERATION
25
TECHNICAL
COOPERATION
6.3 Blunders
Most techniques are based (in the first step) on the median
and the range of the central 50% of results
✓Simple estimators: Median, scaled Median Absolute
Deviation (MADe) and normalized IQR (nIQR).
• Algorithm A (and Algorithm S for precision).
✓Qn and Q methods for estimating SD.
28
TECHNICAL
COOPERATION
Most techniques are based (in the first step) on the median
and the range of the central 50% of results
✓Simple estimators: Median, scaled Median Absolute
Deviation (MADe) and normalized IQR (nIQR).
• Algorithm A (and Algorithm S for precision).
✓Qn and Q methods for estimating SD.
29
TECHNICAL
COOPERATION
30
TECHNICAL
COOPERATION
31
TECHNICAL
COOPERATION
7 The• assigned
Alternative value
methods mayits
and bestandard
used if theyuncertainty
have a sound
statistical basis and the method is described in the plan
for the scheme.
7.1 Five✓different
Regardless of the method
alternatives chosen, it must
for determining be checked
the assigned
value. for every round
• The method used must be fully described to participants
in every report (or referenced)
7.2 Uncertainty of the assigned value.
A measurement is incomplete without its uncertainty
35
TECHNICAL
COOPERATION
1. Known
value
5.
Consensus 2. Certified
of Reference
participants value
Document
how it is
determined
4.
Consensus 3. Reference
of experts value
36
TECHNICAL
37
TECHNICAL
COOPERATION
7.3 Formulation
• Concerns:
✓The base material must be effectively free of the added
constituent, or its proportion is accurately known.
✓The constituents are mixed together homogeneously (when
required)
✓All significant sources of error are identified
✓There is no adverse interaction between the constituents and
the matrix.
✓The behaviour of PT items containing added material is similar
to customer samples that are routinely tested.
38
TECHNICAL
7.3 Formulation COOPERATION
Concerns:
The base material must be effectively free of the added
constituent, or its proportion is accurately known.
39
TECHNICAL
Cost!
4141
TECHNICAL
COOPERATION
7.5 Results from one laboratory
• 7.5.1 Single laboratory using a reference method (such a
primary one)
✓The reference method used should be completely described
and understood.
✓Complete uncertainty statement and appropriate
documented metrological traceability.
✓The reference method should be commutable for all
measurement methods used by participants.
• 7.5.2 Value obtained by calibration against a CRM
✓Value from result and difference, with uncertainty from the
result and the difference
42 42
TECHNICAL
COOPERATION
7.5 Results from one laboratory
ഥ
𝒙𝒑𝒕 = 𝒙𝑪𝑹𝑴 + 𝒅 𝒖 𝒙𝒑𝒕 = 𝒖𝟐𝑪𝑹𝑴 + 𝒖𝟐𝒅
• 7.5.1 Single laboratory using a reference method (such a
𝑥𝐶𝑅𝑀 is the assigned value for the CRM
primary one)
𝑥𝑝𝑡 is the assigned value for the PT item
✓The reference method
𝑑𝑖 is theused shouldbetween
difference be completely described
the average results
and understood. for the PT item and the CRM on the ith samples
𝑑ҧ is the
✓Complete uncertainty statement
average ofand
theappropriate
differences 𝑑𝑖
documented metrological traceability.
✓The reference method should be commutable for all
measurement methods used by participants.
• 7.5.2 Value obtained by calibration against a CRM
✓Value from result and difference, with uncertainty from the
result and the difference
43 43
TECHNICAL
COOPERATION
7.5 Results from one laboratory
• 7.5.3 Check for metrological compatibility of results from before
scheme and after it.
When a reference value is assigned before and after a round of a
sequential proficiency testing scheme, the difference between the
values shall be less than two times the uncertainty of that
difference.
44 44
TECHNICAL
COOPERATION
7.6 Consensus value from expert laboratories
• Using a design for an interlaboratory study for characterization
of CRMs, as described in ISO Guide 35
✓Each participant must provide
their uncertainty
✓PTP must have a procedure to combine uncertainties
45 45
TECHNICAL
COOPERATION
7.7 Consensus value from participants results
Careful application of
techniques in clauses 6.2-6.6 to No additional measurements
assure that adequate
agreement exists and
needed
assumptions are demonstrated
to be reasonable
May be necessary with a
May wish to use a subset of
standardized operationally-
participants
defined method
Can use other calculation
methods with sound statistical
basis
46 46
TECHNICAL
COOPERATION
49 49
TECHNICAL
COOPERATION
the comparison value and assigned value are not traceable to the
same metrological reference
50 50
TECHNICAL
COOPERATION
53
TECHNICAL
COOPERATION
8.2 Perception of experts
54
TECHNICAL
COOPERATION
R = 0,02c 0,8495
The results of collaborative trials
seem to obey this law regardless
✓ of the nature of the analyte
✓ the test material
✓ The physical principle
underlying the measurement
method.
So far, nobody has managed to explain the strange empirical
exponent from basic principles, although several people have
made conjectures.
57
TECHNICAL
COOPERATION
58
TECHNICAL
COOPERATION
8.6 From data obtained in the same round of PT
• Advantages
✓Easy, commonly used, may be the only feasible approach
• Disadvantages
✓SD can vary widely from round to round
✓Can be unreliable with small number of labs
✓Can lead to approximately same proportion of “action signals”
(unacceptable)
✓There is no useful interpretation of suitability of a result
based on intended use (shows only that a lab agrees with
others in the scheme). This can be important when the
measurand involves health or safety.
60
TECHNICAL
COOPERATION
61
TECHNICAL
COOPERATION
9.4-9.7 Scores
64
TECHNICAL
COOPERATION
𝑥𝑖 − 𝑥𝑝𝑡
𝑧𝑖 =
𝜎𝑝𝑡
• if |z| < 2, the performance is satisfactory;
• if 2 <|z| < 3, the performance is questionable -“warning signal”
• if |z| > 3, the performance is unsatisfactory –“action signal”
• Assumption: the individual z scores have a Gaussian or
normal distribution with a mean of zero and a standard
deviation of one. On this basis analytical results can be
described as 'well-behaved’.
67
TECHNICAL
COOPERATION
9.5 z’ score
• A slight variation to z score, to allow consideration of
uncertainty of xpt
𝑥𝑖 − 𝑥𝑝𝑡
𝑧𝑖′ =
2
𝜎𝑝𝑡 + 𝑢2 𝑥𝑝𝑡
68
TECHNICAL
COOPERATION
9.6 ζ score
69
TECHNICAL
COOPERATION
9.7 En scores
• En (Error, normalized) is a conventional score for PT in
calibration, but can be applied anywhere
𝑥𝑖 − 𝑥𝑝𝑡
𝐸𝑛𝑖 =
𝑈 2 𝑥𝑖 + 𝑈 2 𝑥𝑝𝑡
70
TECHNICAL
COOPERATION
En and ζ scores
• Scores that evaluate performance compared to claimed
uncertainty must be interpreted with caution, because some
participants might not calculate uncertainty correctly (GUM), or
report them correctly.
• A large uncertainty leads to lower scores; small uncertainty
leads to higher scores
• Often useful to report En and ζ in addition to a conventional
score (e.g., z z’ D D%)
71
TECHNICAL
COOPERATION
72
72
TECHNICAL
|z| score was satisfactory COOPERATION
|En| value indicates that some results are not in agreement with the assigned
values when the uncertainties are taken into consideration.
73
73
TECHNICAL
Performance statistics and criteria COOPERATION
74
74
TECHNICAL
COOPERATION
9.8 Evaluation of participant measurement uncertainties
75
TECHNICAL
COOPERATION
76
TECHNICAL
COOPERATION
77
TECHNICAL
COOPERATION
9.9 Combined performance scores
78
TECHNICAL
COOPERATION
10 Graphical techniques
79
TECHNICAL
COOPERATION
10.2 Histograms
8 d1
7
6
5
Nr of Labs
4
3
2
1
0
z-score
81
TECHNICAL
COOPERATION
82
TECHNICAL
COOPERATION
Measurand
83
TECHNICAL
COOPERATION
3.00
2.00
1.00
0.00
A B C D E F G H I J K L M N O P Q R S T U V WX Y Z a
-1.00
-2.00
-3.00
d1 f1 e3
-4.00
85 85
TECHNICAL
COOPERATION
86
86
TECHNICAL
COOPERATION
87
87
TECHNICAL
COOPERATION
8888
TECHNICAL
COOPERATION
10.5 Youden Plot- Interpretation
95% confidence ellipse with dashed lines indicating median values for
each of the samples
significant
systematic error
components
random error
components
89 89
TECHNICAL
COOPERATION
91
91
TECHNICAL
10.6 Repeatability Standard Deviations Plots
COOPERATION
1.6
1%
1.4 0.10%
1.2
0.8
0.6
0.4
0.2
0
0.8 1.3 1.8 2.3 2.8
92
92
TECHNICAL
COOPERATION
93
TECHNICAL
COOPERATION
94
TECHNICAL
COOPERATION
95
TECHNICAL
COOPERATION
96
TECHNICAL
COOPERATION
11.2 Statistical design
• Homogeneity
✓Test suitable number of items
✓All results should be the same
• Stability
✓Should not be a factor in identity
✓Concern for presence if not stable
• Performance criterion based on expert judgment, often after
review of results
✓Preferred to have a panel of experts, and defined criteria for
their agreement
97
TECHNICAL
COOPERATION
98
TECHNICAL
COOPERATION
99
TECHNICAL
COOPERATION
Annexes
100
TECHNICAL
COOPERATION
Annex B.4-Stability
102
TECHNICAL
COOPERATION
Annex B.4-Stability
103
TECHNICAL
COOPERATION
Annex B.4-Stability
104
TECHNICAL
COOPERATION
106
TECHNICAL
COOPERATION
107
TECHNICAL
COOPERATION
Annex C: Robust analysis
108
TECHNICAL
COOPERATION
Annex C: Robust analysis
Computationally
Simple techniques: Conventional
intense techniques:
109
TECHNICAL
COOPERATION
110
TECHNICAL
COOPERATION
2b) i
x *
(x *
−x )
* 2
x =
* i =1
s = 1,134
* i
p p −1
113
TECHNICAL
COOPERATION
1,25 s *
ux =
p
114
TECHNICAL
COOPERATION
115
TECHNICAL
COOPERATION
Annex D Additional Guidance
• Few participants, but the assigned value or the dispersion, or
both, need to be derived from participant results.
• Identify outliers
✓Robust statistics are recommended when populations are
outlier-contaminated- Not recommended for very small
data sets.
✓Possible scenario: Identify outliers, reject them, calculate
mean and SD
116
TECHNICAL
COOPERATION
Annex D Additional Guidance
• Few participants, but the assigned value or the dispersion, or
both, need to be derived from participant results.
• Considerations for estimates of location (mean) and dispersion
(SD)
✓Efficiency and breakdown points for robust procedures for
the criterion on limiting the uncertainty (u(xpt) < 0.3σpt or
u(xpt) < 0.1δE )
▪ Simple mean: p=12
▪ Median: p=18
▪ Algorithm A: p=12
• Breakdown point is the proportion of values in the data set that
can be outliers without the estimate being adversely affected.
117
TECHNICAL
COOPERATION
118
TECHNICAL
Annex D Additional Guidance COOPERATION
119
TECHNICAL
COOPERATION
Semi-Quantitative-Example
121
TECHNICAL
COOPERATION
Semi-Quantitative-Example
Sample A Sample B
122
TECHNICAL
COOPERATION
40% 40%
35% 35%
30% 30%
25% 25%
20% 20%
15% 15%
10% 10%
5% 5%
0% 0%
1 2 3 4 1 2 3 4
123
TECHNICAL
COOPERATION
𝑛
1 2
𝑈 =2× × 𝑋𝑖 − 𝑋𝑅
𝑛 𝑛−1
𝑖=1
𝑛
1
𝐷𝑖 = 𝑋𝑖 − 𝑋𝑖
𝑛
𝑖=1
Di offset
Xi measuring result of a participant
n number of participants
i count index
TECHNICAL
COOPERATION
𝑛
1 2 2
𝑢2 2
𝐷𝑖 = 2 𝑢 + 1 − 𝑢𝑖
𝑛 𝑛
𝑖=1
Di offset
u standard measuring uncertainty of the offset
uj standard measuring uncertainty of a participant
n number of participants
i count index
TECHNICAL
COOPERATION
𝑛 2
𝜕𝑋𝑅
𝑈𝑅 = 2 × × 𝑢𝑖2
𝜕𝑋𝑖
𝑖=1
133
TECHNICAL
COOPERATION
134
TECHNICAL
COOPERATION
9 Bi-modal Distribution
8
0
2 3 4 5 6 7 8 9
136
TECHNICAL
COOPERATION