Asif Hanif: Analysis With Chi-Square
Asif Hanif: Analysis With Chi-Square
2
2
0.5
ij ij
ij
O E
E
_
(
=
Co-efficient of Contingency
Contingency coefficient. A measure of association based
on chi-square. The value ranges between zero and 1,
with zero indicating no association between the row and
column variables and values close to 1 indicating a high
degree of association between the variables.
2
2
C
n
_
_
=
+
Phi & Cramers V
The Phi coefficient is a degree of association
between two attributes and is calculated as:
( )( )( )( )
2
ad bc
Phi
n
a b c d a c b d
_
|
= = =
+ + + +
Kendalls Tau b
Kendall's tau-b: This test is used to measure strength of
correlation when we have ordinal data. The sign of the
coefficient indicates the direction of the relationship, and its
absolute value indicates the strength, with larger absolute
values indicating stronger relationships. Possible values
range from -1 to 1, but a value of -1 or +1 can only be
obtained from square tables.
Where S=P-Q
P=Concordant pairs of observation
Q=Discordant pairs of observation
m=min(r,c)
( )( )
0 0
b
S
P Q X P Q Y
t =
+ + + +
Example
An animal epidemiologist tested dairy cows for the presence of a
bacterial disease. The disease is detected by the analysis of
blood samples, and the disease severity for each animal was
classified as None (1), Low (2) and High (2). Moreover, the size
of the herd that each cow belongs to a category is classified as
Large (3), Medium (2) and Small (1). The number of animals in
each of the 9 cells are recorded as:
Size of the
herd
None (1) Low (2) High (3) Total
Small (1) 9 5 9 23
Medium (2) 18 4 19 41
Large (3) 11 88 136 235
Total 38 97 164 299