FCM2063 Printed Notes.
FCM2063 Printed Notes.
 
,
Example 1.6: 
8 teaching assistants are available to grade an exam of four questions. Wish to select a 
different assistant to grade each question (only one assistant per question). How many 
possible ways can the assistant are chosen for grading?
   
Solution: 
The number of possible ways is
Combination
Combination: an unordered subset of k objects taken from a set of n distinct objects.
The number of ways of combination of k objects from n distinct objects is denoted by the 
symbol C
k,n
Permutation vs. Combination
Permutations are larger in number than combinations: e.g., the three numbers (1,2, 3), (1, 
3,2)  (2,3,1) ,  (3,1,2), (3,2,1)  are  all different  permutations  of the numbers 1,  2 and  3. 
However, they all represent the same combination of numbers.
Example 1.7:
Fifteen players compete in a tournament. In how many ways can
a) rankings be assigned to the top five competitors?
b) the best five competitors be randomly chosen?
Solution
The number of rankings that can be assigned to the top five competitors is
The number of ways that five competitors can be chosen is
11
1680
8
4
    P P
n
k
! !
!
,
) ( k n k
n
k
n
C C
k
n n k
,
_
 
,
!
!( )! !
k n k
n
n P
n
C
k k n k k
   _
      
   
   ,
360 , 360
15
5
    P P
n
k
003 , 3
! 10 ! 5
! 15
5
15
 
,
_
n
k
C
1.3 Conditional Probability; Independent Events; 
Sometimes it is useful to know the probability that an event will occur given that another 
event occurred. Given two possible events, if we know that one event occurred then this 
information can be applied in calculating the other events probability. 
1.3.1    Conditional Probability
The conditional probability of A, given that B has already occurred, is denoted as P (A |  
B) and defined as:
0 ) ( provided ,
) (
) (
) (   
 B p
B p
B A p
B A p
The conditional probability of B, given that A has already occurred, is denoted as P ( B |  
A) and defined as:
0 ) ( provided ,
) (
) (
) (   
 A p
A p
B A p
A B p
Example 1.8:
The   Information  Resource   Center(IRC),   UTP  displays   three   types   of   books   entitled 
Science (S), Engineering (E),   and Technology   (T). Reading habits of randomly 
selected reader with respect to these types of books are
  Read regularly       S        E        T          SE    ST     ET    SET
    Probability          0.14    0.23     0.37      0.08    0.09     0.13       0.05
Find the following probabilities and interpret
a) P( S | E )
b) P( S |E U T )
c) P( S | reads at least one )
d) P( S U E | T)
Solution:
12
3478 . 0
23 . 0
08 . 0
) (
) (
) (    
E p
E S p
E S p
2553 . 0
47 . 0
12 . 0
) (
) (
) (    
 
 
T E p
T E S p
T E S p
2857 . 0
49 . 0
14 . 0
) (
) (
) (
) (
                                 
) ( ) one least  at  reads (
 
 
  
T E S p
S p
T E S p
T E S S p
T E S S p S p
5946 . 0
37 . 0
22 . 0
) (
) (
) (    
 
 
T p
T E S p
T E S p
1.3.2    Independent Events 
The probability of both events occurring can be calculated by rearranging the terms in the 
expression of conditional probability. 
Two events A and B are called independent if the probability of event A is not affected by 
the occurrence of event B, so                          and 
Example 1.9:
In rolling a fair die, let event A = {1, 3, 5} and event B = {4, 5, 6}.
Are events A and B independent?
Solution:
13
0.07
0.20
0.02
0.04
0.03
0.08
0.05
S
    E
T
39 . 3  x
) ( ) | ( A P B A P   
) ( ) ( ) ( B P B A P B A P    
) ( ) ( ) ( B P A P B A P    
P(A) = , P(B)=1/2 and 
Since                                            , so A and B are not independent events.
Whats the difference between mutually exclusive and independent events?
Two  events  mutually  exclusive (disjoint):  both cannot happen when  the  experiment  is 
performed, so P( A| B) = 0, or vice versa
Two mutually exclusive events:  P(A  B) = 0 and  P( A U B) = P(A) + P(B)
Mutually exclusive events must be dependent.
Two events are independent: P( A U B) = P(A) + P(B)  P(A  B)
Example 1.10:
Toss a single die and observe the events
            A: a number less than 4
               B: a number less than or equal to 2
            C: a number greater than 3
        Are events A and B independent? Are events A and B mutually exclusive?
        Are events A and C independent? Are events A and C mutually exclusive?
   
Solution:
P(A) =  , P(B) = 1/3, P(C ) =  
            P( A | B)  P(A), A and B dependent but not mutually exclusive.
           A and C are dependent but mutually exclusive.
1.4 Bayes theorem
1.4.1 Multiplicative Law of Probability and Independence
For two events A and B, 
( ) ( | ). ( ) P A B P A B P B    
Events A and B are independent if and only if
If events A
1
, .., A
k
 are independent then,
14
( ) ( ). ( ) P A B P A P B    
1 2 1 2
( ... ) ( ) ( ) ( )
k k
P A A A P A P A P A                
) ( ) ( ) ( B P A P B A P    
6 / 1 ) (    B A P
Multiplication  rule  is   most   useful   when  the  experiment   consists   of   several   stages   in 
succession. The conditioning event, B, describes the outcome of the first stage and A the 
outcome of the second, so that P( A| B)  conditioning on what occurs first will often be 
known.
Example 1.11:
During  a  space  shot,   the  primary  computer   system  is   backed  up  by  two  secondary 
systems. They operate independently of one another, and each is 95% reliable. 
What is the probability that all three systems will be operable at the time of the launch?
Solution
Let,
A
1
: event main system is operable
A
2
: event first backup is operable
A
3
: event second backup is operable
Given P(A
1
) = P(A
2
) = P(A
3
) = 0.95
Since they operate independently
P(A
1
  A
2
 A
3
) = P(A
1
)P(A
2
) P(A
3
) = 0.857
1.4.2 The Law of Total Probability
Suppose   B
1
, B
2
 ,, B
n
  are mutually exclusive and exhaustive in  S, then for any event A
1.4.3 Bayes Theorem
Suppose  B
1
, B
2
 ,, B
n
   are mutually exclusive and exhaustive (whose union is S). Let A 
be an event such that P(A) > 0. Then for any event  B
j
 , j =1, 2, , n,
Example 1.12:  
A  store  stocks  bulbs  for  LCD  projector  from  three  suppliers.     Suppliers  A,  B,   and  C 
supply  10%,   20%,   and  70%  of   the  bulbs  respectively.     It   has  been  determined  that 
company As bulbs are 1% defective while company Bs are 3% defective and company 
15
1
( ) ( | ) ( )
( | )
( )
( | ) ( )
k k k
k n
i i
i
P A B P A B P B
P B A
P A
P A B P B
1 1
( ) ( ) ( | ) ( )
n n
i i i
i i
P A P A B P A B P B
   
      
   
Cs are 4% defective.  If a bulb is selected at random and found to be defective, what is 
the probability that it came from supplier B?
Solution:
Let D is a defective, then the probability that it came from supplier B is
16
(   )
  (   )   (   )
(   )   (   )   (   )   (   )   (   )   (   )
|
|
| | |
P B P D B
P B D
P A P D A P B P D B P C P D C
+   +
(   )
(   )   (   )   (   )
0.2 0.03
0.1 0.01 0.2 0.03 0.7 0.04
+   +
0.1714 
Exercise Chapter 1:
1.Each  message  in  a  digital   communication  system  is  classified  as  to  whether  it   is  received 
within  the  time  specified  by  the  system  design.   If  3  messages  are  classified,   what   is  an 
appropriate sample space for this experiment?
2.A digital scale is used that provide weights to the nearest gram. Let event A: a weight exceeds 
11 grams, B: a weight is less than or equal to 15 grams, C: a weight is greater than or equal to 
8 grams and less than 12 grams. What is the sample space for this experiment? and find
(a)  A U B       (b)  A         (c) A  B
(d)  (A U C)   (e)  A  B  C    (f)  B  C 
3. Samples of building materials from three suppliers are classified for conformance to air-
quality specifications. The results from 100 samples are summarized as follows:
                          
Conforms
Yes No
Supplie
r
R 30 10
S 22 8
T 25 5
Let A denote the event that a sample is from supplier R, and B denote the event that a sample 
conforms  to  the  specifications.   If  sample  is  selected  at   random,   determine  the  following 
probabilities:
(a)   P(A)          (b)   P(B)           (c)  P(B)
(d)  P(AUB)     (e)   P(A B)    (f)  P(AUB)
(g)  
) ( B A P
  (h)  
) ( A B P
4. The compact discs from a certain supplier are analyzed for scratch and shock resistance. The 
results from 100 discs tested are summarized as follows:
                          
Scratch 
Resistance
High Low
Shock 
Resistance
High 30 10
Medium 22 8
Low 25 5
Let A denote the event that a disc has high shock resistance, and B denote the event that a 
17
disc has high scratch resistance.  If sample is selected at random,  determine the following 
probabilities:
(a)   P(A)          (b)   P(B)           (c)  P(B)
(d)  P(AUB)     (e)   P(A B)    (f)  P(AUB)
(g)  
) ( B A P
  (h)  
) ( A B P
5. The reaction times ( in minutes) of a reactor for two batches are measured in an experiment.
(a) Define the sample space of the experiment. 
(b) Define event A where the reaction time of the first batch is less than 45 minutes and event 
     B is the reaction time of the second batch is greater than 75 minutes.
(c) Find A U B, A  B and A
(d) Verify whether events A and B are mutually exclusive.
6. When a die is rolled and a coin is tossed, use a tree diagram to describe the set of possible 
outcomes and find the probability that the die shows an odd number and the coin shows a 
head.
7. A bag contains 3 black and 4 while balls. Two balls are drawn at random one at a time 
without replacement. 
(i)  What is the probability that a second ball drawn is black?  
(ii) What is the conditional probability that first ball drawn is black if the second ball is 
               known to be black?
8. An oil-prospecting firm plans to drill two exploratory wells. Past evidence is used to assess 
the possible outcomes listed in the following table:
Find and give description for  
                                                                                     
18
Event Description Probability
A
B
C
Neither well produces oil or gas
Exactly one well produces oil or gas
Both wells produce oil or gas
0.80
0.18
0.02
) ' ( ) ( ), ( B P and C B P B A P    
9. In a residential suburb, 60% of all households subscribe to the metro newspaper published in 
a nearby city, 80% subscribe to the local paper, and 50% of all households subscribe to both 
papers. Draw a Venn diagram for this problem. If a household is selected at random, what is 
the probability that it subscribes to
(a) at least one of the two newspapers
(b) exactly one of the two newspapers
10. In a student organization election, we want to elect one president from five candidates, one 
vice  president  from  six  candidates,     and  one  secretary  from  three  candidates.   How  many 
possible outcomes?
11. Suppose  each  student  is  assigned  a  5  digit   number.   How  many  different  numbers  can  be 
created?
12. A chemical engineer wishes to conduct an experiment to determine how these four factors 
affect   the  quality  of  the  coating.   She  is  interested  in  comparing  three  charge  levels,   five 
density  levels,   four   temperature  levels,   and  three  speed  levels.   How  many  experimental 
conditions are possible? 
13. A menu  has  five appetizers,   three soup, seven main course,  six  salad  dressings  and eight  
desserts. In how many ways can
(a) a full meal be chosen?   
(b) a meal be chosen if either and appetizer or a soup is ordered, but not both?
14. Ten  teaching  assistants  are  available  to  grade  a  test   of   four  questions.   Wish  to  select   a 
different   assistant   to  grade  each  question  (only  one  assistant   per   question).   How  many 
possible ways can the assistant be chosen for grading?
15. Participant samples 8 products and is asked to pick the best, the second best, and the third 
best. How many possible ways?
16. Suppose that in the taste test, each participant samples eight products and is asked to select  
the three best products. What is the number of possible outcomes?
17. A contractor has 8 suppliers from which to purchase electrical supplies. He will select 3 of 
these at random and ask each supplier to submit a project bid. In how many ways can the 
selection of bidders be made?
18. Twenty players compete in a tournament. In how many ways can
(a) rankings be assigned to the top five competitors?
(b) the best five competitors be randomly chosen?
19. Three   balls   are   selected   at   random  without   replacement   from  the   jar   below.   Find  the 
probability that one ball is red and two are black.
19
20. A university warehouse has received shipment of 25 printers, of which 10 are laser printers 
and 15 are inkjet models. If 6 of  these 25 are selected at random by a technician, what is the 
probability that exactly 3 of those selected are laser printers? 
21. There are 17 broken light bulbs in a box of 100 light bulbs. A random sample of 3 light bulbs 
is chosen without replacement.
(a) How many ways are there to choose the sample?
(b) How many samples contain no broken light bulbs?
(c) What is the probability that the sample contains no broken light bulbs?
(d) How many ways to choose a sample that contains exactly 1 broken light bulb?
(e) What is the probability that the sample contains no more than 1 broken light bulb?
22. An agricultural research establishment grows vegetables and grades each one as either good 
or bad for taste, good or bad for its size, and good or bad for its appearance. Overall, 78% of 
the vegetables have a good taste. However, only 69% of the vegetables have both a good 
taste and a good size. Also, 5% of the vegetables have a good taste and a good appearance,  
but a bad size. Finally, 84% of the vegetables have either a good size or a good appearance.
(a)  if a vegetable has a good taste, what is the probability that it   also has a good size?
(b)  if a vegetable has a bad size and a bad appearance, what is the probability that it has a 
           good taste?
23. A  local   library  displays   three   types   of   books   entitled  Science   (S),   Arts   (A),     and 
Novels    (N).  Reading habits of  randomly selected reader with respect to these types  of 
books are
              Read regularly        S        A        N        SA    SN     AN    SAN
                 Probability         0.14    0.23    0.37     0.08      0.09       0.13        0.05
Find the following probabilities and interpret
(a) P( S | A )
(b) P( S | A U N )
(c) P( S | reads at least one )
(d) P( S U A | N)
24. A batch of 500 containers for frozen orange juice contains 5 that are defective. Two are  
selected at random, without replacement, from the batch. Let A and B denote that the first 
and second selected is defective respective
(a) Are  A and B independent events?     
(b) If the sampling were done with replacement, would A and B be independent?
25. Everyday  (Mon  to  Fri)   a  batch  of  components  sent   by  a  first   supplier   arrives  at   certain 
inspection facility.  Two days a week,  a batch also arrives from a second supplier.  Eighty 
percent of all batches from supplier 1 pass inspection, and 90% batches of supplier 2 pass 
inspection.   On  a  randomly  selected  day,   what   is   the   probability  that   two  batches   pass 
inspection? 
20
26. The probability is 1% that an electrical connector that is kept dry fails during the warranty 
period of a portable computer. If the connector is ever wet, the probability of a failure during 
the warranty period is 5%. If 90% of the connectors are kept dry and 10% are wet, what 
proportion of connectors fail during the warranty period?
27. Computer keyboard failures are due to faulty electrical connects (12%) or mechanical defects 
(88%).   Mechanical   defects  are  related  to  loose  keys  (27%)  or  improper  assembly  (73%). 
Electrical connect defects are caused by defective wires (35%), improper connections (13%) 
or poorly welded wires (52%). Find the probability that a failure is due to
(a) loose keys
(b) improperly connected or poorly welded wires.
28. During a space shot, the primary computer system is backed up by two secondary systems. 
They operate independently of one another, and each is 90% reliable. What is the probability 
that all three systems will be operable at the time of the launch?
29. A store stocks light bulbs from three suppliers.  Suppliers A, B, and C supply 10%, 20%, and 
70%  of  the  bulbs  respectively.     It   has  been  determined  that   company  As  bulbs  are  1% 
defective while company Bs are 3% defective and company Cs are 4% defective.  If a bulb 
is selected at random and found to be defective, what is the probability that it came from 
supplier B?
30. A particular city has three airports. Airport A handles 50% of all airline traffic, while airports 
B and C handle 30% and 20%, respectively. The rates of losing a baggage in airport A, B and 
C are 0.3, 0.15 and 0.14 respectively. If a passenger arrives in the city and losses a baggage, 
what is the probability that the passenger arrives at airport A?  
31. A  company  rated  75%  of   its   employees   as   satisfactory  and  25%  unsatisfactory.   Of   the 
satisfactory  ones  80%  had  experience,   of   the  unsatisfactory  only  40%.   If   a  person  with 
experience is hired, what is the probability that (s)he will be satisfactory?
32. In   a   certain   assembly   plant,   three   machines,   B
1
,   B
2
,   B
3
,   make   30%,   45%  and   25%, 
respectively, of the products. It is known from past experience that 2%,3% and 2% of the 
products made by each machine,  respectively, are defective. Now, suppose that a finished 
product is randomly selected. 
(a) What is the probability that it is defective?
(b) If a product was chosen randomly and found to be defective, what is the probability 
that 
   it was produced  by machine B
3
?
33. Three machines A, B and C produce identical items of their respective output 5%,  4%  and 
3% of the items are faulty.  On a certain day A has produced 25%, B has produced 30% and 
C has produced 45% of the total output.   An item selected at random is found to be faulty. 
What are the chances that it was produced by C?
21
34. Suppose that a test for Influenza A, H1N1 disease has a very high success rate: if a tested 
patient has the disease, the test accurately reports this, a positive, 99% of the time, and  if a 
tested patient does not have the disease, the test accurately reports that, a negative, 95% of 
the time. Suppose also, however, that only 0.1% of the population have that disease.
(a) What is the probability that the test returns a positive result?
(b) If the patient has a positive, what is the probability that he has the disease?
(c) What is the probability of a false positive?
35. An insurance company charges younger drivers a higher premium than it 
does older drivers because younger drivers as a group tend to have more 
accidents.   The company has 3 age groups: Group A includes those less 
than 25 years old, have a 22% of all its policyholders.   Group B includes 
those  25-39  years   old,   have  a  43%  of   all   its   policyholders,   Group  C 
includes those 40 years old and older, have 35% of all its policyholders. 
Company  records  show  that   in  any  given  one-year   period,   11%  of   its 
Group A policyholders have an accident.   The percentages for groups B 
and C are 3% and 2%, respectively.
(a)What is the probability that the companys policyholders are expected 
to have an accident during the next 12 months? 
(b)Suppose   Mr.   Chong  has   just   had  a   car   accident.     If   he   is   one   of   the   companys 
policyholders, what is the probability that he is under 25? 
22
Chapter 2
2. Discrete random variable and discrete probability distributions
Learning objectives:
At the end of this chapter, student should be able to:
 Define the random variables
 Differentiate between discrete and continuous random variables
 Define the discrete probability distributions
 Know the special functions for discrete probability distribution 
2.1 Introduction
A random variable  is a rule that assigns a number to each outcome of an experiment. 
These numbers are called the measured values of the random variable. The capital letters 
like X, Y and Z is used to denote a random variable and the small letters like x, y and z to 
denote the measured values.
Example 2.1:
Select a soccer player; the random variable Y is the number of goals the player has 
scored during the season.
The measured values of Y are 0, 1, 2, 3,
The test marks for 100 engineering students; the random variable Z is the average number 
of goals scored by the students. 
The values of Z are 65.4, 67.8, 70.5, 77.3,
23
There   are   two  types   of   random  variables   called   a   discrete   random  variable   and   a 
continuous random variable.
2.2 Discrete random variable
The measured values for a discrete random variable are finite or countable. The values 
are in terms of integer value. The number of students in this class is the example of a 
discrete random variable. 
2.3 Continuous random variable
The measured values for continuous random variables are in terms of real number in the 
range. It can be any values within the range. The weight of students in this class is the 
example of a continuous random variable.
Example 2.2:
Identify whether the random variable below is discrete or continuous random variable.
(i) The number of female students in the class.
(ii) The number of telephone calls.
(iii) The time between two accidents.
(iv) The number of cracks in a certain length of road.
(v) The height of the athletes participated in the Asian Game.
(vi) The volume of water in the tank.
(vii) A score on the statistics final examination. 
(viii) The number of cars on the road at a certain period of time.
Solution:
(i) discrete
(ii) discrete
(iii) continuous
(iv) discrete
(v) continuous
(vi) continuous
(vii) continuous
(viii) discrete
2.4 Discrete probability distribution 
If a random variable is a discrete variable, its probability distribution is called a 
discrete probability distribution or probability mass function, pmf.
24
Suppose the experiment is flipping a coin two times. This simple experiment can have 
four  possible  outcomes  or  sample  space:   HH,   HT,  TH,   and  TT.   Now,   let  the  random 
variable X  represent the number of Heads that result from this experiment. The random 
variable X can only take on the values 0, 1, or 2, so it is a discrete random variable. 
The probability distribution for this experiment appears as below. 
Number of 
heads, X
Probability function, P(X)
0 1/4
1 1/2
2 1/4
The above table represents a discrete probability distribution and the probability function, 
P(X=x
i
) is called probability mass function (pmf) of X because it relates each value of a 
discrete random variable with its probability of occurrence. 
2.4.1 The properties of the probability mass function (pmf)
The pmf, P(X=x
i
) of a discrete random variable X must satisfied two conditions;
(i) 1 ) ( 0     
i
x X P
(ii) 
1 ) (       
i
x
i
x X P
Given pmf, the probability of X occurs can be calculated. For example the probability at 
most one occurs is 
) 1 ( ) 0 ( ) 1 (    +    X P X P X P
Example 2.3:
Two balls are drawn at random in succession without replacement from an urn containing 
4 red balls and 6 black balls. Find the probabilities of all the possible outcomes. 
25
Solution:
Let X denote the number of red balls in the outcome.
Possible 
outcomes
RR RB BR BB
X 2 1 1 0
Here, x
1
 = 2, x
2
 = 1, x
3
 = 1, x
4
 = 0 
Now, the probability of getting 2 red balls when we draw out the balls one at a time is:
Probability of first ball being red = 4/10
Probability of second ball being red = 3/9 (because there are 3 red balls left in the urn, out 
of a total of 9 balls left.) So: 
Likewise, for the probability of red first is 4/10 followed by black is 6/9 (because there 
are 6 black balls still in the urn and 9 balls all together). So: 
Similarly for black then red: 
Finally, for 2 black balls: 
So the probability distribution is:
X 2 1 0
P(X=x) 2/15 8/15 5/15
Example 2.4:
Given the probability distribution,
26
X 0 1 2 3 4 5
P(X=x) 1/10 1/5 k 1/5 3/10 1/10
Find the value of k that makes P(X=x) a valid pmf of X.
Solution:
For P(X=x) is truly pmf of X, its must satisfy
1 ) (       
i
x
i
x X P
. Hence,
10 / 1 1 10 / 1 10 / 3 5 / 1 5 / 1 10 / 1 ) (      + + + + +      k k x X P
i
x
i
.
2.4.2 The cumulative distribution function (cdf) 
The cdf of a discrete random variable X is defined by, 
                   
) ( ) ( ) (         
x x
i
i
x X P x X P x F
Example 2.5:
Given pmf,
X 2 1 0
P(X=x) 2/15 8/15 5/15
Find the cdf of X.
Solution:
For x < 0, 
0 ) ( ) (      x X P x F
For 
15 / 5 ) 0 ( ) 0 ( ) ( , 1 0        <  x P X P x F x
For 
15 / 13 15 / 8 15 / 5 ) 1 ( ) 0 ( ) 1 ( ) ( , 2 1    +   +     <  x P x P X P x F x
For 
1 15 / 2 14 / 8 15 / 5                   
) 2 ( ) 1 ( ) 0 ( ) 2 ( ) ( , 2
 + + 
 +  +      x P x P x P X P x F x
So the cdf of X is:
27
'
< 
< 
<
2 , 1
2 1 , 15 / 13
1 0 , 15 / 5
0 , 0
) (
x
x
x
x
x F
2.4.3 The mean and the variance of X
Given the pmf of X, P(X=x), all the parameters of X such as the mean, the variance and 
the standard deviation can be determined by using the expectation definition.
 
The mean of X is defined by,
                                             
                        ) ( ) (
1
     
n
i
i i
x X P x X E 
The variance of X is defined by,
2
1
2 2 2 2
) ( )) ( ( ) ( ) (               
n
i
i i
x X P x X E X E X Var
The standard deviation,  is a square root of the variance. 
Example 2.6:
Let X is a random variable with pmf,
X 2 1 0
P(X=x) 2/15 8/15 5/15
Find the mean, the variance and the standard deviation of  X.
Solution:
28
The mean of X is:
8 . 0 15 / 12 ) 15 / 5 ( 0 ) 15 / 8 ( 1 ) 15 / 2 ( 2 ) ( ) (
1
    + +       
n
i
i i
x X P x X E 
The variance of X is:
(   )
  0.427 0.64 - 16/15          
) 8 . 0 ( ) 15 / 5 ( 0 ) 15 / 8 ( 1 ) 15 / 2 ( 2          
) ( )) ( ( ) ( ) (
2 2 2 2
2
1
2 2 2 2
 
 + + 
        
 
n
i
i i
x X P x X E X E X Var
Standard deviation is  = 0.653
2.5 Special functions for discrete probability distribution
There are many special discrete probability distributions such as Bernoulli distribution, 
Binomial distribution and Poisson distribution.
2.5.1 Bernoulli distribution
The experiment conducted with only two possible outcomes. In an experiment of tossing 
a fair coin for 1 time and X is the number of head. There are only two possible outcomes, 
X =0 or X=1 with probability distribution:
Possible 
outcomes
Head Tail
X 1 0
P(X=x) 1/2 1/2
2.5.2  Binomial distribution
If  the  Bernoulli   experiment   conducted  for  n  times,   and  the  random  variable  X  is  the 
number of success, then the probability distribution of X  is called Binomial distribution 
with pmf,
29
,....... 3 , 2 , 1 , 0 , ) (   
,
_
 
  
x q p
x
n
x X P
x n x
where p is the probability of success and q=1-p.
By using the definition, it can be shown that, if  X  is a Binomial distribution, then the 
mean of X is E(X) = np and the variance of X, is Var(X) = npq.
Example 2.7:
In the experiment of tossing a fair coin for 10 times, and X is the number of head. 
(i) What is the pmf of X?. 
(ii) Find the probability the head will appear exactly 5 times.
(iii) What is the probability no head?
(iv) Find the mean and the variance of X.
Solution:
 
(i) The probability mass function of X is given by:
,....... 3 , 2 , 1 , 0 , ) 5 . 0 ( ) 5 . 0 (
10
) (
10
,
_
 
  
x
x
x X P
x x
(ii)
246 . 0 ) 5 . 0 ( ) 5 . 0 (
! 5 ! 5
! 10
) 5 . 0 ( ) 5 . 0 (
5
10
) 5 (
5 5 5 5
 
,
_
  X P
30
(iii)
00097 . 0 ) 5 . 0 ( ) 5 . 0 (
! 10 ! 0
! 10
) 5 . 0 ( ) 5 . 0 (
0
10
) 0 (
10 0 10 0
 
,
_
  X P
(iii) The mean of X is np = 10(0.5)=5 
The variance of X is npq=10(0.5)(0.5)=2.5
2.5.3 Poisson distribution
Another important discrete distribution is a Poisson distribution. The random variable X 
is  the   number   of   occurrences   in   the   interval   of   interest.   The   example   of   Poisson 
distribution is the number of accidents within a certain period of times. If X is a random 
variable with a Poisson distribution then the pmf of X is given by,
,....... 3 , 2 , 1 , 0 ,
!
) (     
x
x
e
x X P
x
+  
 +       >
 
e e
X P x P X P X P
31
Exercise 2
1. Identify each of the random variables as continuous or discrete random variable.
(a) The number of atoms 
(b) The number of fish in a pond
(c) The home team score in a football game
(d) The voltage on a power line
(e) A score on the mathematic final exam
(f) The volume of gas in the tank 
(g) The number of cars at the petrol station
(h) The number of accidents in Ipoh
(i) The number of cakes left in the pantry
(j) The height of civil engineering students in UTP
2. Let,
x 0 1 2 3
0.1
5
0.2
5
k 0.3
5
(i) Find the value of k that result in a valid probability distribution.
(ii) Find the expected number of X and the standard deviation of X.
(iii) What is the probability that X greater than or equal to 1?
3.At UTP, the business students run an investment club. Each semester they create investment 
portfolios   in  multiples   of   RM1,000  each.   Records  from  the  past   several   years   show  the 
following  probabilities  of  profits  (rounded  to  the  nearest   RM50).   In  the  table  below,  x  = 
profit per RM1, 000 and P(x) is the probability of earning that profit.
     x 0 50 10
0
15
0
200
0.1
5
0.3
5
k 0.2 0.0
5
(a) Determine the value of k that results in a valid probability distribution.
(b) The profit per RM1, 000 is a random variable. Is it discrete or continuous? Explain.
(c) Find the expected value of the profit in a $1,000 portfolio.
(d) Find the standard deviation of the profit.
(e) What is the probability of a profit of $150 or more in a RM1, 000 portfolios?
32
4. Let  X  denote  the  number  of  bars  of  service  on  your  cell   phone  whenever  you  are  at   an 
intersection with the following probabilities: 
x 0 1 2 3 4 5
0.0
5
0.1
5
0.2
0
0.3
5
0.1
5
0.
1
Determine the following: 
(a) F(x)
(b) Mean and variance
(c) P(X < 2)
(d) P(X >2.5)
5. A local cab company is interested in the number of pieces of luggage a cab carries on a taxi 
run. A random sample of 260 taxi runs gave the following information. x = number of pieces 
of luggage and f is the frequency with which taxi runs carried x pieces of luggage.
x :   0      1         2     3      4       5      6       7      8       9    10
f :        42    51      63     38    19    16     12    10     6       2       1 
(a) Find the probability distribution for x.
(b) Estimate the probability that a taxi run will have from 0 to 4 pieces of luggage  
           (including 0 and 4).
(c) What is the expected value of x?
(d) What is the standard deviation of x.
6. A Professor estimates the probability that he will receive at least one telephone call at home 
during the hours of 5pm to 7pm on a weekday to be 1/3. Use the formulas for computing 
binomial probabilities to answer the following questions:
(a)  What is the probability that he will receive at least one call on all five of the next five 
weekday nights?
(b)  What is the probability that he will not receive a call on any of the next five weekday 
nights?
(c)  What is the probability that he will receive a call on at least four of the next five weekday 
nights?
7. The probability of successfully landing a plane using a flight simulator is given as 0.80. Nine 
randomly and independently chosen student pilots are asked to try to fly the plane using the 
simulator. 
(a) What is the probability that all the student pilots successfully land the plane using the 
simulator?
(b) What is the probability that none of the student pilots successfully lands the plane using 
the simulator?
33
(c) What is the probability that exactly eight of the student pilots successfully land the plane 
using the simulator?
8.Suppose X has a Poisson distribution a mean of 7. Determine the following. 
(a) P(X = 0);
(b) P(X = 5);
(c) P(X < 3); and
(d)
) 4 (    X P
.
9.At the Mc Donald drive-thru window of food establishment, it was found that during slower 
periods of the day, vehicles visited at the rate of 15 per hour. Determine the probability that
(a) no vehicles visiting the drive-thru within a ten-minute interval during one of these slow 
periods;
(b) only 3 vehicles visiting the drive-thru within a ten-minute interval during one of these 
slow periods; and
(c) at least three vehicles visiting the drive-thru within a ten-minute interval during one of 
these slow periods.
10. The number of cracks in a section of PLUS highway that are significant enough to require 
repair is assumed to follow a Poisson distribution with a mean of two cracks per kilometer. 
Determine the probability that
(a) there are no cracks at all in 2km of highway;
(b) at least one crack in 500meter of highway; and
(c) there are exactly 3 cracks in 0.5km of highway. 
34
Chapter 3
3. Continuous probability distributions
Learning objectives:
At the end of this chapter, student should be able to:
 Define the continuous probability distributions
 Know the special functions for continuous probability distribution 
3.1 Introduction
If   the   outcomes   of   the   experiment   conducted   are   continuous  random  variables,   its 
probability   distribution  is   called   a  continuous   probability   distribution  or   probability 
density function, pdf.
3.1.1 The properties of the probability density function (pdf)
The pdf, f(x) of a continuous random variable X must satisfied two conditions;
(j)
1 ) ( 0     x f
(ii) 
1 ) (   
 
dx x f
Given pdf, the probability of X occurs can be calculated. For example the probability at 
most one occurs is 
 
   
1
) ( ) 1 ( dx x f X P
Example 3.1:
Let X be continuous random variable with pdf given by,
35
'
elsewhere , 0
2 0 ,
) (
2
x kx
x f
Find the value of k that makes f(x) a valid pdf of X.
Solution:
To be a valid pdf, f(x) must satistify,
1 ) (   
 
dx x f
So,
8
3
1
3
8
0
2
3
) (
3
2
0
2
    
 
k k
x
k dx kx dx x f
3.1.2 The cumulative distribution function (cdf) 
The cdf of a continuous random variable X is defined by, 
                   
   
  
   
x dx x f x X P x F
x
, ) ( ) ( ) (
Example 3.2:
Let X be continuous random variable with pdf given by,
'
elsewhere , 0
1 0 , 3
) (
2
x x
x f
Find,
(i) P(X < 0.5),  P( 0.5 < X < 0.75)
(ii) The cdf of X.
Solution:
36
(i)
8
1
) 5 . 0 (
0
5 . 0
3 ) ( ) 5 . 0 (
3 3 5 . 0
0
2 5 . 0
  
 <
   
x dx x dx x f X P
64
19
) 5 . 0 ( ) 75 . 0 (                         
5 . 0
75 . 0
3 ) ( ) 75 . 0 5 . 0 (
3 3
3 75 . 0
5 . 0
2 75 . 0
5 . 0
  
 < < x dx x dx x f X P
37
(ii) The cdf of X, 
      <
 
x
dx x f x X P x F 0 ) ( ) ( ) (     0, For x
 
      
   +      
     
x x
x dx x dx dx x f x X P x F x
0
0
3 2
3 0 ) ( ) ( ) ( , 1 0 For 
   +
      
+     >
     
x x
dx dx x dx dx x f x X P x F x
1
0 1
0
2
1 0 3 0 ) ( ) ( ) ( , 1 For 
So the cdf of X is:
'
>
 
<
1 , 1
1 0 ,
0 , 0
) (
3
x
x x
x
x F
3.1.3 The mean and the variance of X
Given the pdf of X, f(x), all the parameters of X such as the mean, the variance 
and the standard deviation can be determined by using the expectation definition.
 
The mean of X is defined by,
                                             
                       
 
  
 
dx x xf X E ) ( ) ( 
The variance of X is defined by,
2 2 2 2
) ( )) ( ( ) ( ) (       
   
  
 
dx x xf X E X E X Var
The standard deviation,  is a square root of the variance. 
Example 3.3:
Let X be continuous random variable with pdf given by,
38
'
elsewhere , 0
2 0 ,
8
3
) (
2
x x
x f
Find the mean and the variance of X.
Solution:
The mean of X is
5 . 1
2
3
0
2
32
3
8
3
) ( ) (
4
3
2
0
    
 
  
 
x
dx x dx x xf X E 
The variance of X is, 
15 . 0
20
3
4
9
5
12
4
9
0
2
40
3
4
9
8
3
           
) 5 . 1 (
8
3
) ( ) (
5
2
0
4
2
2
0
2 2 2 2 2
      
 
  
 
x dx x
dx x x dx x f x X Var    
3.2 Special functions for continuous probability distribution
There  are  many  special   continuous   probability  distributions   such  as   Uniform 
distribution,   Exponential   distribution,   Gamma   distribution   and   Normal 
distribution.
3.2.1 Uniform distribution
The random variable X  is a uniform distribution, and then the pdf of X  is given 
by,
39
'
elsewhere , 0
,
1
) (
b x a
a b
x f
By using the definition, it can be shown that, if X is a uniform distribution, then 
the mean of X is E(X) =  and the variance of X, is Var(X) = .
3.2.2 Exponential distribution
The random variable  X  is an exponential distribution,  and then the pdf of  X  is 
given by,
'
elsewhere , 0
0 ,
) (
x e
x f
x 
'
elsewhere , 0
0 ,
) (
x e
x f
x 
 
where  is the 1/mean of X. So,  = 1/10=0.1
40
Hence, 
135 . 0
20
) 1 . 0 ( ) 20 (
2 1 . 0
20
1 . 0
 
 >
       
e e dx e X P
x x
3.2.3 Normal distribution
The random variable X is a Normal distribution, and then the pdf of X is given by,
    
x e x f
x
,
2
1
) (
2
2
) (
 
By  using  the  definition,   it   can  be  shown  that,   if  X  is  a  random  variable  with 
normal distribution,  then the mean of  X  is  E(X) =   and the variance of  X, is 
Var(X) = 
2
. If X is a random variable with normal distribution, then X is always 
be written as,
  X ~ N( , 
2
)
This distribution is called non-standard normal distribution. The probability of X 
can be found by integrate the pdf, f(x). But this integration is not easy to calculate. 
By  using  the  transformation,   Z  =  (x-m)/s,   then  the  random  variable  X  will  be 
change to random variable Z with pdf,
 
    
  
z e z f
z
,
2
1
) (
2
 <
   
 are tabulated in the Standard Normal 
Distribution Table.
Example 3.5:
After   completing   a   study,   the   civil   engineering   department   in   Universiti 
Teknologi   PETRONAS  (UTP)  concluded  that   the  time  UTP  employees  spend 
commuting  to  work  each  day  is  normally  distributed  with  a  mean  equal   to  15 
minutes and a standard deviation equal to 5 minutes. One employee has indicated 
41
that   he  commutes   25  minutes  per  day.   Find  the  probability  that   an  employee 
would commute 25 or more minutes per day,
   Solution:
The random variable X is the time employees spend commuting to work and
  X ~ N( , 
2
)
where  = 15 and 
2
=(5)
2
Hence,
023 . 0 977 . 0 1 ) 2 ( 1 ) 2 ( 1               
) 2 (
5
15 25
) 25 (
        
  
,
_
 
z P
z P
x
P X P
Exercise 3
1.Suppose that X is a continuous random variable having the probability density function
 
(a) Find the value of constant k
(b) Find  P(-0.5<X<0.5) 
(c) Determine x such that P( X > x) = 0.5
(d) Determine the mean and the variance of X.
2. Let X be a continuous random variable with pdf given by
'
   < < 
elsewhere , 0
2 1 ,
) (
x x k
x f
Find
(a) the value of constant k
(b) P(X < 1)
(c) the mean of X
(d) the standard deviation of X.
42
'
< <
elsewhere
for k
f
, 0
1 x 1 x
) x (
2
3.Let X be a continuous random variable with pdf given by
'
>
0 , 0
0 ,
) (
2
x
x kxe
x f
x
Find 
(a) the value of constant k
(b) P(X > 1)
(c) P(0 < X < 2)
(d) the mean of X
(e) the variance of X.
4. Let X be a continuous random variable with pdf given by
'
elsewhere , 0
3 0 ,
) (
2
x kx
x f
Find
(a) the value of constant k
(b) the cdf, F(x)
(c) P(X >1)
(d) the mean of X
(e) the variance of X.
5. Find the cumulative probability distribution of X  given that the density function is 
Find
(a) the value of constant k
(b) the cdf, F(x)
(c) P(0.25 < X < 0.5)
(d) the mean of X
(e) the variance of X.
6.Suppose a random variable, X  has a uniform distribution with a = 5 and b = 9. Find
(a)P(5.5 < X < 8)
(b)P(X < 7)
(c)the mean of X
43
'
< <
elsewhere
x for x k
x f
, 0
1 0 ), 1 (
) (
4
(d)the standard deviation of X.
7.Let  X   be  an  exponential   random  variable  with    =  0.01.   Calculate  the  following 
probabilities:
(a)P(X < 50)
(b)P(x > 60)
(c)P(50 < x < 60)
(d)What is the mean and the variance of X.
8. The   lifetime   of   a   certain   electronic   component   is   known   to   be   exponentially 
distributed with a mean lifetime of 100 hours. What is the probability that
(a) the lifetime of the component is more than 100hours?
(b) the lifetime  of the component is between 50 to 100hours?
(c) a component will fail before 50hours? 
9.The time between telephone calls to ASTRO, a cable television payment processing 
center follows an exponential distribution with a mean of 1.5 minutes. What is the 
probability that the time between the next two calls
(a) at least 45 seconds?
(b) will be between 50 to 100 seconds?; and
(c) at most 150 seconds?
10. The mean weight of 500 UTP students is 68kg and the variance is 72.25kg. Find the 
probability of students who weight 
(a) between 65kg and 72kg
(b) more than 70kg
11. An average LCD Projector bulb manufactured by the ABC Corporation lasts 300 days 
with variance of 2500days. By assuming that the bulb life is normally distributed,  
what is the probability that the bulb will last
(a) at most 365 days?
(b) between 250days and 350days?
(c) at least 400days?
12. The  line  width  of  a  tool   used  for  semiconductor   manufacturing  is  assumed  to  be 
normally distributed with a mean of 0.5 micrometer and a standard deviation of 0.05 
micrometer. 
(a) What is the probability that a line width is greater than 0.62 micrometer?
(b) What is the probability that a line width is between 0.47 and 0.63 micrometer?
(c) The line width of 90% of samples is below what value?
oooOOOooo
44
Chapter 4
4. Data display and summary of data
Learning objectives:
At the end of this chapter, student should be able to:
 Explain the different between population and sample
 Find the sample mean, sample variance and sample standard deviation
 Plot data using stem and leaf display
 Construct the Box-Plot
4.1 Introduction
The major use of inferential statistics is to use information from a sample to infer 
something   about   a  population.   A  population  is   a   collection   of   data   whose 
properties are analyzed. The population is the complete collection to be studied; it 
contains all subjects of interest. A sample is a part of the population of interest, a 
sub-collection   selected   from   a   population.   A  parameter  is   a   numerical 
measurement that describes a characteristic of a population, while a statistic  is a 
numerical measurement that describes a characteristic of a sample. In general, we 
will use a statistic to infer something about a parameter. 
4.2 Mean and variance 
45
The mean is the sum of all numbers in the list divided by the total numbers in the 
list. If the given list is Statistical Population then the mean is called Population 
Mean and the given list is a Statistical Sample, then the mean is called Sample 
mean. The mean has an expected value of , known as the population mean. The 
sample mean makes a good estimator of the population mean, as its expected value 
which is as the same as the population mean. 
Often, since the population variance is an unknown parameter, it is estimated by 
the mean sum of squares, which changes the distribution of the sample mean from 
a normal distribution to a Student's t distribution with n  1 degrees of freedom.
The mean and the variance of population and sample mean and sample variance 
can be expressed as follows. By using the following equations we can identify the 
difference.
Population Mean and Variance are defined as:
2
1
2 1
) (
1
Variance Mean         
N
i
i
N
i
i
x
N
N
x
where N is the size of the Population.
Sample Mean and sample variance are defined as:
2
1
2 1
) (
1
1
Variance Mean  x x
n
s
n
x
x
N
i
i
n
i
i
 
 
+ + + + +
n
x
x
n
x x
n
s
n
x
x
n
i
i n
i
i
n
i
i
n
i
i
4.3 A Stem and Leaf plot
Data can be shown in a variety of ways including graphs, charts and tables. A 
Stem and Leaf plot is a type of graph that is similar to a histogram but shows 
more information. The Stem-and-Leaf plot summarizes the shape of a set of data 
(the distribution) and provides extra detail regarding individual values. 
The data is arranged by place value. The digits in the largest place are referred to 
as the stem and the digits in the smallest place are referred to as the leaf (leaves).  
The leaves are always displayed to the left of the stem. Stem and Leaf plots are 
great organizers for large amounts of information. It provides an at a glance tool 
for specific information in large sets of data, otherwise one would have a long of 
marks to sift through and analyze. The totals of data, median and mode are also 
can be determined by Stem and Leaf plots. They are usually used when there are 
large amounts of numbers or data to analyze. Series of scores on sports teams, 
series of temperatures or rainfall over a period of time, series of classroom test 
scores are examples of when Stem and Leaf plots could be used. 
Example 4.2:
The following data is the temperatures for August in Malaysia.
77 80 82 68 65 59 61
55 50 62 61 70 69 64
65 70 62 65 65 75 76
47
85 80 82 83 79 79 71
80 77 89
Use the Stem and Leaf plot to determine the mode and the median for the 
temperatures.
Solution:
First step should be to place the numbers in order from smallest to the largest.
The mode is 65 and the median is 70.
4.4 A Box plot
In descriptive statistics, a box plot or (also known as a box-and-whisker diagram) 
is an excellent visual summary of many important aspects of a data distribution 
through   their  five-number   summaries:   the   smallest   observation   (sample 
minimum),   lower  quartile  (Q1),  median  (Q2),   upper  quartile  (Q3),   and  largest 
observation (sample maximum). A box plot may also indicate which observations, 
if any, might be considered outliers. Box plot can be drawn either horizontally or 
vertically.
4.4.1 Construct a box plot
Step 1: Place the numbers in order from smallest to the largest.
Step 2: Find the median, Q2, the lower quartile, Q
2
 and the upper quartile, Q
3
 of a 
given set of data.
Step 3: Find the interquartile range (IQR). The IQR is the difference between the 
upper quartile and the lower quartile.
48
Temperatures
Tens   Ones
5     0 5 9
6 1 1 2 2 4 5 5 5 5 8 9
7     0 0 1 5 6 7 7 9 9
8     0 0 0 2 2 3 5  9
Step 4: Start to draw the Box-plot either horizontally or vertically.
Step  5:   Calculate  the  1.5IQR  and  determine  the  range  of  1.5IQR  from  upper 
quartile  and  the  lower  quartile.   The  value(s)  that   place  outside  of  the 
1.5IQR range called the outlier(s). The value(s) that place outside of the 
3IQR range called the extreme outlier(s).
Example 4.3
Suppose that thirty UTP students live in Village 2. These are the following ages:
18, 20, 21, 26, 24, 19, 25, 20, 22, 21, 
19, 24, 25, 28, 24, 20, 26, 20, 35, 17, 
18, 24, 20, 21, 22, 27, 25, 28, 27, 24.
Step 1: Place the numbers in order from smallest to the largest.
17, 18, 18, 19, 19, 20, 20, 20, 20, 20, 
21, 21, 21, 22, 22, 24, 24, 24, 24, 24, 
25, 25, 25, 25, 26, 26, 27, 27, 28, 35.
Step 2: Find the median, Q2, the lower quartile, Q
2
 and the upper quartile, Q
3
 of a 
given set of data.
The median, Q
2
 = (X
15
 + X
16
)/2 = (22+24)/2=23
The position of Q
1
 = (0.25) (n+1) = 0.25(31) = 7.75
So the lower quartile, Q
1
 is X
7
 + 075(X
8
-X
7
) =20 + 0.75(20-20)=20
The position of Q
3
 = (0.75) (n+1) = 0.75(31) = 23.25
So the upper quartile, Q
3
 is X
23
 + 0.25(X
24
-X
23
) =25+0.25(25-25) = 25
49
Step 3: The interquartile range (IQR) = Q
3
 - Q
1
 = 25  20 = 5
The 1.5IQR = 7.5 and 3IQR = 15
Step 4: Start to draw the Box-plot either horizontally or vertically.
                                                                                                                        outlier
            17                                                                                         28              o35
                                             
                                         
                                               
                                  Q
1
 =20                     Q
2
 =23                     Q
3
=25
12.5.<1.5IQR>< ------------..IQR=5          ----------------><1.5IQR>.32.5
Exercise 4:
1. Find the mean, median and mode for the following observations:
6.5     7.8     4.6     3.7    6.5   9.2     12.1     6.5     3.7     10.8
   
2. Find the mean, median and mode for the following observations:
2.3   3.6    2.6    2.8    3.2    3.6    4.3    5.2    6.9    2.8   3.6
3. Seven  oxide  thickness  measurements   of  wafers  are  studied  to  assess  quality  in  a 
semiconductor manufacturing process. The data (in angstroms) are: 1264, 1280, 1301, 
1300,   1292,   1307,   and  1275.   Calculate  the  sample  average,   variance  and  standard 
deviation. 
4. The  following  data  are  direct   solar  intensity  measurements  (watts/m
2
)  on  different 
days at a location in southern Spain: 562, 869, 708, 775, 775, 704, 809, 856, 655, 806, 
878, 909, 918, 558, 768, 870, 918, 940, 946, 661, 820, 898, 935, 952, 957, 693, 835, 
50
905,   939,   955,   960,   498,   653,   730,   753.   Calculate  the  sample  mean,   variance  and 
sample standard deviation. 
5. Find the mean, variance and standard deviation of the following samples of marks for 
the probability and statistics final examination.
84.9 81.9  80.8  79.4  78.2  76.5
75.0 73.8  72.7  72.6  71.4  70.9
69.3 68.6  67.5  66.8  65.2  64.4
59.5 58.3  58.5  57.6  56.9  55.2  
48.2 48.0  47.8  46.5  45.9  44.6
38.3 37.4  36.8  36.5  35.6  34.9
38.4
6. Find the mean, variance and standard deviation of the following samples of marks for 
the engineering drawing course.
98.4 98.1  98.0  97.8  96.4  95.2  94.3  92.6  91.8  90.5
89.6 88.7  87.3  86.8  85.7  84.2.  83.7  82.8 80.5  80.8
79.7   78.2  77.4  77.4  76.8  75.9  74.2  73.9  72.6  71.4
69.8   68.6  67.5  66.8  65.2  64.4  63.7  62.8  61.4  60.7
59.2   58.3  58.5  57.6  56.9  55.2  54.7  53.9  52.9  51.2
59.6   48.0  47.8  46.5  45.9  44.6  43.8  42.7  41.8  40.6
39.8   37.4  36.8  36.5  35.6  34.9  33.2  33.8  32.7  31.6
7.The shear strengths of 100 spot welds in a titanium alloy follow. Construct a stem-and-
leaf diagram for the weld strength data and comment on any important features that 
you notice. 
540
8
543
1
547
5
544
2
537
6
538
8
545
9
542
2
541
6
543
5
542
0
542
9
540
1
544
6
548
7
541
6
538
2
535
7
538
8
545
7
540
7
546
9
541
6
537
7
545
4
537
5
540
9
545
9
544
5
542
9
546
3
540
8
548
1
545
3
542
2
535
4
542
1
540
6
544
4
546
6
539
9
539
1
547
7
544
7
532
9
547
3
542
3
544
1
541
2
538
4
544
5
543
6
545
4
545
3
542
8
541
8
546
5
542
7
542
1
539
6
538
1
542
5
538
8
538
8
537
8
548
1
538
7
544
0
548
2
540
6
540
1
541
1
539
9
543
1
544
0
541
3
540
6
534
2
545
2
542
0
51
545
8
548
5
543
1
541
6
543
1
539
0
539
9
543
5
538
7
546
2
538
3
540
1
540
7
538
5
544
0
542
2
544
8
536
6
543
0
541
8
(a) Construct a stem-and-leaf display for these data.
(b) Find the median, the quartiles, and the 5th and 95th percentiles.
8. The data that follow represent the yield on 90 consecutive batches of ceramic 
substrate to which a metal coating has been applied by a vapor-deposition process. 
94.1 87.3 94.1 92.4 84.6 85.4
93.2 84.1 92.1 90.6 83.6 86.6
90.6 90.1 96.4 89.1 85.4 91.7
91.4 95.2 88.2 88.8 89.7 87.5
88.2 86.1 86.4 86.4 87.6 84.2
86.1 94.3 85.0 85.1 85.1 85.1
95.1 93.2 84.9 84.0 89.6 90.5
90.0 86.7 78.3 93.7 90.0 95.6
92.4 83.0 89.6 87.7 90.1 88.3
87.3 95.3 90.3 90.6 94.3 84.1
86.6 94.1 93.1 89.4 97.3 83.7
91.2 97.8 94.6 88.6 96.8 82.9
86.1 93.1 96.3 84.1 94.4 87.3
90.4 86.4 94.7 82.6 96.1 86.4
89.1 87.6 91.1 83.1 98.0 84.5
(a) Construct a cumulative frequency plot and histogram for the yield 
(b) Construct a stem-and-leaf display for these data. 
(c) Find the median, the quartiles, and the 5th and 95th percentiles for the yield
9. The average age of the football players on each team of the premier league as follows.
29.4 29.8 29.4 31.8 32.7 34.0
28.5 27.9 30.9 29.3 28.8 28.6
29.1 31.0 30.7 30.3 29.7 31.0
28.4 28.9 27.7 28.7 30.5 29.8
26.6 27.9 27.9 29.9 29.3 28.1
(a) Construct a cumulative frequency plot and histogram for the yield
(b)  Construct a stem-and-leaf display for these data.
52
(c)  Find the median, the quartiles, and the 5th and 95th percentiles for the yield
10. The following  cold start ignition time of an automobile engine obtained for a test 
vehicle are as follows:
                 1.75   1.92    2.62     2.35     3.09     3.15    2.53     1.91
(a)Calculate the sample median, the quartiles and the IQR
(b)Construct a box plot of the data.
11. The following data are the joint temperatures of the O-rings (F) for each test firing or 
actual launch of the space shuttle rocket motor (from Presidential Commission on the 
Space Shuttle Challenger Accident, Vol. 1, pp. 129131): 84, 49, 61, 40, 83, 67, 45, 
66, 70, 69, 80, 58, 68, 60, 67, 72, 73, 70, 57, 63, 70, 78, 52, 67, 53, 67, 75, 61, 70, 81, 
76, 79, 75, 76, 58, 31. 
(a) Compute the sample mean and sample standard deviation;
(b) Calculate the median, the quartiles and the IQR;
(c) Construct a box plot of the data and comment on the possible presence of outliers.
12. Ipoh Pantai Hospital compiles data on the length of stay by patients in short-term 
hospitals. A random sample of 28 patients yielded the following data on length of 
stay, in days.
3 6     15     7     3    55    1
4 4     12    18    9     6    12
5 10   13     7     1    23    9
6  8    11     9     4    21   10
(a) Compute the sample mean and sample standard deviation;
(b) Calculate the median, the quartiles and the IQR;
(c) Construct a box plot of the data and comment on the possible presence of outliers.
 
oooOOOooo
Chapter 5
5. Random sample, central limit theorem and Normal Approximation;
Statistical process control
Learning objectives:
At the end of this chapter, student should be able to:
 Define the random sample and sample mean
53
 Use the Central Limit Theorem to define the sample mean distribution
 Define the Normal Approximation to Binomial and Poisson distribution
 Construct the X-bar chart and R chart in statistical process control
5.1 Random sample and sample mean
In statistical terms, a random sample is a set of independent random variables X
1
,  
X
2
,   ,   X
n  
  that   have  been  drawn  from  a  population  in  such  a  way  that   each 
random variable was selected has the same distribution and has the same chance 
of being selected.
Sample  mean  is  the  average  of   the  sample.   If   we  have  n  observation  in  one 
sample, the sample mean is the total of the observation divide by the number of 
sample size, n. 
5.2 Central Limit Theorem and sample mean distribution
Central limit theorem says that if the sample size is large, and  a random sample is 
a set of independent random variables X
1
, X
2
, , X
n 
 has a normal distribution with 
mean,    and variance,  
2
 then the sample mean,   X is also normally distributed 
with mean,  and variance, 
2
/n. That is
) / , ( ~
2
n N X    
Example 5.1:
At chemical engineering department, Universiti Teknologi PETRONAS, the mean 
age  of  the  students  is  20.6  years  old,   and  the  variance  is  20  years.   A  random 
sample of 80 students is drawn from 250 students. What is the probability that the 
average age of these students is greater than 22 years old? 
Solution:
0808 . 0 9192 . 0 1 ) 4 . 1 ( 1                       
) 4 . 1 ( 1 ) 4 . 1 ( )
25 . 0
6 . 20 22
( ) 22 ( So,
) 25 . 0   , 6 . 20 ( ~                Hence,
25 . 0
80
20
) ( and 6 . 20 ) ( of mean  the , 80 For
20 ) ( of variance the and   6 . 20 ) ( of mean  The
2
     
   > 
>  >
     
   
Z P Z P Z P X P
N X
n
X V X E X n
X V X X E X 
,
_
    +
 
 
  
Z P Z P
Z P X P
5.3.2 Normal approximation to Poisson
The normal distribution can also be used to approximate the Poisson distribution 
for large values of  (the mean of the Poisson distribution).
That is, if we have a random variable X ~ Poisson () and  is large than X can 
be  calculated  approximately  using  the  Normal   distribution.   It   means   that   the 
random variable  X  will be normally distributed with mean  =   and variance, 
    
2
 i.e X ~ N ( ,  ) 
Example 5.3:
A car hire firm has 20 cars to hire. The number of demands for a car is hired per 
day is a Poisson distribution with mean of 3. Calculate the probability that at most 
ten cars will be hired in one day.
Solution:
Let a random variable X denotes the number of demands for a car.
The given mean value is 3. By the Poisson distribution
20 ....., . . . . , 4 , 3 , 2 , 1 , 0 ,
!
) (     
x
x
e
x X P
x
,
_
 +
 
Z P
X
P X P
5.4 Statistical process control
Statistical process control (SPC), is a powerful tools that implement the concept 
of prevention as a shift from the traditional quality by inspection/correction. SPC 
is   a   technique   that   employs   statistical   tools   for   controlling   and   improving 
56
processes. It is an important ingredient in continuous process improvement (CPI) 
strategies.   It   uses   simple   statistical   means   to   control,   monitor,   and   improve 
processes. 
Among the most commonly used tools of SPC:
 histograms 
 cause-and-effect diagrams 
 Pareto diagrams 
 control charts 
 scatter or correlation diagrams 
 run charts 
 process flow diagrams 
The   most   important   SPC  tool   is   called   control   charts.   That   is   a   graphical 
representations   of   process   performance   over   time   concerned   with   how  (or 
whether)   processes   vary  at   different   intervals   and  identifying   nonrandom  or 
assignable causes of variation. The control charts are also providing a powerful 
analytical   tool   for  monitoring  process  variability  and  other  changes  in  process 
mean. There are two common charts use in the SPC. The  X - chart and R-chart.
The  X and Range, R Charts are a set of control charts for variables data (data 
that   is  both  quantitative  and  continuous  in  measurement,   such  as  a  measured 
dimension or time). The  X - chart monitors the process location over time, based 
on the average of a series of observations, called a subgroup. While the R-chart 
monitors the variation between observations in the subgroup over time. 
The  X - chart or R- chart are used when you can rationally collect measurements 
in groups (subgroups) of between two and ten observations. The charts' x-axes are 
time based,  so  that the charts show a  history  of the process.  The  data is  time-
ordered; that is, entered in the sequence from which it was generated. 
5.4.1 How to construct  X - chart and R-chart
In order to construct the chart, the sample mean, the average of the sub-group and 
the limits must be calculated.
The sample mean is calculated from a set of n data values as    
n
i
i
x
n
x
1
1
. 
57
The average of the subgroups data is calculated as   
   
m
j
n
i
ij
x
mn
x
1 1
1
where n is the subgroup size and m is the total number of subgroups included in 
the analysis. 
This x  is a centre line of the chart and is called the estimate process mean.
The  average  range  is  calculated  as    
m
i
i
r
m
r
1
1
,   where  r  is  range  between  the 
largest and the smallest value in each subgroup.
The upper and lower limits for the  X - chart are calculated by using the formula
r A x
2
t  where A
2
 can be find from the process control chart table.
While the upper for the R-chart is calculated by using the formula  r D
4
 and for 
lower limit using the formula 
r D
3
 where D
4
 and D
3
 can be find from the process 
control chart table.
After   the   centre   line   and   limits   are   calculated,   and   then   the   chart   can   be 
constructed  by  plotting  the  observations  of  sample  number  versus   x   for   X -
chart and the sample number versus  r  for R-chart.
Example 5.4:
A  component   part   for   a  jet   aircraft   engine  is  manufactured  by  an  investment 
casting  process.   The  vane  opening  on  this   casting  is   an  important   functional 
parameter of the part. 
We will illustrate the use of      X    and R  control charts to assess the statistical 
stability of  this  process. The table  presents 20  samples  of five  parts each.   The 
values given in the table  have  been  coded  by using  the last three digits  of the 
dimension; that is, 31.6 should be 0.50316 inch. 
Sample Number x1 x2 x3 x4 x5      
X
      r
 1 33 29 31 32 33 31.6  4
 2 33 31 35 37 31 33.4  6
 3 35 37 33 34 36 35.0  4
 4 30 31 33 34 33 32.2  4
 5 33 34 35 33 34 33.8  2
 6 38 37 39 40 38 38.4  3
 7 30 31 32 34 31 31.6  4
 8 29 39 38 39 39 36.8 10
 9 28 33 35 36 43 35.0 15
58
10 38 33 32 35 32 34.0  6
11 28 30 28 32 31 29.8  4
12 31 35 35 35 34 34.0  4
13 27 32 34 35 37 33.0 10
14 33 33 35 37 36 34.8  4
15 35 37 32 35 39 35.6  7
16 33 33 27 31 30 30.8  6
17 35 34 34 30 32 33.0  5
18 32 33 30 30 33 31.6  3
19 25 27 34 27 28 28.2  9
20 35 35 36 33 30 33.8  6
(a) Construct    X     and R control charts. 
(b) After  the  process  is  in  control,   estimate  the  process  mean  and  standard 
deviation. 
Exercise 5
1. Suppose  X
1
, X
2
, , X
20 
 is a sample from normal distribution  N (, 
2
)  with  = 5,  
 
2 
= 4. Find 
(a) Expectation and Variance of    
(b) Distribution of
2. Given that X is normally distributed with mean 50 and standard deviation 4, compute 
the following for n=25.
(a) Mean and variance of   X
(b) ) 49 (    X P
(c) ) 52 (   > X P
(d) ) 5 . 51 49 (    X P
3. Given that X is normally distributed with mean 20 and standard deviation 2, compute 
the following for n=40.
(a) Mean and variance of   X
59
X
X
(b) ) 19 (    X P
(c) ) 22 (   > X P
(d) ) 5 . 21 19 (    X P
4. Let X denote the number of flaws in a 1 in length of copper wire. The pmf of X is 
given in the following table 
X=x 0 1 2 3
P(X=x) 0.4
8
0.39 0.12 0.01
100 wires are sampled from this population. What is the probability that the average 
number of flaws per wire in this sample is less than 0.5? 
5. At a large university,  the mean age of the students is 22.3 years, and the standard 
deviation   is   4   years.   A  random  sample   of   64   students   is   drawn.   What   is   the 
probability that the average age of these students is greater than 23 years? 
6. Assuming an equal chance of a new baby being a boy or a girl, what is the probability 
that 60 or more out of the next 100 births at Pantai Hospital will be girls? 
7. If 10% of UTP students are international students, what is the probability that fewer 
than 100 in a random sample of 818 students are coming from overseas?
8. Suppose that a sample of n = 1,600 tires of the same type are obtained at random from 
an ongoing production process in which 8% of all such tires produced are defective. 
What is the probability that in such a sample 150 or fewer tires will be defective?
9. For overseas flights, an airline has three different choices on its dessert menuice 
cream, apple pie, and chocolate cake. Based on past experience the airline feels that 
each dessert is equally likely to be chosen.
(a) If a random sample of four passengers is selected, what is the probability that at 
  least two will choose ice cream for dessert?
(b) If a random sample of 21 passengers is selected, what is the approximate 
probability that at least two will choose ice cream for dessert?
10. Suppose that at a certain automobile plant, the number of work stoppage is a Poisson 
distribution with an average per day due to equipment problems during the production 
process is 12.0.What is the approximate probability of having 15 or fewer work 
stoppages due to equipment problems on any given day?
11. The number of cars arriving per minute at a toll booth on a 
particular bridge is Poisson distributed with a mean of 2.5.What is 
the probability that in any given minute
(a)no cars arrive?
60
(b)not more than two cars arrive?
If the expected number of cars arriving at the toll booth per ten-
minute interval is 
25.0, what is the approximate probability that in any given ten-
minute period
(c) not more than 20 cars arrive?
(d) between 20 and 30 cars arrive?
12. A component part for a jet aircraft engine is manufactured by an investment casting 
process. The vane opening on this casting is an important functional parameter of the 
part. We will illustrate the use of     X    and R control charts to assess the statistical 
stability of this process. The table presents 20 samples of five parts each. The values 
given in the table have been coded by using the last three digits of the dimension; that 
is, 31.6 should be 0.50316 inch. 
Sample Number   x1   x2   x3   x4   x5
      
X
 
     r
 1 33 29 31 32 33 31.6  4
 2 33 31 35 37 31 33.4  6
 3 35 37 33 34 36 35.0  4
 4 30 31 33 34 33 32.2  4
 5 33 34 35 33 34 33.8  2
 6 38 37 39 40 38 38.4  3
 7 30 31 32 34 31 31.6  4
 8 29 39 38 39 39 36.8 10
 9 28 33 35 36 43 35.0 15
10 38 33 32 35 32 34.0  6
11 28 30 28 32 31 29.8  4
12 31 35 35 35 34 34.0  4
13 27 32 34 35 37 33.0 10
14 33 33 35 37 36 34.8  4
15 35 37 32 35 39 35.6  7
61
16 33 33 27 31 30 30.8  6
17 35 34 34 30 32 33.0  5
18 32 33 30 30 33 31.6  3
19 25 27 34 27 28 28.2  9
20 35 35 36 33 30 33.8  6
(a) Construct    X     and R control charts. 
(b) After the process is in control, estimate the process mean and standard deviation. 
13. The overall length of a skew used in a knee replacement device is monitored using 
and  R  charts.   The   following   table   gives   the   length   for   20   samples   of   size   4. 
(Measurements are coded from 2.00 mm; that is, 15 is 2.15 mm.) 
  Observation   Observation
Sample 1 2 3 4 Sample 1 2 3 4
1 1
6
1
8
1
5
13 11 1
4
1
4
1
5
1
3
2 1
6
1
5
1
7
16 12 1
5
1
3
1
5
1
6
3 1
5
1
6
2
0
16 13 1
3
1
7
1
6
1
5
4 1
4
1
6
1
4
12 14 1
1
1
4
1
4
2
1
5 1
4
1
5
1
3
16 15 1
4
1
5
1
4
1
3
6 1
6
1
4
1
6
15 16 1
8
1
5
1
6
1
4
7 1
6
1
6
1
4
15 17 1
4
1
6
1
9
1
6
 8 1
7
1
3
1
7
16 18 1
6
1
4
1
3
1
9
 9 1
5
1
1
1
3
16 19 1
7
1
9
1
7
1
3
10 1
5
1
8
1
4
13 20 1
2
1
5
1
2
1
7
(a) Using all the data, find trial control limits for  and R charts, construct the chart,
    and plot the data.
(b) Use the trial control limits from part (a) to identify out-of-control points. If 
necessary, revise your control limits, assuming that any samples that plot outside 
the control limits can be eliminated.
(c) Assuming that the process is in control, estimate the process mean and process 
standard deviation. 
62
14. The thickness of a printed circuit board (PCB) is an important quality parameter. Data 
on board thickness (in cm) are given below for 25 samples of three boards each.
Sample 1 2 3 Sample 1 2 3
1 0.0629 0.0636 0.0640 14 0.0645 0.0640 0.0631
2 0.0630 0.0631 0.0622 15 0.0619 0.0644 0.0632
3 0.0628 0.0631 0.0633 16 0.0631 0.0627 0.0630
4 0.0634 0.0630 0.0631 17 0.0616 0.0623 0.0631
5 0.0619 0.0628 0.0630 18 0.0630 0.0630 0.0626
6 0.0613 0.0629 0.0634 19 0.0636 0.0631 0.0629
7 0.0630 0.0639 0.0625 20 0.0640 0.0635 0.0629
 8 0.0628 0.0627 0.0622 21 0.0628 0.0625 0.0616
 9 0.0623 0.0626 0.0633 22 0.0615 0.0625 0.0619
10 0.0631 0.0631 0.0633 23 0.0630 0.0632 0.0630
11 0.0635 0.0630 0.0638 24 0.0635 0.0629 0.0635
12 0.0623 0.0630 0.0630 25 0.0623 0.0629 0.0630
13 0.0635 0.0631 0.0630
(a) Using all the data, find trial control limits for  and R charts, construct the chart,
and plot the data.
(b) Use the trial control limits from part (a) to identify out-of-control points. If 
necessary, revise your control limits, assuming that any samples that plot outside 
the control limits can be eliminated.
(c) Assuming that the process is in control, estimate the process mean and process   
         standard deviation. 
oooOOOooo
Chapter 6
6. Hypothesis Testing  One population
63
Learning objectives:
At the end of this chapter, student should be able to:
 Explain the concept of hypothesis testing
 Understand the procedure or steps to perform the test 
 Do a testing about the mean when the population variance is known and is 
unknown
 Do a testing about the proportion
 Perform the testing about the variance
6.1 Introduction
There are two types of statistical inferences: estimation of population parameters 
and hypothesis testing. Hypothesis testing is one of the most important tools of 
application of statistics to real life problems. Most often, decisions are required to 
be made concerning populations on the basis of sample information.  Statistical 
tests are used in arriving at these decisions.
Statistical   hypotheses  are  based  on  the  concept   of  proof  by  contradiction.   For 
example, say, we test the mean () of a population to see if an experiment has 
caused  an  increase  or  decrease  in  .   We  do  this  by  proof  of  contradiction  by 
formulating a null hypothesis against alternative hypothesis. 
6.1.1 Null Hypothesis: 
It is a hypothesis which states that there is no difference between the procedures 
and is denoted by H
0
. For the above example the corresponding H
0
 would be that 
there has been no increase or decrease in the mean. Always the null hypothesis is 
tested, i.e., we want to either accept or reject the null hypothesis because we have 
information only for the null hypothesis.
6.1.2 Alternative Hypothesis: 
It is a hypothesis which states that there is a difference between the procedures 
and is denoted by H
1
.
In hypothesis testing there will be a correct decision or false decision would be 
made on the null hypothesis as summarize in this table.
Suppose     Accept H
0
 as true Reject H
0
 as false
64
H
0
 is true
Correct decision. Probability: 1 
- 
Type I error. Probability: 
H
0
 is false Type II error. Probability: 
Correct decision. 
Probability: 1   
We will make a correct decision if we accept H
0
 when H
0
 is true or we will reject 
H
0
 when actually H
0
 is false.
The risk of rejecting the null hypothesis when we should not reject it is called 
type I error with probability . It means that we make a false decision because we 
reject H
0
 when actually H
0
 is true with probability . 
While  when  we  accept   H
0
  but   actually  H
0
  is  not   true,   then  we  make  a  wrong 
decision and this decision is called type II error is. The probability of type II error 
is  .  We  cannot   determine    (beta)   with  the  statistical   tools  you  learn  in  this 
course.
The probability type I error,  is called the level of significance and (1- )100% 
is called the confidence level of the test and (1  ) is called the "power" of the 
test.
6.1.3 Types of test
In hypothesis testing there are three types of test on any parameters of interest 
called two tailed (sided) test, upper tailed test and lower tailed test such as in the 
table below.
Type Null 
Hypothesis, H0
Alternative 
Hypothesis, H1
1
0
  
0
  Two tailed test
2
0
 
0
  > Upper tailed test
3
0
 
0
  < Lower tailed test
6.1.4 Test statistics
It   is  the  random  variable  X  whose  value  is  tested  to  arrive  at   a  decision.   In 
hypothesis testing the right choice of test statistics is essential.  Among the test 
statistics   in  hypothesis   testing  are   Z-statistics,   T-statistics,  
2
-statistic   and  F-
statistic. The choice of this test statistics is depend on the parameter of interest 
that we want to test. The Central Limit Theorem states that for large sample sizes 
(n > 30) drawn randomly from a population, the distribution of the means of those 
65
samples will approximate normality, even when the data in the parent population 
are  not   distributed  normally  but   the  population  variance  is   known  then  a  Z-
statistic  is  usually  used  for  large  sample  sizes  (n  >  30).   However,   often  large 
samples are not easy to obtain and the population variance is unknown, then the t-
distribution can be used. The population standard deviation  is estimated by the 
sample   standard   deviation,  s.   For   test   the   population   variance   or   standard 
deviation, the 
2
-statistic is used.  In case of performing multiple comparisons by 
one way ANOVA, the F-statistic is normally used. 
6.1.5 Rejection region
It is the part of the sample space (critical region) where the null hypothesis H
0
 is 
rejected. The size of this region is determined by the probability () of the sample 
point falling in the critical region when H
0
 is true.  is also known as the level of 
significance, the probability of the value of the random variable falling in the 
critical region. Also it should be noted that the term "Statistical significance" 
refers only to the rejection of a null hypothesis at some level . It implies that the 
observed difference between the sample statistic and the mean of the sampling 
distribution did not occur by chance alone.
6.1.6 Decision making
If the test statistic falls in the rejection/critical region, then we may conclude that 
H
0
  is rejected, it means that there are enough evidence to support the alternative 
hypothesis.  Otherwise  we fail  to reject  H
0
  means that  there are no  evidence to 
support the claim that the H
1
 is true.
6.1.7 Steps to do the test
In hypothesis testing, there are seven steps to perform any statistical test:
(i)  Identify the parameter of interest. 
(ii) State the hypothesis: Null Hypothesis and Alternate Hypothesis
(iii) Determine the appropriate Test Statistic
(iv) Determine the critical value 
(v) Determine the Rejection/Critical Region or P-value or 100(1-)% confidence 
      intervals 
(vi) Calculate the Test Statistic 
(vii) Make a decision or conclusion based on step (v).
6.2 Testing about the mean for large sample size and the variance is known
If  the  parameter   of  interest   is  to  test   about   the  mean  for  population  when  the 
variance is known or the sample size is very large, then the test can be performed 
as below:
66
Step 1: To test about the mean and population variance is known
Step 2: 0 1 0 1 0 1 0 0
: or : or : versus :             < >   H H H H
 
Step 3: Test statistic:  ) 1 , 0 ( ~
/
0
0
N
n
x
Z
  2 / 2 /
  +   
Upper tailed test, 
0
  >
n
Z x
  
   2 /
+    
67
Lower tailed test,  0
  <
      
n
Z x
2 /
Step 6:  Calculate the test statistics, Z in step 3.
Step 7:  Decision: To make a conclusion based on the criteria in step 5.
Example 6.1:
Test the hypothesis that the mean age of UTP students is less than 21, given a 
random sample of 20 individuals who have a mean of 20 and assume that the age 
is normally distributed with variance of 20.
i. Test the hypothesis that the mean age is less than 21. Use alpha = 0.05.
ii. What is the P-value for this test?
iii. Construct 95% two-sided CI on the mean strength.
iv. Use the CI found in part (iii) to test the hypothesis.
Solution:
i.     (1) The parameter of interest is to test the true mean age of UTP students, .
              (2) The hypothesis Testing:
21 : 21 :
0 1 0 0
  <      H vs H
      (3)   The test statistics is:
n
x
z
/
0
0
 
 
z
x   
      (7)  Result and conclusion:
68
Since z
o
  > -1.65, so we failed to reject  H
0  
and conclude that not enough 
evidence to say the true mean age of UTP students is less than 21 years 
old at  = 0.05.
ii. The P-value for this test is [(-1)]=0.159. Since P-value > 0.05, then we failed 
to reject H
0
.
iii. A 95% two-sided CI on mean strength is
96 . 21 04 . 18               
20
20
96 . 1 20
20
20
96 . 1 20
025 . 0 025 . 0
 
,
_
,
_
,
_
+   
,
_
n
z x
n
z x
Since 21 is in the interval, so we failed to  reject H
0
 and conclude that not 
enough evidence to say the true mean age of UTP students is less than 21 
years old at  = 0.05.
6.3 Testing about the mean when the population variance is unknown
If  the  parameter   of  interest   is  to  test   about   the  mean  for  population  when  the 
variance is unknown or the sample size is small, then to perform the test same as 
for variance is known. Except that in step 3, instead of using the Z statistic, now 
the test statistic will be replaced by T  statistic and   will be replaced by s  the 
sample standard deviation.
Step 1: To test about the mean and population variance is known
Step 2: 0 1 0 1 0 1 0 0
: or : or : versus :             < >   H H H H
 
Step 3: Test statistic: 
0 1
0
if , ~
/
   
   n
t
n s
x
T
Step 4: The critical value, at  significant level is t
, n-1
 for one tailed (sided) test 
and         t
/2, n-1
 for two tailed (sided) test. 
Step 5: i.  The critical region:
Alternative Hypothesis, H
1
Rejection Criteria (Reject H
0
) IF
69
Two tailed test,  0
  
1 , 2 / 1 , 2 /
or
   
   < >
n n
t T t T
   
Upper tailed test,  0
  >
1 ,   
>
n
t T
  
Lower tailed test,  0
  <
1 ,   
 <
n
t T
  
ii.    P-Value approach:
Alternative Hypothesis, H
1 Reject H
0
 IF P < 
Two tailed test,  0
   P-value = 2P(T
n-1
 > |t|)
Upper tailed test,  0
  > P-value = P(T
n-1
 > t)
Lower tailed test,  0
  < P-value = P(T
n-1
  < t)
iii. 100(1-)% Confidence Intervals:
Alternative   Hypothesis, 
H
1
Reject H
0
 IF 
o
 falls outside of the 
interval
Two tailed test,  0
  
n
s
t x
n
s
t x
n n 1 , 2 / 1 , 2 /    
  +   
   
  
Upper tailed test,  0
  >
n
s
t x
n 1 , 2 /   
+    
  
n
s
t x
n 1 , 2 /
Step 6:  Calculate the test statistics, T in step 3.
Step 7:  Decision: To make a conclusion based on the criteria in step 5.
Example 6.2:
70
A  practical   brand  of   diet   margarine   was   analyzed  to  determine   the  level   of 
polyunsaturated fatty acid (in percent). A sample of six packages resulted in the 
following data: 16.8, 17.2, 17.4, 16.9, 16.5 and 17.1.
i. Using the P-value approach, test the hypothesis that the mean  is not 17.0, 
ii.   Construct 95% two-sided CI on the mean.
iii. Use the CI found in part (ii) to test the hypothesis.
[4 marks]
Solution:
i.     (1) The parameter of interest is the true mean compressive strength, , 
 variance unknown
              (2) The hypothesis Testing:
17 : 17 :
0 1 0 0
        H vs H
       (3) The test statistics is:
n s
x
t
/
0
0
   
 
 
t
s x
From t-table, t
0
 = 0.1537 with 5 df is fall < 0.267 for which  > 0.4,so the 
P-value > 2(0.4) = 0.8 
      (7)  Result and conclusion:
Since P-value > 0.05, then we fail reject H
0 
and we conclude that the true 
mean is 17 at  = 0.05.
ii. A 95% two-sided CI on mean strength is
71
3146 . 17 645 . 16
6
3188 . 0
571 . 2 98 . 16
6
3188 . 0
571 . 2 98 . 16
5 , 025 . 0 5 , 025 . 0
 
,
_
+   
,
_
,
_
+   
,
_
n
s
t x
n
s
t x
iii. Since 17 is fall in the interval, so we fail to reject  H
0
  and conclude the true 
mean is 17at  = 0.05.
6.4  Testing about the proportion
In hypothesis testing, the procedure to test about the proportion of the population 
is the same as the procedure to test about the mean when the population variance 
is known. 
Step 1: To test about the population proportion 
Step 2:
0 1 0 1 0 1 0 0
: or : or : versus : p p H p p H p p H p p H   < >  
 
Step 3: Test statistic: 
n
X
p p p N
n p p
p p
Z    
  where , if , ) 1 , 0 ( ~
/ ) 1 (
0
0 0
0
0
Step 4: The critical value, at  significant level is Z
2
 test. The steps are following:
Step 1: To test about the population proportion 
Step 2:
2
0
2
1
2
0
2
1
2
0
2
1
2
0
2
0
: or : or : versus :             < >   H H H H  
Step 3: Test statistic: 
73
2
0
2 2
1 2
2
2
if , ~
) 1 (
  
   n
s n
Step 4: The critical value, at  significant level is Z
>
n 
 
Lower tailed test, 
2
0
    <
2
1 , 1
2
 
<
n 
 
ii. P-Value approach:
Alternative Hypothesis, H
1 Reject H
0
 IF P < 
Two tailed test, 
2
0
2
     P-value = 2P(
2
n-1
 > |
2
|)
Upper tailed test, 
2
0
2
    > P-value = P(
2
n-1
 > 
2
)
Lower tailed test, 
2
0
    < P-value = P(
2
n-1
  < 
2
)
iii. 100(1-)% Confidence Intervals:
Alternative Hypothesis, H
1 Reject H
0
 IF 
o
 falls outside of 
the interval
Two tailed test, 
2
0
2
    
2
1 , 2 / 1
2
2
2
1 , 2 /
2
) 1 ( ) 1 (
  
n n
s n s n
 
  
n
s n
74
Lower tailed test, 
2
0
    <
2
1 , 2 /
2
2
) 1 (
n
s n
H
vs
H
             
              (3)   Test statistics is:
2
0
2
2
0
) 1 (
s n
    n n
s n s n
Since =0.3 is outside of the interval, then we reject the null hypothesis 
and   conclude   that   the   engineers   claim  is   true   at   the   0.05   level   of 
significance.
oooOOOooo
Chapter 6
1.A manufacturer of sprinkler systems used for fire protection in office buildings claims
76
that the true average system- activation temperature is 1300. A sample of 9 systems 
when tested yields an average activation temperature of 131.080F. If the distribution 
of activation times is normal with standard deviation 1.50F, does the data contradict 
the firms claim at level of significance a  = 0.01. What is the P-value for this test?
2. A  random  sample of 50 battery  packs  is  selected and subjected to  a  life  test.  The 
average life of these batteries is 4.05 hours. Assume that the battery life is normally 
distributed with standard deviation equals 0.2 hour. Is there evidence to support the 
claims that mean battery life exceeds 4 hours? Use a = 0.05. What is the P-value for 
this test?
3. The flow discharge of Perak River (measured in m
3
/s) was obtained at random. 40 
readings were collected and the mean flow discharge was found to be 3.815m
3
/s with 
a standard deviation of 0.5m
3
/s. 
(a) Test the hypothesis that mean flow discharge at Perak River is not equal to 4m
3
/s . 
Use  =0.05;
(b) Use the P-value approach to test the hypothesis null.
(c) Construct a 95% two-sided CI on mean flow discharge. What is conclusion?
4. A  civil   engineer   is   analyzing  the  compressive  strength  of   concrete.   Compressive 
strength is approximately normally distributed with variance 
2
 = 1000psi
2
. A random 
sample of 12 specimens has a mean compressive strength of  x =3255.42 psi. 
(a) Test the hypothesis that mean compressive strength is 3500psi. Use  =0.01;
(b) What is the smallest level of significance at which you would be willing to reject 
the null hypothesis?;
(c) Construct a 95% two-sided CI on mean compressive strength; and
(d) Construct a 99% two-sided CI on mean compressive strength. Compare the width 
of this  confidence  interval  with  the width  of  the one  in  part (c).   What  is  your 
comment?
5. A new process for producing synthetic diamonds can be operated at a profitable level 
only if the average weight of the diamonds is greater than 0.5 karat. To evaluate the 
profitability of the process, six diamonds are generated with recorded weights, 0.46, 
0.61, .52, .48, .57 and .54 karat. 
(a) At 5% significance level Do the six measurements present sufficient evidence that 
the average weight of the diamonds produced by the process is in excess of .05 
karat? 
(b) Use the P-value approach to test the hypothesis null.
(c) Construct a 95% CI on the average weight of diamonds.
6. One of the Cigarette Company claims that their cigarettes contain an average of only 
10mg of tar. A random sample of 25 cigarettes shows the average tar content to be 
12.5 mg with standard deviation of 4.5mg. 
(a) Construct   a   hypothesis   test   to  determine   whether   the   average   tar   content   of 
cigarettes exceeds 10mg. using the P-value approach;
(b) Construct a 95% two-sided CI on the average tar content of cigarettes.
77
7. Regardless of age, about 20% of Malaysian adults participate in fitness activities at 
least twice a week. In a local survey of 100 adults over 40 years old, a total of 15 
people indicated that they participated in a fitness activity at least twice a week. 
(a) Do these data indicate that the participation rate for adults over 40 years of age is 
significantly less than 20%? Carry out a test at 10% significance level and draw 
appropriate conclusion. 
(b) Construct  a 95% two-sided CI on the participation rate.
8. A  survey  done  one  year   ago  showed  that   45%  of   the  population  participated  in 
recycling programs. In a recent poll a random sample of 1250 people showed that 588 
participate in recycling programs.
(a) Test   the   hypothesis   that   the   proportion  of   the   population  who  participate   in 
recycling programs is greater than it was one year ago. Use a 5% significance 
level.
(b) Construct a 95% two-sided CI on the proportion.
9. A Ipoh city council member gave a speech in which she said that 18% of all private 
homes  in  the  city  had  been  undervalued  by  the  county  tax  assessors  office.   In  a 
follow-up story the local newspaper reported that it had taken random sample of 91 
private homes.   Using professional  evaluator to  evaluate the  property and  checking 
against county tax records it found that 14 of the homes had been undervalued. 
(a) Does this data indicate that the proportion of private homes that are undervalued 
     by the county tax assessor is different from 18%? Use a 5% significance level.
(b) Construct a 95% two-sided CI on the proportion.
10. Engineers   designing  the  front-wheel-drive  half   shaft   of   a  new  model   automobile 
claim that the variance in the displacement of the constant velocity joints of the shaft 
is less than 1.5 mm. 20 simulations were conducted and the following results were 
obtained,                     and s = 1.41.  
(a) At  = 0.05, do these data support the claim of the engineers? 
(b) What is the P-value for this test?
(c) Construct a two-sided CI for .
11. An  Aerospace Engineers claim  that  the standard  deviation of the  percentage in  an 
alloy used in aerospace casting is greater than 0.3. 51 parts were randomly selected 
and the sample standard deviation of the percentage in an alloy used in aerospace 
casting is  s =0.37.  
(a) At  = 0.05, do these data support the claim of the engineers? 
(b) What is the P-value for this test?
(c) Construct a 95% two-sided CI for . What is conclusion?
12. The scientists claim that the variance of sugar content of the syrup in canned peaches 
thought to be 18 mg
2
. From a random sample of 10 cans yields a sample deviation of 
4.8mg.
(a) At  = 0.05, do these data support the claim of the scientists?
78
39 . 3  x
(b) What is the P-value for this test?
(c) Construct a 95% two-sided CI for . What is the conclusion?
oooOOOooo
Chapter 7
7. Hypothesis Testing  Two populations
79
Learning objectives:
At the end of this chapter, student should be able to:
 Understand the procedure or steps to perform the test for two populations.
 Do   a   testing   about   the   different   between   the   two   mean   when   the 
populations variance are known.
 Do   a   testing   about   the   different   between   the   two   mean   when   the 
populations variance are unknown but assume to be equal.
 Do   a   testing   about   the   different   between   the   two   mean   when   the 
populations variance are unknown but assume to be not equal.
 Do a testing about the different between the two proportions.
 Perform the testing about the different between the two variances.
7.1 Introduction
In hypothesis testing for two populations, the procedure or method is the same as 
in hypothesis testing for one population. But now we want to test the different 
between two parameters of interest of populations. For example, we want to test 
about the different between the two mean of the two populations, 
1
 and 
2
 or to 
test the different between two proportions of the two populations, p
1
 and p
2
.
7.1.1 Types of test
In this hypothesis testing there are three types of test on any parameters of interest 
called two tailed (sided) test, upper tailed test and lower tailed test such as in the 
table below.
Type Null 
Hypothesis, H0
Alternative 
Hypothesis, H1
1 0
2 1
      0
2 1
      Two tailed test
2 0
2 1
      0
2 1
  >    Upper tailed test
3 0
2 1
      0
2 1
  <    Lower tailed test
7.1.3 Steps to do the test
In hypothesis testing for two populations, the procedure to perform the test is the 
same as in the hypothesis testing for one population. 
(i)  Identify the parameter of interest. 
80
(ii) State the hypothesis: Null Hypothesis and Alternate Hypothesis
(iii) Determine the appropriate Test Statistic
(iv) Determine the critical value 
(v) Determine the Rejection/Critical Region or P-value or 100(1-)% confidence 
      intervals 
(vi) Calculate the Test Statistic 
(vii) Make a decision or conclusion based on step (v).
  7.2 Testing about the different between the two means, when the both population 
variances are known.
If the parameter of interest is to test about the different between the two means for 
two populations when both variances are known, then the test can be performed as 
below:
Step 1: To  test   about   the  different   between  two  means,  
1
  and  
2
  when  both 
population variances,  
2
1
  and 
2
2
  are known
Step 2:
0 : or 0 : or 0 : versus 0 :
2 1 1 2 1 1 2 1 1 2 1 0
  <  >                H H H H
 
Step 3: Test statistic: 
) 1 , 0 ( ~
) ( ) (
2
2
2
1
2
1
2 1
2 1
0
N
n n
x x
Z
 
 
+
  
+ + 
   +  
Upper tailed test, 
0
2 1
  >   
1
2
1
1
2
1
2 /
2 1
2 1
) (
n n
Z x x
   
 
  
  + +      
Lower tailed test, 
0
2 1
  <   
      +   ) (
2 1
1
2
1
1
2
1
2 /
2 1    
 
n n
Z x x
Step 6:  Calculate the test statistics, Z  in step 3.
Step 7:  Decision: To make a conclusion based on the criteria in step 5.
Example 7.1:
  The burning rates of two different solid-fuel propellants used in rocket 
  systems   are   being   studied.   It   is   known   that   both   propellants   have 
approximately   the   same   standard   deviation   of   burning   rate,   that   is 
3cm/second. Two random samples with the same sample size of 20 specimens 
are   tested  and  the   sample   mean  burning  rates   are   18  cm/second  and  24 
cm/second respectively.
82
i. Test the hypothesis that both propellants have the same mean burning rate, 
           using the P-value approach.
      ii.  Construct a two-sided 95% CI on the difference in means, 
1
  
2
. 
      iii. What is the practical meaning of this interval?
Solution:
(i)  (1) The parameter of interest is the difference in mean fill volume, 
1
  - 
2
  , 
variances,  
2
1
  and 
2
2
  are known
            (2) The hypothesis testing:
0 : versus 0 :
2 1 1 2 1 0
            H H
     (3)  The test statistics is:
) 1 , 0 ( ~
) (
2
2
2
1
2
1
2 1
0
N
n n
x x
z
 
+
 z
P-value = 2[1-(6.32)]=2[1-1]=0
 (7)  Result and conclusion:
Since P-vale < 0.05, then we reject H
0
 . Both propellants are not the 
same mean burning rate.
ii.  A 95% two-sided CI on the difference in means, 
1
  
2
  is
83
141 . 4 859 . 7
20
9
20
9
96 . 1 ) 24 18 (
20
9
20
9
96 . 1 ) 24 18 (
) ( ) (
2 1
2
2
2
1
2
1
025 . 0
2 1
2
2
2
1
2
1
025 . 0
2 1
    
,
_
+ +
,
_
,
_
+ +
,
_
 
n n
z x x
n n
z x x
iii. Since 
1
  
2
  0 is not in the interval then we reject H
0
. Both propellants 
are not the same mean burning rate.
7.2  Testing about the different between the two means, when the both population 
variances are unknown.
If the parameter of interest is to test about the different between the two means for 
two populations when both variances are unknown, then the test can be classified 
into two cases. First case we assume that the populations variances are the same, 
2
2
2
1
        and second case is we assume that the variances are not equal, 
2
2
2
1
       . 
7.2.1 First case: 
2
2
2
1
      
Testing   about   the   different   between   the   two   means,   when   the   both 
population       variances   are   unknown   but  
2
2
2
1
       .   The   test   can   be 
performed as below:
Step 1: To  test   about   the  different   between  two  means,  
1
  and  
2
  when  both 
population variances,  
2
1
  and 
2
2
  are unknown but 
2
2
2
1
        .
Step 2:
0 : or 0 : or 0 : versus 0 :
2 1 1 2 1 1 2 1 1 2 1 0
  <  >                H H H H
 
Step 3: Test statistic: 
2
) 1 ( ) 1 (
where
) 2 ( with  ,
1 1
) ( ) (
2 1
2
2 2
2
1 1 2
2 1
2 1
2 1
2 1
0
 +
 + 
 +
+
  
n n
s n s n
s
df n n
n n
s
x x
t
p
p
 
84
Step 4: The critical value, at  significant level is t
   
Upper   tailed  test, 
0
2 1
  >   
2 1
2 , 2 / 2 1
1 1
) (
2 1
n n
s t x
p n n
  + +     
   + 
 
85
Lower  tailed  test, 
0
2 1
  <   
      + 
   +
) (
1 1
2 1
2 1
2 , 2 /
2 1
   
n n
s t x
p n n
Step 6:  Calculate the test statistics, t
0
 in step 3.
Step 7:  Decision: To make a conclusion based on the criteria in step 5.
Example 7.2:
Professor Adams taught the same large lecture course for two terms. Except 
for negligible differences the two courses were the same. However, one met at 
8a.m. and the other met at 11a.m. The two courses were given final exams of 
the  same  degree  of  difficulty  and  covering  the  same  material.   Both  exams 
were worth 100 points. A random sample of 49 students from the 8a.m. class 
had an average score of 73.2 with standard deviation 8.1. A random sample of 
36  students   from  the  11:00a.m.   class   had  an  average   score  of   78.1  with 
standard deviation 10.0. Assume that the population variances are the same 
and the data are drawn from a normal distribution.
i. Does this data indicate that the mean score for the 11:00 a.m. class is 
      higher than the mean score for the 8a.m. class? Use a 5% significance    
      level.
ii. What is the P-value for this test?
iii. Construct a two-sided 95% CI on the difference in average scores.
Solution:
(i) (1) The parameter of interest is to test the mean score at 11am,  
2
 is better 
than the mean score at 8am,  
1
 .
          (2)  Hypothesis testing:
2 1 1
2 1 0
:
0 :
 
 
<
 
H
vs
H
     (3) Test statistics is:
86
2
) 1 ( ) 1 (
where
2 with  ,
1 1
) ( ) (
2 1
2
2 2
2
1 1 2
2 1
2 1
2 1
2 1
0
 +
 + 
 +
+
  
n n
s n s n
s
df n n
n n
s
x x
t
p
p
 
    (4)   Critical value 0.05, t
0.05, 83
 = 1.658
      (5)  The critical region is reject H
0
 if t
0
 < -1.658
      (6)  Computation:
Computations:
95 . 8
83
28 . 6649
2 36 49
) 10 )( 35 ( ) 1 . 8 )( 48 (
where
83 with  5 . 2
96 . 1
9 . 4
36
1
49
1
95 . 8
) 1 . 78 2 . 73 (
So
36 , 49 , 10 , 1 . 8 , 1 . 78 , 2 . 73
2 2
0
2 1 2 1 2 1
 
 +
+
     
p
s
df t
n n s s x x
            (7) Result and conclusion:
Since t
0
 < -1.658, then we have to reject H
0
 at =0.05. The mean score 
for the two tests are the same. There is enough evidence to say that test 
at 11am is better result from test at 8am.
ii. The P-value for the test:
From t-table with 83 df,  t
0
  =2.5  is between t= 2.358  and t=2.617, which 
give 0.005<p<0.01. Since P < 0.05, thus we reject H
0
 at the 0.05 level of 
significance and conclude that there is enough evidence to say that test at 
11am is better result from test at 8am.
iii. A 95% CI for the difference in mean before and after the policy change 
where t
0.025,22
 =1.98 is
87
1.0101 - 8.7899 -                                               
36
1
49
1
) 95 . 8 )( 98 . 1 ( ) 1 . 78 2 . 73 (
36
1
49
1
) 95 . 8 )( 98 . 1 ( ) 1 . 78 2 . 73 (
1 1
) ( ) (
1 1
) ( ) (
36 , 49 , 95 . 8 , 1 . 78 , 2 . 73
2 1
2 1
2 1
2 , 2 /
2 1
2 1
2 1
2 , 2 /
2 1
2 1
2 1
2 1 2 1
  
,
_
+ +
,
_
+  
+ +     +  
    
 +  +
 
 
 
   
n n
s t x x
n n
s t x x
n n s x x
p n n p n n
p
7.2.2 Second case: 
2
2
2
1
      
Testing   about   the   different   between   the   two   means,   when   the   both 
population       variances   are   unknown   but  
2
2
2
1
       .   The   test   can   be 
performed as below:
Step 1: To  test   about   the  different   between  two  means,  
1
  and  
2
  when  both 
population variances,  
2
1
  and 
2
2
  are unknown but 
2
2
2
1
        .
Step 2:
0 : or 0 : or 0 : versus 0 :
2 1 1 2 1 1 2 1 1 2 1 0
  <  >                H H H H
 
Step 3: Test statistic: 
2
2
2
1
2
1
2 1
2 1
0
) ( ) (
n
s
n
s
x x
t
+
  
   
with degrees of freedom given by,
(   )   (   )
1
/
1
/
2
2
2
2
2
1
2
1
2
1
2
2
2
2
1
2
1
,
_
n
n s
n
n s
n
s
n
s
v
Step 4: The critical value, at    significant level is t
, v
  for one tailed (sided) test 
and t
/2, v
  for two tailed (sided) test. 
Step 5: i.  The critical region:
Alternative Hypothesis, H
1
Rejection Criteria (Reject  H
0
) 
IF
88
Two tailed test,  0
2 1
     
v v
t t t t
, 2 / 0 , 2 / 0
or
   
   < >
Upper tailed test,  0
2 1
  >   
v
t t
, 0   
>
Lower tailed test,  0
2 1
  <   
v
t t
, 0   
 <
ii. P-Value approach:
lternative Hypothesis, H
1 Reject H
0
 IF P < 
Two tailed test,  0
2 1
      P-value = 2P(T
n-1
 > |t|)
Upper tailed test,  0
2 1
  >    P-value = P(T
n-1
 > t)
Lower tailed test,  0
2 1
  <    P-value = P(T
n-1
 < t)
iii. 100(1-)% Confidence Intervals:
Alternative 
Hypothesis, H
1
Reject H
0
 IF  0
2 1
       falls outside of the interval
Two   tailed   test, 
0
2 1
     
2
2
2
1
2
1
, 2 /
2 1
2 1
2
2
2
1
2
1
, 2 /
2 1
) (
) (
n
s
n
s
t x x
n
s
n
s
t x x
v
v
+ + 
   +  
   
Upper tailed test, 
0
2 1
  >   
2
2
2
1
2
1
, 2 /
2 1
2 1
) ( ) (
n
s
n
s
t x x
v
  + +      
  
 
Lower tailed test, 
0
2 1
  <   
      +   ) ( ) (
2 1
2
2
2
1
2
1
, 2 /
2 1    
n
s
n
s
t x x
v
Step 6:  Calculate the test statistics, t
0
 in step 3.
Step 7:  Decision: To make a conclusion based on the criteria in step 5.
Example 7.3:
89
Two companies manufacture a rubber material intended for use in an 
automotive  application.   The  part   will   be  subjected  to  abrasive  wear  in  the 
application, so we decide to compare the material produced by each company 
in a test. Twenty-five samples of material from each company are tested in an 
abrasion  test   and  the  amount   of   wear   after   1000  circles   is   observed.   The 
sample   mean   and   standard   deviation       of   wear   for   company   A  and   B 
respectively, are 
circles mm S circles mm S
circles mm X circles mm X
B A
B A
1000 / 8 , 1000 / 2
, 1000 / 12 , 1000 / 20
 
 
i.        Do the data support the claim that the two companies produce material 
with different mean wear? Use 0.05, and assume that each population is 
normally distributed but their variances are not equal. What is the P-value 
for this test?
ii. Construct a two-sided 95% CI that will address the questions in part(i) 
                            and (ii) above.
(6 marks)  
Solution:
(i) (1) The parameter of interest is to test the different between the two means, 
and
   
    (4)   Critical value 0.05, 
(   )   (   )
27
24 /
25
8
24 /
25
2
25
8
25
2
1
/
1
/
2
2
2
2
2
2 2
2
2
2
2
2
2 2
,
_
,
_
,
_
,
_
B
B A
A
B A
B
B
A
A
n
n s
n
n s
n
s
n
s
v
t
0.025, 27
 = 3.057
90
      (5)  The critical region is reject H
0
 if t
0
 > 3.057 or t
0
 < -3.057
      (6)  Computation:
85 . 4
25
8
25
2
) 12 20 (
So
25 , 8 , 2 , 12 , 20
2 2
0
  
+
     
t
n n s s x x
B A B A B A
     (7) Result and conclusion:
Since  t
0
 > 3.057,  thus we have to reject the null hypothesis and enough 
evidence to support that the means are difference.
P-value for the test is P < 0.0001, since t
0 
= 4.85 with 27 df is falls > 3.69  
for which   > 0.0005.
(ii)  A 95% CI for the difference in mean 
and
  is 
13.0427 2.9582                                             
25
2
25
8
) 057 . 3 ) 12 20 (
25
2
25
8
) 057 . 3 ) 12 20 (
) ( ) (
25 , 2 , 8 , 12 , 20
2 2 2 2
2 2
, 2 /
2 2
, 2 /
  
,
_
+ +
,
_
+  
+ +     +  
     
B A
B A
B
B
A
A
v
B A
B A
B
B
A
A
v
B A
B A B A
B A
n
s
n
s
t x x
n
s
n
s
t x x
n n s s x x
 
 
 
   
From  CI,   since  
 and  
B
  are difference.
7.3 Testing about the different between the two proportions, p
1
 and p
2
.
If   the   parameter   of   interest   is   to   test   about   the   different   between   the   two 
proportions of two populations, then the test can be performed as below:
Step 1: To test about the different between two proportions, p
1
 and p
2
 .
91
Step 2:
0 : or 0 : or 0 : versus 0 :
2 1 1 2 1 1 2 1 1 2 1 0
  <  >      p p H p p H p p H p p H
Step 3: Test statistic: 
) 1 , 0 ( ~
1 1
)  1 ( 
) ( )   (
2 1
2 1 2 1
0
N
n n
p p
p p p p
Z
,
_
 where,  
2 1
2 1
n n
x x
p
+
+ + 
   +  
Upper tailed test, 
0
2 1
  >  p p
1
2
1
1
2
1
2 1 2 1
)   (
n n
Z p p p p
   
  + +      
Lower tailed test, 
0
2 1
  <  p p
      +   )   (
2 1
1
2
1
1
2
1
2 / 2 1
p p
n n
Z p p
   
:
:
1
0
            (3) Test statistics is:
B A
B A
B A
B A B A
n n
x x
p
n n
p p
p p p p
z
+
+
,
_
+ 
  
  where ,
1 1
)  1 ( 
) ( )   (
0
    (4)   Critical value 0.05, z
 
= 1.65
93
     (5)  The critical region is reject H
0
 if z
0
 > 1
     (6)  Computation:
29 . 0
149 55
36 24
 where , 9455 . 4
149
1
55
1
) 30 . 0 1 ( 30 . 0
24 . 0 44 . 0
24 . 0  , 44 . 0  , 36 , 24 , 149 , 55
0
  
+
+
 
,
_
 
     
p z
p p x x n n
B A B A B A
           (7)  Result and conclusions:
Since z
0
 > 1.65, then we reject H
0
. It means that enough evidence to claim that 
the  proportion  of  patients  on  hypertension  is  higher  than  non  hypertension 
patients
ii. P-value=(1-(4.95)=(1-1)=0.   Since  the  P-value  is   less   than  0.05,   thus   we 
reject   the   null   hypothesis.   There   is   enough   evidence   to   claim  that  the 
proportion   of   patients   on   hypertension   is   higher   than   non   hypertension 
patients.
iii  A 95% CI on the difference in the two proportion of patients are
0.348 0.052                                                                
149
) 76 . 0 )( 24 . 0 (
55
) 56 . 0 )( 44 . 0 (
) 96 . 1 ( ) 24 . 0 44 . 0 (
149
) 76 . 0 )( 24 . 0 (
55
) 56 . 0 )( 44 . 0 (
) 96 . 1 ( ) 24 . 0 44 . 0 (
)  1 (  )  1 ( 
)   (
)  1 (  )  1 ( 
)   (
24 . 0  , 44 . 0  , 36 , 24 , 149 , 55
2 / 2 /
  
,
_
+ + 
  
,
_
 
     
B A
B A
B
B B
A
A A
B A B A
B
B B
A
A A
B A
B A B A B A
p p
p p
n
p p
n
p p
z p p p p
n
p p
n
p p
z p p
p p x x n n
 
From  CI,   since   0  
B A
p p   is   not   in  the  interval,   so  we  reject  H
0
.   There  a 
significance difference in the two proportion of patients.
7.3 Testing about the different between the two variances, 
2
2
2
1
and   .
If the parameter of interest is to test about the different between the two variances 
of two populations, then the test can be performed as below:
Step 1: To test about the different between two variances, 
2
2
2
1
and   .
94
Step 2:
0 : or 0 : or 0 : versus 0 :
2
2
2
1 1
2
2
2
1 1
2
2
2
1 1
2
2
2
1 0
  <  >                H H H H  
Step 3: Test statistic: 
2
2
2
1
0
s
s
F   
Step 4:   The critical value, at    significant level is   f
 n n
f
s
s
Step 6:  Calculate the test statistics, Z  in step 3.
Step 7:  Decision: To make a conclusion based on the criteria in step 5.
95
Example 7.4:
A random sample of 12 air pollution index at UTP station produced a variance 
0.0340 while a random sample of another 13 air pollution index at Tronoh station 
produced a variance 0.0525. 
(i). Are the population variances equal?. Use  = 0.05.
(ii) Find the 95% two-sided confidence interval on the ratio of two variances. 
Solution:
i.    (1) The parameters of interest are to test the difference between the two 
            variances of pollution indexes, 
1
2
 and 
2
2
.
             (2)  Hypothesis testing:
2
2
2
1 1
2
2
2
1 0
:
:
 
 
H
vs
H
            (3) Test statistics is:
2
2
2
1
0
s
s
F   
    (4)   Critical value 0.05, f
/2, 12,13 
= 3.15
(5)  The critical region is reject H
0
 if 
32 . 0
15 . 3
1 1
or 15 . 3
13 , 12 , 025 . 0
13 , 12 , 975 . 0 0 13 , 12 , 025 . 0 0
     <  >
f
f F f F
     
    (6)  Computation:
648 . 0
0525 . 0
0340 . 0
2
2
2
1
0
    
s
s
F
           (7)  Result and conclusions:
Since  0.32<F
0
  <  3.15,   then  we  cannot   reject  H
0
.   It   means  that   not   enough 
evidence to say that the variances of the two pollution indexes are different.
ii   A 95% two-side  confidence interval on the ratio of  two variances of pollution 
     indexes are
96
041 . 2 206 . 0
) 15 . 3 (
0525 . 0
0340 . 0
15 . 3
1
0525 . 0
0340 . 0
1
2
2
2
1
2
2
2
1
1 , 1 , 2 /
2
2
2
1
2
2
2
1
1 , 1 , 2 /
2
2
2
1
1 , 1 , 2 /
2
2
2
1
2
2
2
1
1 , 1 , 2 / 1
2
2
2
1
1 2
1 2
1 2 1 2
 
 
 
 
 
 
    
 
n n
n n
n n n n
f
s
s
f s
s
f
s
s
f
s
s
From CI, since 
2
2
2
1
and 
1
  for an observed data set. The estimates of, 
0
and 
1
 , are calculated 
using  least   squares  method.   The  estimated  regression  line,   obtained  using  the 
103
values of 
0
and 
1
and 
1
, are given by
,
_
,
_
 
,
_
 
n
i
n
i
i
i
n
i
n
i
i
n
i
i
i i
n
x
x
n
y x
y x
1
2
1 2
1
1 1
1
(8.2)
and
x y
1 0
 
       
(8.3)
where 
n
y
y
n
i
i
  1 is the 
mean of all values of the predictor variable at which the observations were taken.
Once the
0
and 
1
The error of estimate is a square root of the error of sum of squares divided by 
error degree of freedom, n-2. i.e 
2 
n
SS
E
 . The smaller 
The  significance  of   the  fitted  regression  model   can  be  tested  by  using  the  t-
students   test   on   the   parameter,  
1
1
1
0
se
t   
.   If 
2 . 2 / 0   
>
n
t t
   ,   then  the  null   hypothesis,   0
1
  is   rejected.   It   means   that   the 
regression   model   is   adequate   and   fitted   to   the   data   otherwise   there   is   no 
relationship between x and y. 
8.3.4 The ANOVA Approach
The analysis of variance (ANOVA) can also be used to test for the significance of 
regression as in the table below:
ANOVA Table for simple linear regression,
       + +  x Y
1 0 :
Source of 
variation
Degree of 
freedom
Sum of 
squares
Mean of 
squares
F
Regression 1 SS
R
MS
R
=SS
R
/1 MS
R
/MS
E
Error n-2 SS
E
MS
E
=SS
E
/n-2
Total n-1 SS
T
Where,
105
    
n
i
i R
y y SS
1
2
)  ( is   called   regression   sum   of   squares   which   measures 
variability explained by the regression model.
      
   
n
i
n
i
i i i E
e y y SS
1 1
2 2
)  ( is   called  error   sum  of   squares   which  measures  of 
unexplained variability in the response.
n
y
y y y S SS
n
i
i
n
i
i
n
i
i yy T
2
1
1
2
1
2
) (
  
,
_
        
  
 
is   called   total   sum   of   squares 
which  measures of the total variability in the response. 
SS
T
 can be written as,
R E
n
i
i i
n
i
i
n
i
i T
SS SS y y y y y y SS   +    +        
  
2
1
2
1 1
2
)  ( )  ( ) (
If   the  value   of   F  is   large   compared  to  f
,1,   n-2
,   then  the   regression  model   is 
significant. 
8.4 Confidence intervals on regression parameters
A 100 (1- a )% confidence level on the slope 
1
 is given by
where                                    and 
106
)
1 2 , 2 / 1 1 1 2 , 2 / 1
      
   
se t se t
n n    
  +   
)
0 2 , 2 / 0 0 0 2 , 2 / 0
      
   
se t se t
n n    
  +   
XX
S
se
2
1
(
  
   
)
1
(  )
(
2
2
0
XX
S
x
n
se   +     
Example 8.1
The following measurements of the specific heat of a certain chemical were made 
in order to investigate the variation in specific heat with 
temperature.
Temperature 
o
C 0 10 20 30 40 50
Specific heat 0.5
1
0.5
5
0.5
7
0.5
9
0.6
3
0.65
i. Plot the points on a scatter diagram
ii. Estimate the regression line of specific heat on temperature
iii. Estimate  the  value  of  the  specific  heat   when  the  temperature  is 
35
o
C.
Example 8.2
The following data were collected on 8 lung cancer patients where x measures the 
number of years the patient smoke cigarette (or any form of nicotine product) and 
y is the physicians subjective evaluation of the extent of lung damage on a scale 
of 0 to 100.
x (years) 25 35 22 15 48 39 42 31
y (0-100) 55 60 50 30 75 70 71 55
An analysis of variance is conducted using a statistic software package and the 
output is displayed in the table below:
The regression equation is
y = 21.228 + 1.230x
Predictor Coef SE Coef T p  
value
Constant 21.228 9.442 2.248 0.066
x
1.230 0.280 4.397 0.005
S = 8.17169  R - squared = ?       R - squared ( adj) = 0.724
107
Analysis of Variance
Source Df SS MS F p  
value
Regression 1 1290.84 1290.84 19.331 0.005
Residual Error 6 400.66 66.777
Total 7 1691.50
i. Estimate   the  predicted  physician  scores,   on  the  extent   of   lung 
damage, for two patients who have smoking habits of 20 and 40 
years respectively.
ii. Obtain a 95 % confidence interval for the true slope  .
iii. Estimate the coefficient of determination (R
2
). Discuss briefly on 
the value obtained.
iv. Conduct a hypothesis test on the significance of the regression at 
level of significance  = 0.05. 
v. Find a 95% Confidence Interval for the the physicians subjective 
evaluation of the extent of lung damage.
Exercise 8
1. The manager of a car plant wishes to investigate how the plants electricity usage 
depends upon the plant production. The data is given below 
Production 
(RMmillion) 
(x)
4.51 3.58 4.31 5.06 5.64 4.99 5.29 5.83 4.7 5.61 4.9 4.2
Electricity 
Usage 
(y)
2.48 2.26 2.47 2.77 2.99 3.05 3.18 3.46 3.03 3.26 2.67 2.53
(a) Estimate the linear regression equation 
(b) An estimate for the electricity usage when x = 5 
108
x Y
1 0
     + 
(c) Find a 90% Confidence Interval for the electricity usage.
2. An experiment was set up to investigate the variation of the specific heat of a certain 
chemical with temperature. The data is given below 
Temperature 
o
F
(x)
50 60 70 80 90 100
Heat
(y)
1.60
1.64
1.63
1.65
1.67
1.67
1.70
1.72
1.71
1.72
1.71
1.74
(a) Estimate the linear regression equation 
(b) Plot the results on a scatter diagram
(c) An estimate for the specific heat  when the temperature is 75
o
F 
(d) Find a 95% Confidence Interval for the specific heat.
3. An engineer at a semiconductor company wants to model the relationship between the 
device HFE (y) and the parameter Emitter - RS (    ). Data for Emitter - RS was first  
collected and a statistical analysis is carried out and the output is displayed in the 
table given. 
Regression Analysis: y = 1075.2  63.87x
1 
Predictor        Coef       SE Coef            T         P-value
Constant       1075.2        121.1            8.88      0.000
      x
1
           -63.87           8.002           -7.98      0.000
S = 19.4       R-Sq = 0.78
Analysis of variance
Source             DF           SS                MS           F
Regression        1          23965          23965       63.70
Residual           18          6772              376
Total              19        30737
(a)   Estimate HFE when the Emitter - RS is 14.5. 
(b)  Obtain a 95 % confidence interval for the true slope . 
(c)  Test for significance of regression for a  = 0.05. 
4. An chemical engineer wants to model the relationship between the purity of oxygen 
(y) produced in a chemical distillation process and  the percentage of hydrocarbons  
(x ) that are present in the main condenser of the distillation unit. A statistical analysis 
is carried out and the output is displayed in the table given. 
109
1
x
x Y
1 0
     + 
Regression Analysis: y = 74.3 + 14.9x
 
Predictor        Coef        SE Coef           T         P-value
Constant      74.283   1.593           46.62   0.000
      x
1
            14.947   1.317            11.35      0.000
S = 1.087     R-Sq = 87.7%
Analysis of variance
Source             DF           SS                MS           F
Regression        1          152.13   152.13      12.86
Residual           18           21.25       1.18
Total            19        173.38
(a)  Estimate the purity of oxygen when the percentage of hydrocarbon 1%. 
(b)  Obtain a 95 % confidence interval for the true slope . 
(c)  Test for significance of regression for a  = 0.05.
5. Regression  methods  were  used  to  analyze  the  data  from  a  study  investigating  the 
relationship between roadway surface temperature (x) and pavement deflection (y). 
The data follow. 
Temperature 
x
Deflection 
y
Temperature 
x
Deflection 
y
70.0 0.621 72.7 0.637
77.0 0.657 67.8 0.627
72.1 0.640 76.6 0.652
72.8 0.623 73.4 0.630
78.3 0.661 70.5 0.627
74.5 0.641 72.1 0.631
74.0 0.637 71.2 0.641
72.4 0.630 73.0 0.631
75.2 0.644 72.7 0.634
76.0 0.639 71.4 0.638
(a) Estimate   the   intercept     and  slope     regression  coefficients.   Write   the 
estimated 
regression line.
(b) Compute SS
E
 and estimate the variance.
(c) Find the standard error of the slope and intercept coefficients.
(d) Show that 
(e) Compute the coefficient of determination, R
2
. Comment on the value.
(f) Use a t-test to test for significance of the intercept and slope coefficients at    
       .      Give the P-values of each and comment on your results.
110
(g) Construct the ANOVA table and test for significance of regression using the 
P
    value. Comment on your results and their relationship to your results in part (f).
(h) Construct 95% CIs on the intercept and slope. Comment on the relationship of 
these 
      CIs and your findings in parts (f) and (g).
6. The   designers   of   a   database   information   system  that   allows   its   users   to  search 
backwards for several days wanted to develop a formula to predict the time it would 
be take to search. Actually elapsed time was measured for several different values of 
days. The measured data is shown in the following table:
Number of Days 1 2 4 8 16 25
Elapsed Time 0.6
5
0.79 1.36 2.26 3.59 5.39
(a) Estimate the intercept   and slope    regression coefficients. Write the estimated
  regression line.
(b) Compute SS
E
 and estimate the variance.
(c) Find the standard error of the slope and intercept coefficients.
(d) Show that 
(e) Compute the coefficient of determination, R
2
. Comment on the value.
(f) Use a t-test to test for significance of the intercept and slope coefficients at  . 
Give the P-values of each and comment on your results.
(g) Construct the ANOVA table and test for significance of regression using the P-value. 
Comment on your results and their relationship to your results in part (vi).
(h) Construct 95% CIs on the intercept and slope. Comment on the relationship of these 
CIs and your findings in parts (vi) and (vii).
Chapter 9
9. Multiple Linear Regression
Learning Outcomes:
At the end of the lesson, the student should be able to
 Use the least squares method to estimate a multiple linear model
 Carry out tests to determine if the model obtained is an adequate fit to the data
9.1 Introduction
111
Multiple regressions (the term was first used by Pearson, 1908) is to learn more 
about the relationship between several independent or predictor variables and a 
dependent or criterion variable.  It is an extension of a simple linear regression 
model. 
Consider the following data consisting of n sets of values
The value of the dependent variable y
i
 is modeled as 
(9.1)
Where,   k
    ., . . . , , ,
2 1 0 are   called   the  regression   coefficients  can   be 
estimated  by  using  the  least   squares   method.   The  random  error   term,  
,   is 
assumed to follow the normal distribution with a mean of 0 and variance of 
2
 .  
9.2 Fitted Regression Model
The true regression line corresponding to Equation (9.1) is usually never known. 
However, the regressions model can be estimated by estimating the coefficients 
k
    ., . . . , , ,
2 1 0
  for   an   observed   data   set..   The   estimated   regressions 
model  is obtained using the  values of  
k
   
., . . ,
2 1 0
are called  the  fitted 
model equations. The least square estimates, 
k
   
., . . ,
2 1 0
 are given by
112
) .... , , , (
.
) .... , , , (
) .... , , , (
2 1
2 22 12 2
1 21 11 1
kn n n n
k
k
x x x y
x x x y
x x x y
      + + + + 
k k
x x Y ....
1 1 0
These   equations   can  be   solved   by  using   matrices.   Then   we   have   the   fitted 
regression model as given below
9.3 Assessment of the Regression Model
The ANOVA Approach
The analysis of variance (ANOVA) can also be used to test for the significance of 
regression as in the table below:
ANOVA Table for multiple  linear regression,
Source of 
variation
Degree of 
freedom
Sum of 
squares
Mean of squares F
Regression k SS
R
MS
R
=SS
R
/k MS
R
/MS
E
Error n - (k+1) SS
E
MS
E
=SS
E
/n-(k+1)
Total n-1 SS
T
Where,
    
n
i
i R
y y SS
1
2
)  ( is   called   regression   sum   of   squares   which   measures 
variability explained by the regression model.
      
   
n
i
n
i
i i i E
e y y SS
1 1
2 2
)  ( is   called  error   sum  of   squares   which  measures  of 
unexplained variability in the response.
n
y
y y y S SS
n
i
i
n
i
i
n
i
i yy T
2
1
1
2
1
2
) (
  
,
_
        
  
 
is   called   total   sum   of   squares 
which  measures of the total variability in the response. 
113
k k
x x Y     
....
  
1 1 0
  + + + 
      + + + + 
k k
x x Y ....
1 1 0
SS
T
 can be written as,
R E
n
i
i i
n
i
i
n
i
i T
SS SS y y y y y y SS   +    +        
  
2
1
2
1 1
2
)  ( )  ( ) (
The ANOVA table is used to test for significance of regression 
The hypotheses are:
The test statistic is 
MSE
MSR
F   
0
.
If the  value  of  F
0
    is  large  compared  to  f
,,k,   n-(k+1)
,  then the  multiple  regression 
model is significant. 
Example 9.1:
A set of experimental runs were made to determine a way of predicting cooking 
time  y  at   various  levels  of  oven  width  x1,   and  temperature  x2.   The  data  were 
recorded as follows:
i. Estimate the multiple linear regression equation for the data.
ii. Test   whether  the  regression  explained  by  the  model   obtained  in  part   (i)  is 
significant at the 0.01 level of significance. 
Solution:
i. Using the computer for computations, the following results were observed.
The regression equation is  
114
y x1 x2
6.4   1.32   1.15
15.05   2.69   3.4
18.75   3.56   4.1
30.25   4.41   8.75
44.86   5.35   14.82
48.94   6.3   15.15
51.55   7.12   15.32
61.5   8.87   18.18
100.44   9.8   35.19
111.42   10.65   40.4
zero both not are H
H
2 1 1
2 1 0
and :
0 :
 
     
Cooking time = 0.568 + 2.706 width + 2.051 temperature
Predictor        Coef       SE Coef             T               P
Constant        0.568       0.585     0.970         0.364
Width           2.706        0.194   13.935         0.000
Temp.           2.051        0.046  44.380         0.000
S = 0.6334       R-Sq = 100%     R-Sq(adj) = 100%
ii. The following ANOVA table is obtained
Analysis of Variance
Source             DF            SS             MS                F               P
Regression          2 10953.334           5476.667       13647.872 
0.000
Residual Error          7                2.809            0.401 
Total                  9         10956.143 
Since F > f
0.01,  2,  7
  = 9.55, then we reject H
0
. It means that the regressions 
are significant.
Exercise 9
1. Given the data:
Test Number y x1 x2
1 1.6 1 1
2 2.1 1 2
3 2.4 2 1
4 2.8 2 2
5 3.6 2 3
6 3.8 3 2
7 4.3 2 4
8 4.9 4 2
115
9 5.7 4 3
10 5 3 4
(a) Fit a multiple linear regression model to these data.
2. Given the data:
Observation 
Number Pull Strength y
Wire Length 
x1 Die Height x2
1 9.95 2 50
2 24.45 8 110
3 31.75 11 120
4 35.00 10 550
5 25.02 8 295
6 16.86 4 200
7 14.38 2 375
8 9.60 2 52
9 24.35 9 100
10 27.50 8 300
11 17.08 4 412
12 37.00 11 400
13 41.95 12 500
14 11.66 2 360
15 21.65 4 205
16 17.89 4 400
17 69.00 20 600
18 10.30 1 585
19 34.93 10 540
20 46.59 15 250
21 44.88 15 290
22 54.12 16 510
23 56.63 17 590
24 22.13 6 100
25 21.15 5 400
(b) Fit a multiple linear regression model to these data.
3. A study was performed to investigate the shear strength of soil  (y)  as it related to 
depth  in  meter   (x
1
)   and  percentage  moisture  content   (x
2
).   Ten  observations   were 
collected and the following summary quantities obtained: 
 
6 . 595 , 371 , 8 . 736 , 104 , 8 . 550 , 43
, 352 , 12 , 729 , 31 , 9 . 200 , 5
, 916 , 1 , 553 , 223 , 10
2
2 1
2 1
2
2
2
1
2 1
  
  
   
  
  
  
i i i i i
i i i i
i i i
y y x y x
x x x x
y x x n
(a) Estimate the parameters to fit the multiple regression models for these data.
(b) What is the predicted strength when  x
1
=18meter and x
2
= 43%.
116
4. A set of experimental runs were made to determine a way of predicting cooking time 
y at various levels of oven width x1, and temperature x2. The data were recorded as 
follows: 
(a) Fit a multiple linear regression model to these data.
(b) Estimate  and the standard errors of the regression coefficients.
(c) Test for significance of  and  .
(d) Predict the useful range when brightness = 80 and contrast = 75. Construct a 95% 
PI.
(e) Compute the mean response of the useful range when brightness = 80 and contrast 
     = 75. Compute a 95% CI.
(f) Interpret parts (d) and (e) and comment on the comparison between the 95% PI 
     and 95% CI.
5. An  article  in  Optical   Engineering  (Operating  Curve  Extraction  of   a  Correlator's 
Filter, Vol.  43, 2004,  pp. 27752779)  reported  the  use of  an  optical  correlator  to 
perform an experiment by varying brightness and contrast. The resulting modulation 
is characterized by the useful range of gray levels. The data are shown
Brightness (%): 5
4
6
1
6
5
10
0
10
0
10
0
50 57 54
Contrast (%): 5
6
8
0
7
0
50 65 80 25 35 26
Useful range (ng): 9
6
5
0
5
0
11
2
96 80 15
5
14
4
25
5
(a) Fit a multiple linear regression model to these data.
(b) Estimate  and the standard errors of the regression coefficients.
(c) Test for significance of  and  .
(d) Predict the useful range when brightness = 80 and contrast = 75. Construct a 95% 
PI.
117
y x1 x2
6.4   1.32   1.15
15.05   2.69   3.4
18.75   3.56   4.1
30.25   4.41   8.75
44.86   5.35   14.82
48.94   6.3   15.15
51.55   7.12   15.32
61.5   8.87   18.18
100.44   9.8   35.19
111.42   10.65   40.4
(e) Compute the mean response of the useful range when brightness = 80 and contrast 
= 75. Compute a 95% CI.
(f) Interpret parts (d) and (e) and comment on the comparison between the 95% PI 
and 95% CI.
6. A study was performed on wear of a bearing y and its relationship to x
1
 = oil viscosity 
and x
2
 = load. The following data were obtained:
x
1
1.6 15.
5
22.0 43.0 33.0 40.0
x
2
85
1
816 105
8
120
1
135
7
111
5
y 29
3
230 172 91 113 125
(a) Fir a multiple regression model to these data.
(b) Estimate 
2
 and the standard errors of the regression coefficients.
(c) Use the model to predict wear when x1 = 25 and x2 = 1000.
(d) Fit a multiple regression model with an interaction term to these data.
(e) Estimate  
2
 and  se(j)   for   this   new  model.   How  did  these   quantities 
change? Does 
this tell you anything about the value of adding the interaction term to the model?
(f) Use the model in (d), to predict when x
1
=25 and x
2
=1000. Compare this 
prediction with the predicted value from part (c) above.
Chapter 10
10. Factorial Experiment
At the end of the lesson, the student should be able to:
118
 Design and conduct factorial experiments involving two factors using factorial 
design.
 Analyze and interpret main effects and interactions.
 Understand how to use ANOVA to analyze data from these experiments.
   10.1  Introduction
 
Experimental design techniques based on statistics are useful in the engineering 
world for improving the performance of manufacturing process.
By  using  design  experiments,   we  can  determine  which  subsets  of  the  process 
variables have the most influence on process performance. Among the advantage 
of   using  this  experimental   design  are:   it   can  improved  process  yield,   reduced 
variability  in  the  process,   reduced  design  and  development   time  and  also  can 
reduced cost of operation. Statistically designed experiments allow efficiency and 
economy   in   the   experimental   process.   If   data   are   collected   without   an 
experimental   design,   it  may  not  be  possible  to  extract   the  desired  information. 
Results   of   the   analysis   may  be   confusing,   misleading,   not   credible   and  not 
reproducible.  Normally when several factors are of interest in an experiment,  a 
factorial experiment should be used.
   10.2 Terminology and definition
There is some terminology in experimental design such as
 Factor: variable whose influence upon the response variable is being studied 
in the experiment.
 Factor Level: different modes or settings of a factor.
 Trial (or runs): applying of a treatment to an experimental unit.
 Treatment  or  level   combination:   specific   combination   of   the   levels   of 
different factors. 
 Experimental   units  (subjects):   the   basic   unit   for   which   the   response 
measurement are collected.
 Replicates: number of experimental units on which a particular treatment is 
applied.
 A  factorial   experiment  means   that   in   each   complete   replicate   of   the 
experiment   all   possible   combinations   of   the   levels   of   the   factors   are 
investigated.
 The  effect  of  a  factor  is  defined  as  the  change  in  response  produced  by  a 
change in the level of the factor.
 Main  effect:   the  primary   factors  in   the   study   that   change   the   response 
variable.
Interaction  effect:  the  change  in  response  variable  is  due  to  an  interaction 
between the factors
10.3 The 2
k
 factorial design
119
Factorial   design  can  be  used  to  identify  factors  with  significant   effects  on  the 
response,   to  identify  (discover)   interactions   among  factors,   to  identify  which 
factors have the most important effects on the response and last to decide whether 
further investigation of a factors effect is justified. The 2
k
 factorial design means 
that the experiment has been setup with k factors at 2 levels for each factor. The 
objective   is   to   test   and   to   determine   which   are   the   main   effects   and   the 
interactions are important of all k factors at 2 levels.  
The   Analysis   of   Variance   (ANOVA)   can  be   used  to  analyze   the   data   from 
experimental designs. From the ANOVA, the null hypothesis  that the effect is 
equal to 0 is tested. When H
0
  is rejected, this provides evidence that the factor 
involved  actually   affect   the   outcome   (response).   However   some   assumptions 
should be made before we do the analysis. Among the assumptions are the same 
numbers of replicates for each treatment, at least 2 replications for each cells, and 
each treatment is a random sample from a normal population. 
10.4 The 2
2
 factorial design
The 2
2
 factorial design means that the experiment has been setup with 2 factors at 
2 levels for each factor. The objective is to test and to determine which are the 
two main effects and the interactions are important of all 2 factors at 2 levels. In 
one experiment, the example of two factors is factor A: reaction time and factor 
B: reaction temperature.   For  factor A, the two  levels are time  at 1  hour and  2 
hours. These levels can also be denoted as  (minus) for one level and + (plus) for 
another level. For factor B, the two levels are 35
o
C (-) and 55
o
C (+). This can be 
explained by the table below:
Factor A (Time)
1 hour 
( - )
2 hours
( + )
Factor B
(Temperature
)
35
o
C 
( - )
Yields 
measured
113 112 111
, , x x x
Yields 
measured
123 122 121
, , x x x
55
o
C 
( + )
Yields 
measured
213 212 211
, , x x x
Yields 
measured
223 222 221
, , x x x
where, n = 3 replications and  x
ijk
,  k = 1,..,n  are the observations in the cell (i,j).
The levels can be in the form of variable data (numbers) or attribute data such as 
male and female, on and off. Normally the levels will be designated one level as 
high (+) and the other level as low (-) as explained in the table below:
Factor A 
120
(Time)
Low
( -)
High
( + )
Factor B
(Temperature
)
Low
( - )
(1)
a
High
( + )
b ab
All possible treatment combination of the level of the factors or called factorial 
experiment is given in a design or test matrix for 2
2
 factorial designs as follows:
Treatment 
Combinatio
n
Factorial Effect
A B AB
(1) - - +
a
+ - -
b
- + -
ab
+ + +
The  letters  (1),  a,  b  and  ab  represents  the  total   of  all  n  observations  at   each 
treatment combination.
10.5 Estimate the effects of factors in the 2
2
 factorial design
The effect of main factors, factor A and factor B and the effect of AB interaction 
can be calculated using the formula below.
Effect of main factor A is:
[   ] ) 1 (
2
1
2
) 1 (
2
    + 
+
+
 b ab a
n n
b
n
ab a
A
Effect of main factor B is:
[   ] ) 1 (
2
1
2
) 1 (
2
    + 
+
+
 a ab b
n n
a
n
ab b
B
Effect of AB interaction is:
[   ] b a ab
n n
b a
n
ab
AB     + 
+
+
 ) 1 (
2
1
2 2
) 1 (
where [.] is called contrast. 
121
10.6 Sum of squares formula for ANOVA
The effects of main factors and the interaction factor can be tested by using the 
two-way ANOVA table.
ANOVA Table:
Source 
of 
variatio
n
Degree 
of 
Freedo
m
Sum of 
Square
s
Mean of 
squares
F P-
valu
e
A 1 SS
A
MS
A
=SS
A
/1 MS
A
/MS
E
B 1 SS
B
MS
B
=SS
B
/1 MS
B
/MS
E
AB 1 SS
AB
MS
AB
=SS
AB
/
1
MS
AB
/MS
E
Error n-4 SS
E
MS
E
=SS
E
/n-
4
Total n-1 SS
T
The sum of squares for factor A, SS
A
 is given by
[   ]
n
A Contrast
b ab a
n
SS
A
4
) (
) 1 (
4
1
2
   + 
The sum of squares for factor B, SS
B
 is given by
[   ]
n
B Contrast
a ab b
n
SS
B
4
) (
) 1 (
4
1
2
   + 
The sum of squares for factor A, SS
AB 
is given by
[   ]
n
AB Contrast
b a ab
n
SS
AB
4
) (
) 1 (
4
1
2
   + 
The total sum of squares, SS
T
 is given by,
2
2
1
1 1
2
1
1 1
2
4
1
,
_
   
j
i
n
k
ijk
j
i
n
k
ijk T
x
n
x SS
The error sum of squares is obtained by subtraction:
 
                          SS
E
 = SS
T
  SS
A
  SS
B
 - SS
AB
122
10.7 Least square regression model
An initial estimated regression model is,
2 1 3 2 2 1 1 0
   
 x x x x y         + + + 
where,
0