STA108 Project 1
PREPARED BY:
AS115 5B
PREPARED FOR:
CONTENT PAGE
1. INTRODUCTION
1.1 Background of study
1.2 Objectives of study
1.3 Significance of study
1.4 Limitation of study
2. METHODOLOGY
2.1 Data description
2.2 Graphical Technique
2.3 Numerical Technique
3. RESULTS AND INTERPRETATION
3.1 Data presentation
3.2 Descriptive analysis
3.3 Correlation and Regression analysis
4. CONCLUSION
4.1 Report summary
5. APPENDICES AND REFERENCES
CHAPTER 1: INTRODUCTION
1.1 BACKGROUND OF STUDY
This study was carried out to determine the concentration of Ferrum (ppm) in a sample
from its absorbance, using a standard calibration curve prepared from Ferrum standards.
The data are primary data: the analysts collected them themselves for this specific
purpose.
The data set is titled "Concentration of Ferrum (ppm) and its absorbance". Both variables
are quantitative, that is, they can be measured numerically.
The data are ungrouped, so they give information on each member of the population or
sample individually.
Quartiles:
i) The first quartile, Q1, is the value such that at most 1/4 of the measurements are
less than Q1 and at most 3/4 are greater than Q1.
$Q_1 = \left(\frac{n+1}{4}\right)^{\text{th}}$ observation
ii) The third quartile, Q3, is the value such that at most 3/4 of the measurements are
less than Q3 and at most 1/4 are greater than Q3.
$Q_3 = \frac{3}{4}(n+1)^{\text{th}}$ observation
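As a minimal sketch of the (n+1) location rule above, the Python snippet below finds the position of each quartile in an ordered sample and reads off the value at that position. The function names `location` and `value_at` are illustrative only, and the linear interpolation for fractional positions is an assumption (every position in this report is a whole number). The sample is the seven absorbance readings labelled on the boxplot in the appendix.

```python
import math

def location(n, fraction):
    """Position of a quantile in an ordered sample: fraction * (n + 1)."""
    return fraction * (n + 1)

def value_at(ordered, position):
    """Value at a 1-based position; interpolate between neighbours if fractional."""
    lower = math.floor(position)
    weight = position - lower
    if weight == 0:
        return ordered[lower - 1]
    # Assumed convention: linear interpolation between adjacent observations.
    return ordered[lower - 1] + weight * (ordered[lower] - ordered[lower - 1])

# Absorbance readings as labelled on the appendix boxplot, sorted ascending.
absorbance = sorted([0.0026, 0.1794, 0.3094, 0.3614, 0.4330, 0.5250, 0.5772])
n = len(absorbance)

q1 = value_at(absorbance, location(n, 0.25))      # 2nd observation
median = value_at(absorbance, location(n, 0.50))  # 4th observation
q3 = value_at(absorbance, location(n, 0.75))      # 6th observation
print(q1, median, q3)
```

With n = 7 the three positions are 2, 4 and 6, so no interpolation is needed for this data set.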
Range: The difference in value between the largest observation and the smallest
observation.
Range $=$ Largest observation $-$ Smallest observation
Midspread: The difference in value between the third quartile and the first quartile. It
is also known as the interquartile range and covers the middle 50% of the ordered data
set.
Interquartile range $= Q_3 - Q_1$
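Variance: Variance approximates the average squared deviation of each of the
observations from the mean. The standard deviation, s, is the square root of the
variance.
$s = \sqrt{\frac{1}{n-1}\left[\Sigma x^2 - \frac{(\Sigma x)^2}{n}\right]}$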
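The shortcut formula can be checked numerically. The sketch below applies it to the absorbance sums quoted in the correlation analysis of this report (n = 7, Σy = 2.388, Σy² = 1.0548). The function name `sample_variance` is illustrative, not part of any library.

```python
import math

def sample_variance(n, sum_x, sum_x2):
    """Computational form: (sum of squares - (sum)^2 / n) / (n - 1)."""
    return (sum_x2 - sum_x ** 2 / n) / (n - 1)

# Sums for absorbance as quoted in the correlation analysis of this report.
variance = sample_variance(7, 2.388, 1.0548)
std_dev = math.sqrt(variance)
print(round(variance, 5), round(std_dev, 5))
```

Because the sums are rounded to four decimal places, the result should agree with the reported standard deviation and variance of absorbance only to about four decimal places.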
Coefficient of Variation: The coefficient of variation is used to determine how large or
small the deviation is relative to the data, as represented by the mean. A larger
coefficient indicates less consistency and a smaller coefficient indicates more
consistency. It presents the average deviation of the data set as a percentage of its
mean.
$CV = \frac{s}{\bar{x}} \times 100\%$
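To illustrate, the short sketch below evaluates the coefficient of variation for both variables from the standard deviations and means reported in the descriptive analysis. The helper name `coefficient_of_variation` is an illustrative choice.

```python
def coefficient_of_variation(std_dev, mean):
    """Standard deviation expressed as a percentage of the mean."""
    return std_dev / mean * 100

# Summary statistics as reported in the descriptive analysis of this report.
cv_concentration = coefficient_of_variation(7.796, 13.2296)
cv_absorbance = coefficient_of_variation(0.20006, 0.3411)
print(f"{cv_concentration:.2f}% {cv_absorbance:.2f}%")
```

Since the coefficient is unit-free, it allows the spread of concentration (ppm) and of absorbance to be compared directly even though the two are measured on very different scales.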
Pearson’s Coefficient of Skewness: Pearson’s Coefficient of Skewness is used to confirm
whether there is an element of skewness. The type of distribution is based on the sign of
the coefficient:
i. Positive sign: Positively skewed
ii. Zero: Symmetric
iii. Negative sign: Negatively skewed
Pearson’s Coefficient of Skewness $= \frac{3(\bar{x} - \tilde{x})}{s}$
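A minimal sketch of the sign rule follows, using the means and standard deviations reported in the descriptive analysis and the medians read from the appendix output (14.0311 for concentration, 0.3614 for absorbance). The function names are illustrative. Minitab's own "Skewness" figure is computed from a different, moment-based formula, so it is not expected to match these values.

```python
def pearson_skewness(mean, median, std_dev):
    """Pearson's coefficient of skewness: 3 * (mean - median) / s."""
    return 3 * (mean - median) / std_dev

def describe(coefficient):
    """Classify the distribution by the sign of the coefficient."""
    if coefficient > 0:
        return "positively skewed"
    if coefficient < 0:
        return "negatively skewed"
    return "symmetric"

# Mean and s from the descriptive analysis; medians from the appendix output.
sk_concentration = pearson_skewness(13.2296, 14.0311, 7.796)
sk_absorbance = pearson_skewness(0.3411, 0.3614, 0.20006)

print(round(sk_concentration, 4), describe(sk_concentration))
print(round(sk_absorbance, 4), describe(sk_absorbance))
```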
Pearson Product-Moment Correlation Coefficient: The Pearson Product-Moment Correlation
Coefficient is used to analyse the relationship between two variables.
$r = \frac{n\Sigma xy - \Sigma x \Sigma y}{\sqrt{[n\Sigma x^2 - (\Sigma x)^2][n\Sigma y^2 - (\Sigma y)^2]}}$
[Figure: Boxplot of Concentration of Ferrum (ppm)]
The boxplot above shows the distribution of the concentration of Ferrum (ppm). Referring
to the data table for the concentration of Ferrum (ppm) above, the data are rearranged in
increasing order to determine:
i) Median, $\tilde{x}$: location of $\tilde{x} = \left(\frac{n+1}{2}\right)^{\text{th}} = \left(\frac{7+1}{2}\right)^{\text{th}} = 4.0$
ii) First quartile, Q1: location of $Q_1 = \left(\frac{n+1}{4}\right)^{\text{th}} = \left(\frac{7+1}{4}\right)^{\text{th}} = 2.0$
iii) Third quartile, Q3: location of $Q_3 = \frac{3}{4}(n+1)^{\text{th}} = \frac{3}{4}(7+1)^{\text{th}} = 6.0$
vi) Distribution of the data: The distribution of the data is negatively skewed. The left
whisker is longer than the right whisker, and the median line is located towards the
right side of the box.
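The skewness reading of the boxplot can be checked against the five-number summary in the Minitab summary report in the appendix. The sketch below assumes that the whiskers run to the minimum and maximum (that is, no points are drawn as outliers); the variable names are illustrative.

```python
# Five-number summary for concentration of Ferrum (ppm), from the Minitab summary report.
minimum, q1, median, q3, maximum = 0.000, 6.965, 14.031, 20.383, 22.409

# Assumption: whiskers extend to the minimum and maximum (no outliers plotted).
left_whisker = q1 - minimum
right_whisker = maximum - q3
box_midpoint = (q1 + q3) / 2

longer_left_whisker = left_whisker > right_whisker
median_right_of_centre = median > box_midpoint
print(left_whisker, right_whisker, box_midpoint)
```

A longer left whisker together with a median to the right of the box midpoint is the usual boxplot signature of a negatively skewed distribution.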
[Figure: Boxplot of Absorbance]
The boxplot above shows the distribution of the absorbance of the Ferrum atoms. Referring
to the data table for the absorbance above, the data are rearranged in increasing order to
determine:
i) Median, $\tilde{x}$: location of $\tilde{x} = \left(\frac{n+1}{2}\right)^{\text{th}} = \left(\frac{7+1}{2}\right)^{\text{th}} = 4.0$
ii) First quartile, Q1: location of $Q_1 = \left(\frac{n+1}{4}\right)^{\text{th}} = \left(\frac{7+1}{4}\right)^{\text{th}} = 2.0$
iii) Third quartile, Q3: location of $Q_3 = \frac{3}{4}(n+1)^{\text{th}} = \frac{3}{4}(7+1)^{\text{th}} = 6.0$
vi) Distribution of the data: The distribution of the data is negatively skewed. The left
whisker is longer than the right whisker, and the median line is located towards the
right side of the box.
[Figure: Scatterplot of Absorbance vs Concentration of Ferrum (ppm); x-axis: Concentration of Ferrum (ppm), y-axis: Absorbance]
The scatterplot above plots absorbance against the concentration of Ferrum (ppm) on
Cartesian coordinates. The scatterplot makes it possible to identify whether a linear
relationship exists between the two variables. The independent variable (IV) in this
study is the concentration of Ferrum (ppm), which is set by the experimenters. The
dependent variable (DV) is absorbance, which depends on the concentration of Ferrum
(ppm).
3.2 DESCRIPTIVE STATISTICS ANALYSIS
Data for concentration of Ferrum (ppm):
Mean, $\bar{x} = \frac{\Sigma x}{n} = \frac{92.6069}{7} = 13.2296$
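Median, $\tilde{x}$: location of $\tilde{x} = \left(\frac{n+1}{2}\right)^{\text{th}} = \left(\frac{7+1}{2}\right)^{\text{th}} = 4.0$
First quartile, Q1: location of $Q_1 = \left(\frac{n+1}{4}\right)^{\text{th}} = \left(\frac{7+1}{4}\right)^{\text{th}} = 2.0$
Third quartile, Q3: location of $Q_3 = \frac{3}{4}(n+1)^{\text{th}} = \frac{3}{4}(7+1)^{\text{th}} = 6.0$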
Midspread $= Q_3 - Q_1 = 20.3827 - 6.9651 = 13.4176$
Variance, $s^2 = 60.775$
Coefficient of Variation, $CV = \frac{s}{\bar{x}} \times 100\% = \frac{7.796}{13.2296} \times 100\% = 58.9284\%$
∴ The distribution of the data is negatively skewed (skewed to the left).
Data for absorbance of Ferrum atoms:
Mean, $\bar{x} = \frac{\Sigma x}{n} = \frac{2.388}{7} = 0.3411$
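Median, $\tilde{x}$: location of $\tilde{x} = \left(\frac{n+1}{2}\right)^{\text{th}} = \left(\frac{7+1}{2}\right)^{\text{th}} = 4.0$
First quartile, Q1: location of $Q_1 = \left(\frac{n+1}{4}\right)^{\text{th}} = \left(\frac{7+1}{4}\right)^{\text{th}} = 2.0$
Third quartile, Q3: location of $Q_3 = \frac{3}{4}(n+1)^{\text{th}} = \frac{3}{4}(7+1)^{\text{th}} = 6.0$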
Range $=$ Largest observation $-$ Smallest observation $= 0.5772 - 0.0026 = 0.5746$
Midspread $= Q_3 - Q_1 = 0.5250 - 0.1794 = 0.3456$
Standard deviation, $s = 0.20006$
Variance, $s^2 = 0.04002$
Coefficient of Variation, $CV = \frac{s}{\bar{x}} \times 100\% = \frac{0.20006}{0.3411} \times 100\% = 58.6514\%$
∴ The distribution of the data is negatively skewed (skewed to the left).
3.3 CORRELATION AND REGRESSION ANALYSIS
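To measure the linear relationship between the independent and dependent variables:
$r = \frac{n\Sigma xy - \Sigma x \Sigma y}{\sqrt{[n\Sigma x^2 - (\Sigma x)^2][n\Sigma y^2 - (\Sigma y)^2]}}$
Where,
$n = 7$
$\Sigma xy = 40.9491$
$\Sigma x = 92.6069$
$\Sigma x^2 = 1589.7177$
$\Sigma y = 2.388$
$\Sigma y^2 = 1.0548$
$r = 1.0000$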
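The value of r can be reproduced from the sums listed above. The following is a minimal sketch; the function name `pearson_r` is illustrative. Because the sums are rounded to four decimal places, the computed value may land a hair above or below 1 before rounding.

```python
import math

def pearson_r(n, sum_x, sum_y, sum_xy, sum_x2, sum_y2):
    """Pearson product-moment correlation coefficient from summary sums."""
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt(
        (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
    )
    return numerator / denominator

# Sums as listed in this section.
r = pearson_r(7, 92.6069, 2.388, 40.9491, 1589.7177, 1.0548)
print(round(r, 4))
```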
∴ There is a very strong, positive linear relationship between the concentration of Ferrum
(ppm) and its absorbance.
Coefficient of Determination, $r^2$
$r^2 = (1.0000)^2 = 1.0000$
$\%r^2 = 100.00\%$
∴ This implies that 100.00% of the variability in the absorbance can be explained by the
concentration of Ferrum (ppm). The remaining 0.00% of the variability is due to other
unexplained factors.
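Regression line: $Y = A + BX$,
Where,
$A = \frac{\Sigma y}{n} - \frac{B\,\Sigma x}{n}, \quad B = \frac{n\Sigma xy - \Sigma x \Sigma y}{n\Sigma x^2 - (\Sigma x)^2}$
$B = \frac{n\Sigma xy - \Sigma x \Sigma y}{n\Sigma x^2 - (\Sigma x)^2} = 0.02566$
$A = 0.001619$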
∴ When the concentration of Ferrum (ppm) is equal to zero (0), the absorbance is
0.001619. When the concentration of Ferrum (ppm) is increased by 1 ppm, the
absorbance will increase by 0.02566.
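As a sketch of how the calibration line is obtained and then used, the snippet below computes A and B from the sums in the correlation analysis and inverts the line to estimate a concentration from an absorbance reading. The function names and the sample absorbance of 0.25 are hypothetical and chosen only for illustration. Because the sums are rounded, the intercept may differ from the reported value in its last digits.

```python
def fit_line(n, sum_x, sum_y, sum_xy, sum_x2):
    """Least-squares intercept A and slope B for Y = A + BX from summary sums."""
    slope = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    intercept = sum_y / n - slope * sum_x / n
    return intercept, slope

def concentration_from_absorbance(absorbance, intercept, slope):
    """Invert Y = A + BX to read a concentration (ppm) off the calibration line."""
    return (absorbance - intercept) / slope

# Sums as listed in the correlation analysis of this report.
a, b = fit_line(7, 92.6069, 2.388, 40.9491, 1589.7177)

# Hypothetical absorbance reading, used only to illustrate the inversion.
unknown_ppm = concentration_from_absorbance(0.25, a, b)
print(round(a, 6), round(b, 5), round(unknown_ppm, 2))
```

Estimates read off the line are only meaningful within the calibrated range of the standards, roughly 0 to 22 ppm here.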
Figure 4: Regression Model (x-axis: Concentration of Ferrum (ppm), y-axis: Absorbance)
CHAPTER 4: CONCLUSION
In conclusion, carrying out this assignment allowed us to apply the statistical
techniques studied in chapters 1, 2 and 3. From the analysis, we can conclude that
absorbance increases with the concentration of Ferrum (ppm). This is supported by the
positive linear relationship observed between the two variables: the independent
variable (IV), which is the concentration of Ferrum (ppm), and the dependent variable
(DV), which is absorbance.
APPENDICES AND REFERENCES
1. Minitab
Boxplot of Concentration of Ferrum (ppm)
Labelled values: 0.0000, 6.9651, 12.0122, 14.0311, 16.8109, 20.3827, 22.4094
Boxplot of Absorbance
Labelled values: 0.0026, 0.1794, 0.3094, 0.3614, 0.4330, 0.5250, 0.5772
Summary Report for Concentration of Ferrum (ppm)
Anderson-Darling Normality Test
A-Squared 0.19
P-Value 0.847
Mean 13.230
StDev 7.796
Variance 60.775
Skewness -0.678515
Kurtosis -0.085677
N 7
Minimum 0.000
1st Quartile 6.965
Median 14.031
3rd Quartile 20.383
Maximum 22.409
95% Confidence Interval for Mean 6.020 20.440
95% Confidence Interval for Median 5.108 20.923
95% Confidence Interval for StDev 5.024 17.167
Variable Maximum
Concentration of Ferrum 22.41
Absorbance 0.5772
Analysis of Variance
Source DF SS MS F P
Regression 1 0.240152 0.240152 470899.96 0.000
Error 5 0.000003 0.000001
Total 6 0.240154
Obs Absorbance Fit Resid Std Resid
1 0.002600 0.001619 0.000981 2.24 R
R Large residual