0% found this document useful (0 votes)
20 views

Mathematics As A Tool 1

Uploaded by

ml2495941
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views

Mathematics As A Tool 1

Uploaded by

ml2495941
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 68

STATISTICS

Involves the collection, organization, summarization, presentation and


interpretation of data.

Data. Individual pieces of factual information recorded and used for purposes
of analysis.

STATISTICAL METHOD
A. To describe- DESCRIPTIVE STATISTICS. The main tasks is that of
sample or population data organization, presentation and summarization(with out
drawing conclusion or inference beyond the data).
B. To infer-INFERENTIAL STATISTICS. The tasks is to generalize results
beyond the sample data collected.

Population. Is the complete collection of the data or individual that is of


interest.
Sample .A variable or individual that is a subset of a population.
Descriptive Statistics Inferential StatistICS

SAMPLE
INFERENCE

SAMPL
E
Descriptive statistics describe a population and sample’
Inferential statistics do an inference to a sample to
make conclusion about the popu;ation.
POPULATION
Data Measurement Scale
SCALE BASIC BASIC COMMON MARKETING PERMISSIB PERMIS
CHARACTERISTICS EMPERICAL EXAMPLES EXAMPLES LE SIBLE
OPERATION DESCRIPTI INFEREN
S VE TIAL
STATISTICS STATISTI
CS
NOMINA “Names”, “Labels” or Determinatio Gender, Brand name or Percentage, Chi-
L “category” a n of equality color brand numbers, Mode square,
numbers is used to classificatio store type Binomial
classify or identify n.name, Test
individuals address
ORDINA Numbers indicate the Determinatio Quality Preference Percentile, Rank
L relative position of n of lesser or rankings of rankings, Median order
objects but not the greater value, team in a market position, correlatio
magnitude of measure of tournament, social class n,
difference between non numeric #4 is better Friedman
them, has order but concepts than #5 ANOVA
no distance nor
unique origin
INTERVA Both with order or Determinatio Temperatur Attitudes, Range, Product
DESCRIPTIVE STATISTICS
TYPES OF DESCRIPTIVE ANALYSIS
1. Frequency distribution
Count, Percent, Array, Tally
2. Measure of Central Position and other Positions
Mean, Median, Mode, Quartile, Decile and Percentile
3. Measure of Dispersion and Variation
Range, Variance, Standard deviation
4. Correlation and Regression
Linear Regression and Correlation
1.Frequency Distribution
Number of Laptop Computers per Household
2 0 3 1 2 1 0 4 Frequen Observ
2 1 1 7 2 0 1 1 cy(f)Nu ed
0 2 2 1 3 2 2 1 mber of event,
1 4 2 5 2 3 1 2 househ Number
2 1 2 1 5 0 2 5 old of
laptop
Frequency distribution of table above
5 0
12 1
14 2
3 3
2 4
3 5
0 6
1 7
Age of Best Entrepreneur Award Recipients (2020)
40 42 37 76 39 53 45 36 62 43 51 32
42 54 52 37 38 32 45 60 46 40 36 47
29 43 37 38 45 50 48 60 43 58 46 33

Array frequency of the data


Array 29 3 3 36 37 3 39 40 42 43 45 4 47 48 50 51 52 53 54 58 60 62 72
Age 2 3 8 6
Frequ 1 2 1 2 3 2 1 2 2 3 3 2 1 1 1 1 1 1 1 1 2 1 1
ency

Array of the data. Is arranging the data in order (descending or


ascending order.
Frequency.Is the tally of the data values.
Amount Spent for a Cruise(in dollar)
3600 4700 7200 2100 5700 4400 9400
6200 5900 2100 4100 5200 7300 6200
3800 4900 5400 5400 3100 3100 4500
4500 2900 3700 3700 4800 4800 2400

Stem and Leaf Plot for the Amount Spent for a Cruise
Stems Leaves
Steps in constructing Stem and Leaf Plot
2 1 1 4 9 1. Make an array of the data,
3 1 1 6 7 7 8
2. Divide each measurement into two parts:
4 1 4 5 5 7 8
8 9
stem ; leaf
5 2 4 4 7 7 9 3. List the stem in column with a vertical line
6 2 2 boundary with the leaf.
7 2 3 4. For each measure record the leaf portion in
8 order (ascending)on the same row
9 4 corresponding with the stem.
Back-to-back stem and leaf plot
The Statistics test scores for two classes that took the same test of
different time 8am class and 10 am class.
8 98 89 78 68 5 42 65 77 88 9 73 88 85 84 86 73 73 85 92 7 85 71 84 84
a 7 5 2
m
10 45 56 62 71 8 94 48 57 63 7 83 95 59 64 73 86 59 68 75 8 75 86 76 78
am 2 3 6

Statistics Test Scores


8am class 10am class
2 4 5 8
7 5 6 7 9 9
8 5 6 2 3 4 8
8 7 3 3 3 2 1 7 1 3 3 5 5 6 8
9 8 8 6 5 5 4 4 8 2 3 6 6 6
8 5 5 4 2 9 4 5
Frequency Distribution Table(grouped data)
Monthly average Expenses of 50 Entrepreneurs (in thousand
pesos)
23 35 20 65 79 88 45 57 119 67
67 55 81 100 112 96 53 28 40 53
70 93 86 75 60 101 93 86 94 81
38 44 57 69 52 35 46 69 77 66
50 71 30 45 83 79 105 88 68 60

1. Make an array of the data.


2. Set an appropriate size of interval i = 10
3. Determine the class interval by dividing the range R= HV-LV/ I,
199 – 20= 99, 99/10= 9.9 ~10.
4. Start the first class interval from the lowest value applying the
size of interval =10.
Frequency Distribution Table of Monthly Expenses of 50 Entrepreneurs
X f Xm ˂cf >cf fXm rf crf Xm- (Xm- Mean)2 f(Xm-
Mean Mean)2

20-29 3 24.5 3 50 73.5 6 6 -42.5 1806.25 5418.75


30-39 4 34.5 7 47 138 8 14 -32.5 1056.25 4225
40-49 5 44.5 12 43 222.5 10 24 -22.5 506.25 2531.25
50-59 7 54.5 19 38 381.5 14 38 -12.5 156.25 1093.75
60-69 9 64.5 28 31 580.5 18 56 -2.5 6.25 56.25
70-79 6 74.5 34 22 447 12 68 7.5 56.25 337.7
80-89 7 84.5 41 16 591.5 14 82 17.5 306.25 2143.75
90-99 4 94.5 45 9 378 8 90 27.7 756.25 3025
100-109 3 104. 48 5 313.5 6 96 37.5 1406.25 4218.75
5
110-119 2 114. 50 2 229 4 100 47.5 2256.25 4512.5
5
N=5 ΣfXm= 100 Σf(xm-
0 3355 % Mean)2=
2. Measure of Central Position and other
Position
A. From the given example on Frequency Distribution table of
grouped data;
�푓 m 3355
The MEAN = = = 67.1 ≈ 67
� 50

� 50
−푝�푓 −19
The MEDIAN= LBMdn + ( 2
)i = 59.5 +( 2
)10 = 66.1 ≈ 66
푓푀 � 9

1 9−7
The MODE =LBMo + ( )� = 59.5 + ( (9−7)+(9−6)
) 10 = 63.5
1+ 2
Quartile

( −˂�푓푝 1)
Q1 = LBQ1 + 4

푓 1
(12.5−7)
= 39.5 + 10 = 50.5
5
Decile
8�
( − ˂�푓푝 8)
D8 = LBD8 + 10

푓 8
(40−34)
= 79.5 + 10 = 88
7
Percentile
60�
( − <�푓푝 60)
P60 = LBP60 + 100
i
푓 60
(30−28)
=69.5 + 10 = 72.83
6
B. Ungrouped data
Arithmetic mean
Mariella is in 4th year graduating architect student and she
wants to know the present average salary of the first time employed
or newly graduate architect in 5 different companies.
P32000 P28000 P30000 P38000 P45000
Sol.
32000+28000+30000+38000+45000 173000
=
5 5
=34,600
Weighted mean
1.Adem’s Final grade in Statistics based on this weighted criteria of
class performance

Criteria Grade Weight (%) Average

Major Exams 87 40% 34.8


Long Test 92 20% 18.4
Quizzes/Short 88 20% 17.6
Test
Recitation 89 10% 8.9

Assignments 98 10% 9.8

Final Grade 89.5

2.A’s weighted average grade for the 7 subjects I the first semester.
Subje Grade Weight
ct ed Unit
1 1.2 3
2 2.0 5
3 1.5 3
4 2.4 3
5 1.8 3
6 1.4 5
7 2.2 3
Total 25

(1.2�3) + (2.0�5) + (1.5�3) + (2.4�3) + (1.8�3) + (1.4�5) + (2.2�3)


25
44.3
=
25
=1.772
≈1.8
Geometric Mean
Examples: Compounded annual growth rate
A stock that grows by 10% in a year one, declines 20% in year
two and then grows 30% in year year three.The geometric mean of
the growth rate is calculated as follows
3
(1 + 0.1)(1 − 0.2)(1 + 0.3) =0.046 or 4.6%
Example 2. Annual average increase.
The average persons monthly salary in a certain town
jumped from P25000 to P50000 over the course of ten years.
Using the geometric mean, what is the average yearly increase?
25000�50000= 3535.53390593
3535.53
=353.5 The average increase annually is P353.55
10
Median
1.Find the median of the data in the following lists
a. 4 8 1 14 9 21 12 n= 7
1 4 8 9 12 14 21
�+1 7+1
= Median position = = 4�ℎ
� 2
b. 46 23 92 89 77 108
23 46 77 89 92 108
6+1
=3.5th The average between 3rd and 4th
2
77+89
3rd data = 77 4th data = 89 Mdn= = 83
2
Mode
Find the mode of the data in the following lists
a. 18 15 21 16 15 14 15 21
the number 15 occurs frequently more than other value, 15
is the mode
b. 4 3 6 3 3 7 8 9 7 11 7 12 14 15 16 18
14 14
the numbers 3 7 and 14 are the mode?
Other position
Quartile
Find the third quartile of the given data below
92 80 88 80 89 78 75 85 75 85
73 93 74 84 95 85 83 70 84 90
Solutions:
Array of the data
70 73 74 75 75 78 80 80 83 84
84 85 85 85 88 89 90 92 93 95
n=20
3� 3(20)
Q3 = �ℎ �푒 = 15�ℎ �푒 , 푏 inspection is 88
4 4
Seventh Decile
7(20)
• D7 = = 14�ℎ �푒 푏 ��푠푝푒���‱� is 85
10
• The 80th Percentile of the data
80(20)
• P80 = = 16th term by inspection is 90
100
Measure of Dispersion and Variation
High value of dispersion signifies that data points tend to fall
farther away or the data points are more scatter or greater
interval or less consistent.
Low value of dispersion indicates that data points tend to cluster
tightly around the center, data points are less scatter or smaller
interval or more consistent.
Symmetrical proportion of
the distribution based from
standard deviation of the
given data
Example
1.A flower shop costumer requested to decorate their venue with a different kinds of
flowers of almost the same size.
Given the diameter of 5 different flowers randomly chosen by two flower supplier of
the flower shop. Which of the Supplier A and supplier B will satisfy the customers request?

Supplier A : 7 5 4 3 2
� = 4.2 R = 5 s = 1.92
where � is the mean, R is the range
and s is the standard deviation

Supplier B : 7 6 3 3 2
� = 4.2 R = 5 s = 2.17

The supplier A will satisfy the


customers request because it has lower
standard deviation therefore the flower it
Measure of Dispersion Computational
Formula
Range®
Highest value – Lowest Value
Quartile Deviation
3− 1
QD =
2
Average Deviation
� � �−��
AD =
�−1
Variance
Ungrouped data
�(�− �)2
s2=
�−1
Grouped data

�푓(� − �)2
s2 =
�−1
Standard Deviation
Ungrouped data

�(�− �)2
s=
�−1

Grouped data

�푓(� − �) 2
s=
�−1
Example: The following numbers obtained by sampling a
population
X 2X-mean
4 7 12 15(x-mean) 2+4+7+12+15
2 2-8 = -6 −62 = 36
� = =8
5
4 4-8= -4 −42 = 16
7 7-8 = -1
R = 15 -2 =
−12 = 1
13
118
12 12-8 = 4 42 = 16 S2 = = 29.5
4
15 15-8= 7 7 2
= 49 118
S= = 29.5 = 5.43
Summation=118 4

�−6�+�−4�+�−1�+�4�+�7� 22
AD= = = 4.4
5 5
Normal Distribution Curve
• Mean and Standard Deviation
• Raw Score(Standard Normal Distribution)
• Z – Score (Normal Distribution Probability)
• Area Under the Normal Curve
Z
Score
Probability
Distribution
Area
68 73 78 83 88 93 98
Raw Score
Raw Score
Is the data points or data values of a sample or population data.
Let us assumed a data of 50 students grade in statistics. Each
grade is the data point. The mean(�) is also a data point. It is the
interval of values based from the value of the mean and standard
deviation found in the normal distribution.
Let us used this data of the grade in Statistics of 50 students.
Lowest grade is 65 and highest grade is 98. The mean of their
grade is 83 and the standard deviation is 5.
The raw score is distributed in the curve in such a way that the z
score is found correspondingly to a raw score
z Score
z Score can be found using the raw score, the mean and the
standard deviation
1. grade 83 (that is equal to the mean)
�− � 83 −83
z= = =0
푠 5
2. 78
78 −83 −5
z = = = -1
5 5
3. 90
90 −83 7
z = = = 1.4
5 5
1.If the z score is given z = -1.5 we find the raw score
�−� � −83
z = = = -1.5 2. at z = 2.2 we find the raw score
푠 5
� −83
We find x = 2.2
5
x - 83 = 5(-1.5) x - 83 = 11
x – 83 = - 7.5 x = 11 + 83
x = 83 – 7.5 x = 94
x = 75.5 3. at z = 0.6
� −83
= 0.6
5
x - 83 = 3
x = 83+3
x = 86
68 73 78 83 88 93 98
or
87
. 15 87%
0 5.
1

P(z˂ -
1)=0.1587

78
Area under Normal curve

• The area under the normal curve can be found using the table of
normal distribution
1. Let us find P(z ˂ 0.6) = 0.7257
2. P(z ˂ -1.83)= 0.0336
3. P(-2˂ z ˂ 2) = 0.9544
4. P(z> 2) =0.0228
5. P(z> 1.5) = 0.0668
6. P(z˂ ___) = 0.6025 = 0.26
7. P(z˂____) = 0.1020 = -1.27
1.

P(Z˂ 0.6) = 0.2257 +0.5000 =


0.7257
Or 72.57%

P(z˂0.6)= 0.7257 or
72.57%

0.6
86
2.

P(z˂ -1.83) = 0.0336 or


3.36%

0.0336 or
3.36%

-1.83
73.85
3.

P(-2≤ z ≤ 2) = 0.9544
P(z ˂ 2 ) = 0.4772 + 0-5000 = 0-9772
P(z˂ -2) = 0.0228
P(z˂2)- P(z˂ -2) = 0-9772 -0.0228 =
0.9544 or 95.44%

P(-2≤z≤2)= 0.9544 or 95.44%

73 93
4.
P(z>2) = 0.0228
P(z˂ 2) = 0.9772
P(z>2)= 1 – P(z˂2) =1-
0.9772=0.0228
Or 2.28%

P(z>2)=
P(z˂2)=0.9972 or 0.2228 0r
97.72% 2,28%

93
5. P(z>1.5) = 0.0668
P(z˂1.5) = 0.4332+0.5000=0.9332
P(z>1.5) = 1-P(z˂1.5)=1-
0.9332=0.0668
Or 6.68%

P(z>1.5)=0.0668 or
P(z˂1.5)=0.9332 or 6.68%
93.32%

1.5
90.5
6.

P(z˂0.26)=0.6025

0.2
6
7.

0.1020 or
10.20%

-1.27
Applications on Normal Probability
Distribution
1. A data of grade in Statistics of 50 students with a � = 83 and s =
5.
a. Find the probability that the student grade is less than or
equal to 85 and How many of the students score less than or equal
to 85?
b. Find the probability of the student grade is less than or
equal to 74, if 74 is failing How many of the students have failing
grades?
c. c. Find the probability of the students grade is greater than
or equal to 75, if 75 is passing how many of the students passed?
d. Find the probability that the students grade is greater than
85.6, How many of the students have equal or greater than this
grade?
a.
85−83
Z= =0.4; P(z˂ 0.4)
5
=0.1554+0.5000
=0.6554
0.6554x50 = 32.77 or 33 students

0.6554 or
64.45%

0.4
85
b.

74−83
Z= =-1.8; P(z˂-1.8)= 0-
5
0359
0.0359x50= 1.795 or 2
students

0.0359 or
3.59%
-1.8
74
c.
75−83
Z= =-1.6; P(z˂-1.6) =
5
0.0548
P(z>-1.6) = 1-P(z˂1.6)=
1-0.0548 = 0.9452 or 94.52%
0.9452x50= 47.26 or 48
students

0.9452 or
94.52%

0.0548 or
5.48%
-
1.6
75 10
0
d. 85.6−83
Z= =0.52; P(z˂0.52)=
5
0.1985+0.5000 = 0.6985
P(z>0.52)=1-P(z˂0.52)=
1- 0.6985 = 0.3015 or 30.15%
-3015x50=15.075 or 16
students
e.
P(z˂0.26)=
0.1026+0.5000=0.6026
�−83
Z= =0.26
5
X-83=1.3
X=84.3and above this grade.
Descriptive Analysis and Presentation of
Bivariate Data:Linear Correlation and
Regression
• Two Quantitative Variables
• Data for Push-Ups and Sit-Ups
Student 1 2 3 4 5 6 7 8 9 10
Push- 27 22 15 35 30 52 35 55 40 40
Ups
Sit-Ups 30 26 25 42 38 40 32 54 50 43
The coefficient of Linear Correlation r, is a numerical
measure for the strength of the linear relationship between two
variables. It has a value between -1 and +1.A positive 1 shows a
perfect positive correlation and -1 shows a perfect negative
correlation.The value of r is defined by Pearson’s product
moment correlation formula.
Table for finding the five summations

Studen Push-Ups(x) X2 Sit-Ups(y) Y2 xy


t

1 27 729 30 900 810


2 22 484 26 676 572
3 15 225 25 625 375
4 35 1225 42 1764 1470
5 30 900 38 1444 1140
6 52 2704 40 1600 2080
7 35 1225 32 1024 1120
8 55 3025 54 2916 2970
9 40 1600 50 2500 2000
10 40 1600 43 1849 1720
Σx = 351 Σx2=13,717 Σy =380 Σy2= 15,298 Σxy = 14,257
Σx Σx2 Σy Σy2 Σxy
푆푆(� ) 919
r= = = 0.8394 or 0.84
푆푆(�) 푆푆( ) (1396)(858)
where,
(��)2 (351)2
푆푆(�) = Σx2 - = 13,717 - = 1,396.9
� 10
( )2 (380)2
푆푆( ) = Σy2 - = 15,298 − = 858
� 10
� (351)(380)
푆푆(� ) = � - = 14,257 - = 919
� 10
Since r =0.84 then the value is nearly positive 1.The data
has a positive linear correlation.
Linear Regression
• Method of least Squares : Line of Best Fit
y = b0 +b1x
is the predicted value of y in the equation of the line, where b0 is the constant
and b1 is the slope of the line.
푆푆(� ) � −(푏1 . �)
푏1 = b0 =
푆푆(�) �
푆푆(� ) 919
Slope 푏1= = = 0.66
푆푆(�) 1,396.9
−(푏1 . �) 380−((0.6579)(351)) 380−230.9229
y intercept 푏0 = = = = 14.9077 ‱ 14.9
� 10 10

By placing the two values in the model y = 푏0 + 푏1 x, we get the equation of


the line of best fit
y = 14.9 + 0.66x
Chart Title
Series1 Linear (Series1)

60

y = 0.6579x + 14.908
50 R2 = 0.7047

40
Axis Title

30

20

10

0
0 10 20 30 40 50 60
Axis Title
Example 2. Number of hours studied for an exam and the grade
received.Let x be the number of hours studied and y is the
grade received.
X,Hour 2 3 3 4 4 5 5 6 6 6 7 7 7 8 8
s
Studied
Y,Grade 5 5 7 5 7 7 8 6 9 8 7 9 10 8 9

You might also like