Basic Stat-1, Descriptive Statistics and Probability
Assignment -1
-Requina Rachel Louis
Q2) Identify the data type of each of the following variables
Gender                          Nominal
High School Class Ranking       Ordinal
Celsius Temperature             Interval
Weight                          Ratio
Hair Color                      Nominal
Socioeconomic Status            Ordinal
Fahrenheit Temperature          Interval
Height                          Ratio
Type of living accommodation    Nominal
Level of Agreement              Ordinal
IQ(Intelligence Scale)          Interval
Sales Figures                   Ratio
Blood Group                     Nominal
Time Of Day                     Interval
Time on a Clock with Hands      Interval
Number of Children              Ratio
Religious Preference            Nominal
Barometer Pressure              Ratio
SAT Scores                      Interval
Years of Education              Ratio
Q3) Three Coins are tossed. What is the probability that two heads and one tail are
obtained?
A3) The probability of getting two heads and one tail on tossing three coins at once is equal to
3/8.
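The result can be checked by listing all eight equally likely outcomes and counting those with exactly two heads. This is a minimal sketch; the variable names are illustrative.

```python
from fractions import Fraction
from itertools import product

# All 2^3 = 8 equally likely outcomes of tossing three coins.
outcomes = list(product("HT", repeat=3))

# Keep the outcomes with exactly two heads (and therefore one tail).
favorable = [o for o in outcomes if o.count("H") == 2]

probability = Fraction(len(favorable), len(outcomes))
print(favorable)    # HHT, HTH, THH
print(probability)  # 3/8
```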
Q4) Two Dice are rolled, find the probability that sum is
a. Equal to 1
b. Less than or equal to 4
c. Sum is divisible by 2 and 3
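A4) When two dice are rolled, the sample space has 36 equally likely outcomes:
(1,1) (1,2) (1,3) (1,4) (1,5) (1,6)
(2,1) (2,2) (2,3) (2,4) (2,5) (2,6)
(3,1) (3,2) (3,3) (3,4) (3,5) (3,6)
(4,1) (4,2) (4,3) (4,4) (4,5) (4,6)
(5,1) (5,2) (5,3) (5,4) (5,5) (5,6)
(6,1) (6,2) (6,3) (6,4) (6,5) (6,6)
a. The smallest possible sum is 2, so no outcome gives a sum of 1.
Therefore, P(sum = 1) = 0/36 = 0.
b. The outcomes with a sum less than or equal to 4 are (1, 1), (1, 2), (2, 1), (1, 3), (2, 2), and (3, 1).
Number of favorable outcomes = 6
Thus, P(sum ≤ 4) = 6/36 = 1/6. (A sum of exactly 4 comes only from (1, 3), (2, 2), and (3, 1), which has probability 3/36 = 1/12.)
c. A sum divisible by both 2 and 3 is divisible by 6, so the sum must be 6 or 12.
Favorable outcomes = (1 , 5) , (2 , 4) , (3 , 3) , (4 , 2) , (5 , 1) , (6 , 6)
Therefore,
Number of favorable outcomes = 6
P(sum divisible by 2 and 3) = 6/36 = 1/6.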
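The three parts of Q4 can be checked by enumerating the 36 rolls. This is a sketch under the usual assumption of two fair six-sided dice; the helper name prob is illustrative.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely (die1, die2) outcomes.
rolls = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability that the sum of the two dice satisfies `event`."""
    hits = [r for r in rolls if event(sum(r))]
    return Fraction(len(hits), len(rolls))

p_a = prob(lambda s: s == 1)      # a. sum equal to 1
p_b = prob(lambda s: s <= 4)      # b. sum less than or equal to 4
p_c = prob(lambda s: s % 6 == 0)  # c. sum divisible by both 2 and 3

print(p_a, p_b, p_c)  # 0 1/6 1/6
```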
Q5) A bag contains 2 red, 3 green and 2 blue balls. Two balls are drawn at random. What
is the probability that none of the balls drawn is blue?
A5) There are 7 balls originally with 2 of them blue so the probability of the first ball not being
blue is 5/7. This leaves 6 balls with 2 blue balls. The probability of the second ball not being blue
assuming that the first wasn’t is 4/6. The probability that neither ball drawn was blue is
(5/7)*(4/6)=20/42=10/21
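The same answer follows from counting combinations: choose 2 of the 5 non-blue balls out of all ways to choose 2 of the 7 balls. A short sketch of both routes:

```python
from fractions import Fraction
from math import comb

total_balls = 2 + 3 + 2  # red + green + blue
non_blue = 2 + 3         # red + green

# Sequential draws without replacement, as in the worked answer.
p_sequential = Fraction(non_blue, total_balls) * Fraction(non_blue - 1, total_balls - 1)

# Counting unordered pairs: C(5, 2) / C(7, 2).
p_combinations = Fraction(comb(non_blue, 2), comb(total_balls, 2))

print(p_sequential, p_combinations)  # 10/21 10/21
```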
Q6) Calculate the Expected number of candies for a randomly selected child
Below are the probabilities of count of candies for children (ignoring the nature of the
child-Generalized view)
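Child   Candies count   Probability
A       1               0.015
B       4               0.20
C       3               0.65
D       5               0.005
E       6               0.01
F       2               0.120
A6) Each row gives the probability of a candy count. For example, Child A: probability of having 1 candy = 0.015.
Expected number of candies = ∑ (candies count × probability)
= (1 × 0.015) + (4 × 0.20) + (3 × 0.65) + (5 × 0.005) + (6 × 0.01) + (2 × 0.120)
= 0.015 + 0.80 + 1.95 + 0.025 + 0.06 + 0.24
= 3.09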
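The expected value is the probability-weighted sum of the candy counts. A minimal sketch using the table from the question (the variable names are illustrative):

```python
import math

# (candies count, probability) for children A to F, as given in the question.
table = {
    "A": (1, 0.015),
    "B": (4, 0.20),
    "C": (3, 0.65),
    "D": (5, 0.005),
    "E": (6, 0.01),
    "F": (2, 0.120),
}

# The probabilities should form a valid distribution.
total_probability = sum(p for _, p in table.values())
assert math.isclose(total_probability, 1.0)

expected_candies = sum(count * p for count, p in table.values())
print(round(expected_candies, 2))  # 3.09
```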
Q7) Calculate Mean, Median, Mode, Variance, Standard Deviation, Range & comment
about the values / draw inferences, for the given dataset
● For Points, Score, Weigh:
Find Mean, Median, Mode, Variance, Standard Deviation, and Range and also
Comment about the values/ Draw some inferences.
A7)
SD 0 0.978457443 1.786943236
Q8) Assume one of the patients is chosen at random. What is the Expected Value of the
Weight of that patient?
A8) Each of the 9 patients is equally likely to be chosen, so P(x) = 1/9 for every weight x.
E(X) = ∑ x · P(x)
= (1/9) ( 108 + 110 + 123 + 134 + 135 + 145 + 167 + 187 + 199)
= (1/9) ( 1308)
= 145.33
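Because every patient has the same probability 1/9, the expected value reduces to the arithmetic mean of the weights. A brief check:

```python
from fractions import Fraction

weights = [108, 110, 123, 134, 135, 145, 167, 187, 199]

# Each patient is equally likely, so P(x) = 1/9 for every weight.
p = Fraction(1, len(weights))

expected_weight = sum(w * p for w in weights)
print(sum(weights))                      # 1308
print(round(float(expected_weight), 2))  # 145.33
```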
Q9) Calculate Skewness, Kurtosis & draw inferences on the following data
Use Q9_a.csv
A9) Skewness
speed=-0.117510
distance=0.806895
Inference:
1. Speed distribution is left skewed (negative skewness).
2. Distance distribution is right skewed (positive skewness).
Kurtosis
speed=-0.508994
distance=0.405053
Inference:
1. Speed distribution is platykurtic (negative kurtosis, i.e. flatter than a normal distribution).
2. Distance distribution is leptokurtic (positive kurtosis, i.e. more peaked than a normal distribution).
SP and Weight(WT)
Use Q9_b.csv
Skewness
SP=1.581454
WT=-0.6033099
Kurtosis
SP=5.723521
WT=3.819466
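For reference, skewness and kurtosis can be computed from central moments. The sketch below uses the plain (population) moment formulas; statistical packages often apply a bias correction or report kurtosis without subtracting 3, so their figures can differ from these. The Q9 CSV files are not reproduced here, so the patient weights from the expected-value question serve as stand-in data.

```python
def central_moment(values, k):
    """k-th central moment: average of (x - mean)^k."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** k for v in values) / len(values)

def skewness(values):
    """Moment coefficient of skewness: m3 / m2^(3/2)."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def excess_kurtosis(values):
    """Excess kurtosis: m4 / m2^2 - 3 (0 for a normal distribution)."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3

# Stand-in data: the nine patient weights used earlier in this assignment.
weights = [108, 110, 123, 134, 135, 145, 167, 187, 199]

print(skewness(weights))         # positive value means right skewed
print(excess_kurtosis(weights))  # negative value means platykurtic
```

A positive skewness indicates a longer right tail, and a negative excess kurtosis indicates a flatter-than-normal shape, which is how the inferences above are read.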
A11)
34,36,36,38,38,39,39,40,40,41,41,41,41,42,42,45,49,56
Mean 41
Median 40.5
Mode 41
Variance 25.53
SD 5.052664
The majority of the students' marks lie between 38 and 42. The skewness is 1.52, which is positive: the marks are concentrated on the left side of the graph, with a longer tail to the right.
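The summary statistics for the marks listed in A11 can be reproduced with Python's standard statistics module. This sketch uses the sample (n - 1) variance and standard deviation.

```python
import statistics

marks = [34, 36, 36, 38, 38, 39, 39, 40, 40, 41, 41, 41, 41, 42, 42, 45, 49, 56]

mean = statistics.mean(marks)
median = statistics.median(marks)
mode = statistics.mode(marks)
variance = statistics.variance(marks)  # sample variance, divides by n - 1
sd = statistics.stdev(marks)           # sample standard deviation
data_range = max(marks) - min(marks)

print(mean, median, mode)                # 41 40.5 41
print(round(variance, 2), round(sd, 4))  # 25.53 5.0527
print(data_range)                        # 22
```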
Q13) What is the nature of skewness when the mean, median of data are equal?
A14) Negative skewness, which means the distribution is concentrated on the right side.
A15) Positive skewness, which means the distribution is concentrated on the left side.
Q18) Answer the below questions using the below box plot visualization.
Negative Skewness
10-18
Draw an Inference from the distribution of data for Boxplot 1 with respect to Boxplot 2.
Q 20) Calculate probability from the given dataset for the below cases
a. P(MPG>38)
1-pnorm(38,34.422,9.13144)= 0.3475908
b. P(MPG<40)
pnorm(40,34.422,9.13144)= 0.7293527
c. P (20<MPG<50)
pnorm(50,34.422,9.13144)-pnorm(20,34.422,9.13144)= 0.8989
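The pnorm calls above are R. An equivalent check in Python, assuming MPG is normally distributed with the mean 34.422 and standard deviation 9.13144 used above, might look like this:

```python
from statistics import NormalDist

# Normal model for MPG with the mean and SD used in the answer above.
mpg = NormalDist(mu=34.422, sigma=9.13144)

p_above_38 = 1 - mpg.cdf(38)                     # a. P(MPG > 38)
p_below_40 = mpg.cdf(40)                         # b. P(MPG < 40)
p_between_20_and_50 = mpg.cdf(50) - mpg.cdf(20)  # c. P(20 < MPG < 50)

print(round(p_above_38, 4))           # 0.3476
print(round(p_below_40, 4))           # 0.7294
print(round(p_between_20_and_50, 4))  # 0.8989
```

For an interval, the probability is the difference of the two cumulative probabilities, F(50) - F(20); the upper-tail term 1 - F(20) does not belong in part c.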
Dataset: Cars.csv
Distributed normally
b. Check whether the Adipose Tissue (AT) and Waist Circumference (Waist) from the
wc-at data set follow a Normal Distribution
Dataset: wc-at.csv
Adipose Tissue (AT)
Waist Circumference(Waist)
A22)
60% 0.8416212
90% 1.644854
94% 1.880794
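For a two-sided confidence level c, the z score is the standard normal quantile at 0.5 + c/2. A small sketch that reproduces the values above (the helper name z_score is illustrative):

```python
from statistics import NormalDist

def z_score(confidence):
    """Two-sided critical z value for the given confidence level."""
    return NormalDist().inv_cdf(0.5 + confidence / 2)

z_60 = z_score(0.60)
z_90 = z_score(0.90)
z_94 = z_score(0.94)

print(round(z_60, 6), round(z_90, 6), round(z_94, 6))
```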
Q 23) Calculate the t scores of 95% confidence interval, 96% confidence interval,
99% confidence interval for sample size of 25
A23)
95% 2.063899
96% 2.171545
99% 2.79694
Q 24) A Government company claims that an average light bulb lasts 270 days. A
researcher randomly selects 18 bulbs for testing. The sampled bulbs last an average of 260
days, with a standard deviation of 90 days. If the CEO's claim were true, what is the
probability that 18 randomly selected bulbs would have an average life of no more than 260
days
Hint:
df degrees of freedom
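A24) t = (260 - 270) / (90 / √18) ≈ -0.4714 with df = 18 - 1 = 17.
P(T ≤ -0.4714) ≈ 0.3217, so the probability is about 32.17%.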
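The t score and its lower-tail probability can be checked without external libraries by numerically integrating the Student t density. This is a rough sketch: the functions t_pdf and t_cdf are written here for illustration (they are not library calls), and Simpson's rule is an assumed choice of integration method.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with `df` degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)

def t_cdf(x, df, steps=10_000):
    """P(T <= x), via Simpson's rule on [0, |x|] and symmetry about 0."""
    b = abs(x)
    h = b / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(b, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if x >= 0 else 0.5 - area

n, sample_mean, claimed_mean, sample_sd = 18, 260, 270, 90
t_score = (sample_mean - claimed_mean) / (sample_sd / math.sqrt(n))
probability = t_cdf(t_score, df=n - 1)

print(round(t_score, 4))      # -0.4714
print(round(probability, 4))  # about 0.32
```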