CSBS - AD3491 - FDSA - IA 2 - Answer Key
CSBS - AD3491 - FDSA - IA 2 - Answer Key
Tiruchirappalli - 620012
Correlation Regression
The primary objective of Correlation is, When it comes to regression, its C206.2 U
to find out a quantitative/numerical value primary intent is, to reckon the values
expressing the association between the of a haphazard variable based on the
values. values of the fixed variable.
6 C206.3 R
Multiplication rule states that multiply together the separate
probabilities of several independent events to find the probability that these
events will occur together.
Part – B
(Answer all the questions 2 x 10 = 20marks)
Q.
Questions CO Skills
No.
11 Assume that an r of (–0.80) describes the strong negative relationship
between years of heavy smoking (X) and life expectancy (Y). Assume,
furthermore, that the distributions of heavy smoking and life expectancy
each have the following means and sums of squares:
X̅ = 5 Y̅ = 60
SSx = 35 SSy = 70
(a) Determine the least squares regression equation for predicting life C206.2 A
expectancy from years of heavy smoking.
(b) Determine the standard error of estimate, Sy|x, assuming that the
correlation of (–0.80) was based on n = 50 pairs of observations.
(c) Predict the life expectancy for John, who has smoked for 8 years.
(d) Predict the life expectancy for Katie, who has never smoked.
Or
Each of the following pairs represents the number of licensed drivers (X )
12
and the number of cars (Y ) for seven houses in my neighborhood:
DRIVERS (X) CARS (Y)
5 4
5 3
2 2
2 2
C206.2 A
3 2
1 1
2 2
(a) Determine the least squares equation for these data.
(b) Determine the standard error of estimate, Sy|x, given that n = 7.
(c) Predict the number of cars for each of two new families with two and
five drivers.
Discuss about the following with suitable example: (5x2=10)
13
i. Random Sampling vs Random Assignments
ii. Independent vs Dependent Events C206.3 R
iii. Independent vs Mutually Exclusive Events
iv. Conditional Probability
v. Sampling Distribution of the Mean
Answer:
i) Random sampling is a selection process that guarantees all potential observations in the population
have an equal chance of being selected. Random sampling occurs if, at each stage of sampling, the
selection process guarantees that all potential observations in the population have an equal chance
of being included in the sample. It’s important to note that randomness describes the selection
process - that is, the conditions under which the sample is taken and not the particular pattern of
observations in the sample.
Random Assignment is a procedure designed to ensure that each subject has an equal chance of
being assigned to any group in an experiment. Random sampling occurs in well-designed surveys,
and random assignment occurs in well-designed experiments.
ii) Dependent Events - The occurrence of one event cause some effect on the probability that the other
event will occur.
Independent Events - The occurrence of one event has no effect on the probability that the other
event will occur.
iv) Conditional Probability is the probability of one event, given the occurrence of another event.
Before multiplying to obtain the probability that two dependent events occur together, the
probability of the second event must be adjusted to reflect its dependency on the prior occurrence of
the first event. This new probability is the conditional probability of the second event, given the first
event.
v) The sampling distribution of the mean refers to the probability distribution of means for all possible
random samples of a given size from some population. The sampling distribution of the mean
allows us to determine whether, given the variability among all possible sample means, the one
observed sample mean can be viewed as a common outcome or as a rare outcome.
Refer to class notebook or textbook to know about each terms with suitable examples.
Or
Imagine a very simple population consisting of only four observations:
14 2 3 4 5
(a) Explain the process of constructing relative frequency table showing the
C206.3 A
sampling distribution of the mean.
(b) Construct a relative frequency table showing the sampling distribution
of the mean for the above observations.
Part – C
(Answer all the questions 1 x 10 = 10marks)
Q.No. Questions CO Skills
Define Hypothesis. Discuss in detail about at least 5 types of hypothesis
statement with suitable example.
Hypothesis is a statement about the nature of a population. It is often
stated in terms of a population parameter. Hypothesis testing is a form of
statistical inference that uses data from a sample to draw conclusions about a
15 population parameter or a population probability distribution C206.3 U
Some types of hypothesis statements are Directional Hypothesis, Non-
Directional Hypothesis, Null hypothesis, Alternative hypothesis, Associative
Hypothesis.
Refer to class notebook or https://www.analyticssteps.com/blogs/what-hypothesis-
testing-types-and-methods to know more about different types of hypothesis testing.
Or
Calculate the value of the z test for each of the following situations. Also
given critical z scores of ±1.96, calculate the critical confidence level.
16 (a) X̄ = 12; σ = 9; n = 25, μhyp = 15 C206.3 A
(b) X̄ = 3600; σ = 4000; n = 100; μhyp = 3500
(c) X̄ = 0.25; σ = 010; n = 36; μhyp = 0.22
Critical z Score: A z score that separates common from rare outcomes and hence dictates whether H0 should
be retained or rejected.