Engineering Mathematics 2
UNIT 5
CORRELATION & REGRESSION
UNIT 6
LEAST SQUARE ADJUSTMENT
UNIT 7
CHI SQUARE TEST
UNIT 5
CORRELATION & REGRESSION
5.1 COVARIANCE
Example:
Solution-
Example:
Solution-
The SIGNIFICANCE of (r):
Example:
5.3 LINEAR REGRESSION
5.4 BIVARIATE DISTRIBUTIONS
Example:
For the given lines of regression, 3x – 2y = 5 and x – 4y = 7,
find: (a) regression coefficients byx and bxy
(b) coefficient of correlation r(x, y)
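One common way to approach this kind of problem is to try both assignments of the two lines (one as the regression of y on x, the other as the regression of x on y) and keep the assignment for which the product of the regression coefficients lies between 0 and 1, since r² = byx · bxy. The sketch below applies that idea to the two lines above. The helper name `regression_coefficients` and the `(a, b, c)` encoding of a line a·x + b·y = c are illustrative assumptions, not notation from these notes.

```python
import math

def regression_coefficients(line1, line2):
    """Each line is a tuple (a, b, c) meaning a*x + b*y = c.

    Tries both assignments and returns (byx, bxy, r) for the one
    where byx * bxy lies in [0, 1], as required by r**2 = byx * bxy.
    Assumes the relevant coefficients are non-zero.
    """
    for y_on_x, x_on_y in ((line1, line2), (line2, line1)):
        a1, b1, _ = y_on_x      # solve for y: y = (-a1/b1) * x + c1/b1
        a2, b2, _ = x_on_y      # solve for x: x = (-b2/a2) * y + c2/a2
        byx = -a1 / b1          # slope of the y-on-x line
        bxy = -b2 / a2          # slope of the x-on-y line
        product = byx * bxy
        if 0 <= product <= 1:
            # r takes the common sign of the two regression coefficients
            r = math.copysign(math.sqrt(product), byx)
            return byx, bxy, r
    raise ValueError("no valid assignment: byx * bxy must lie in [0, 1]")

# Lines from the example: 3x - 2y = 5 and x - 4y = 7
byx, bxy, r = regression_coefficients((3, -2, 5), (1, -4, 7))
print(byx, bxy, round(r, 4))
```

Treating 3x – 2y = 5 as the line of y on x would give byx · bxy = 6, which is impossible, so the sketch settles on x – 4y = 7 as y on x and 3x – 2y = 5 as x on y. That gives byx = 1/4, bxy = 2/3 and r = 1/√6 ≈ 0.408.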
Example:
Example:
Example: (PYRQ)
UNIT 6
LEAST SQUARE ADJUSTMENT
UNIT 7
CHI SQUARE TEST
Steps for the Chi-Square Test for One Variance:
1. Specify the null and alternative hypotheses. The null hypothesis is usually
that the population variance equals a specific value, while the alternative
hypothesis is that the population variance is not equal to that value.
2. Select a sample from the population and calculate the sample variance and
the sample size.
3. Calculate the test statistic, which is (n − 1) times the sample variance
divided by the hypothesized population variance: $\chi^2 = (n - 1)s^2 / \sigma_0^2$.
4. Determine the critical values of the test statistic from the significance
level (alpha) of the test and the degrees of freedom. The degrees of freedom
are the sample size minus 1.
5. Compare the calculated test statistic to the critical values to decide
whether to reject or fail to reject the null hypothesis. For the two-sided
alternative in step 1, the null hypothesis is rejected if the statistic falls
below the lower critical value or above the upper critical value.
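The steps above can be sketched in a few lines of Python using the standard statistic χ² = (n − 1)s²/σ₀². This is a minimal illustration, not a full implementation: the function name `chi_square_variance_test` is hypothetical, the sample data are invented for the illustration, and the two critical values are assumed to be read from a standard chi-square table (here df = 7, alpha = 0.05, two-tailed).

```python
from statistics import variance

def chi_square_variance_test(sample, sigma0_sq, lower_crit, upper_crit):
    """Two-sided chi-square test for one variance.

    sample      -- list of observations (assumed random, independent, normal)
    sigma0_sq   -- population variance stated in the null hypothesis
    lower_crit, upper_crit -- critical values looked up by the caller
    Returns (statistic, degrees_of_freedom, reject_null).
    """
    n = len(sample)
    s_sq = variance(sample)              # sample variance (divisor n - 1)
    df = n - 1                           # degrees of freedom
    statistic = df * s_sq / sigma0_sq    # (n - 1) * s^2 / sigma0^2
    reject = statistic < lower_crit or statistic > upper_crit
    return statistic, df, reject

# Hypothetical data: test H0: sigma^2 = 4 against H1: sigma^2 != 4
sample = [2, 4, 4, 4, 5, 5, 7, 9]
statistic, df, reject = chi_square_variance_test(
    sample, sigma0_sq=4.0, lower_crit=1.690, upper_crit=16.013
)
print(statistic, df, "reject H0" if reject else "fail to reject H0")
```

For this made-up sample the sample variance is 32/7, so the statistic is 8 with 7 degrees of freedom. It lies between the two critical values, so the null hypothesis is not rejected.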
Conditions for the Chi-Square Test for One Variance:
To conduct a valid chi-square test for one variance, the following conditions
must be met:
i. The sample must be drawn randomly from the population.
ii. Each observation in the sample must be independent of the others.
iii. The population distribution must approximate a normal distribution.
The chi-square test estimates how likely the observed data would be if the
null hypothesis were true.
Chi-Square Distribution
When the null hypothesis is true, the sampling distribution of the test
statistic is the chi-squared distribution. The chi-squared test helps to
determine whether there is a significant difference between the expected frequencies
and the observed frequencies in one or more classes or categories. It can also be
used to test whether two categorical variables are independent.
Note: This form of the chi-squared test applies only to categorical data, for
example Gender (men and women), or Age and Height once they are grouped into classes.
Finding P-Value
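P stands for probability here. The p-value is the probability, assuming the null
hypothesis is true, of obtaining a chi-square statistic at least as large as the one
observed. A p-value less than or equal to the chosen significance level leads to
rejecting the null hypothesis; a larger p-value means the null hypothesis is not rejected.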
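As a rough illustration of how a p-value is obtained from a chi-square statistic, the sketch below uses the closed form of the upper-tail probability that holds when the degrees of freedom are even: P(X ≥ x) = e^(−x/2) · Σ (x/2)^k / k! for k = 0 … df/2 − 1. The function name `chi_square_p_value_even_df` is hypothetical, and odd degrees of freedom are deliberately not covered (a statistical library or table would normally be used instead). The inputs are the statistic 0.7533 with df = 2 from the car-ownership example in these notes.

```python
import math

def chi_square_p_value_even_df(statistic, df):
    """Upper-tail probability of a chi-square variable, even df only."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form only covers positive even df")
    half = statistic / 2.0
    term = 1.0      # (x/2)^0 / 0!
    total = 1.0
    for k in range(1, df // 2):
        term *= half / k        # builds (x/2)^k / k! incrementally
        total += term
    return math.exp(-half) * total

alpha = 0.05
p = chi_square_p_value_even_df(0.7533, 2)
print(round(p, 4), "reject H0" if p <= alpha else "fail to reject H0")
```

With df = 2 the formula reduces to e^(−x/2), so the tabulated critical value 5.99 corresponds to a p-value of about 0.05, which is a quick sanity check on the sketch.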
Properties
Formula
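The chi-squared test checks whether the observed values differ from the
expected values. The formula for chi-square can be written as:
$\chi^2 = \sum_i \frac{(O_i - E_i)^2}{E_i}$
where $O_i$ is the observed value and $E_i$ is the expected value for category i.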
Example:
A survey on cars conducted in 2011 determined that 60% of car owners
have only one car, 28% have two cars, and 12% have three or more. Suppose
that you have decided to conduct your own survey and have collected the data
below; determine whether your data supports the results of the study.
Use a significance level of 0.05. In your survey of 129 car owners, 73 had
one car and 38 had two cars, so the remaining 18 had three or more.
Solution:
H0: The proportions of car owners with one, two, and three or more cars are 0.60,
0.28 and 0.12, respectively.
H1: The proportions of car owners with one, two, and three or more cars do not
match the proposed model.
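Let’s tabulate the given information and calculate the required values
(expected counts are 129 times the proposed proportion, rounded to one decimal place).

Category        Observed (Oi)   Expected (Ei)   Oi - Ei   (Oi - Ei)^2   (Oi - Ei)^2/Ei
One car         73              77.4            -4.4      19.36         0.2501
Two cars        38              36.1            1.9       3.61          0.1000
Three or more   18              15.5            2.5       6.25          0.4032
Total           129                                                     0.7533

Therefore, $\chi^2 = \sum_i (O_i - E_i)^2 / E_i = 0.7533$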
Let’s compare it to the critical chi-square value for the significance level 0.05.
Using the table, the critical value for a 0.05 significance level with df = 2 is 5.99.
That means that if the null hypothesis is true, about 95 samples out of 100 will
give a $\chi^2$ value of 5.99 or less.
The chi-square statistic is only 0.7533, which is below the critical value, so we
fail to reject the null hypothesis: the survey data are consistent with the study.
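The arithmetic in this example can be checked with a short script. It is only a sketch: it uses the counts and proportions given in the problem, takes the critical value 5.99 from the text rather than computing it, and keeps the expected counts unrounded. That gives a statistic of about 0.7582 rather than 0.7533; the small difference appears to come from rounding the expected counts to one decimal place, and the conclusion is the same either way.

```python
# Goodness-of-fit check for the car-ownership example (standard library only).
categories = ["one car", "two cars", "three or more"]
observed = [73, 38, 129 - 73 - 38]      # 18 owners had three or more cars
proportions = [0.60, 0.28, 0.12]        # proportions claimed under H0
critical_value = 5.99                   # from the text: alpha = 0.05, df = 2

n = sum(observed)
expected = [n * p for p in proportions]  # unrounded expected counts
contributions = [(o - e) ** 2 / e for o, e in zip(observed, expected)]
chi_square = sum(contributions)
df = len(observed) - 1                   # number of categories minus 1
reject_h0 = chi_square > critical_value

# Print the working table, one row per category
for name, o, e, c in zip(categories, observed, expected, contributions):
    print(f"{name:15s} O={o:3d}  E={e:6.2f}  (O-E)^2/E={c:.4f}")
print(f"chi-square = {chi_square:.4f}, df = {df}")
print("reject H0" if reject_h0 else "fail to reject H0")
```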
Table
The chi-square distribution table with three probability levels is provided here. The
statistic is used to examine whether the distributions of certain variables differ
from one another. A categorical variable produces data in categories, and a
numerical variable produces data in numerical form.
ANS- 10.38-10.44