Inferential Statistics
Inferential Statistics
PROJECT : INFERENTIAL
STATISTICS
Chinmaye TL
BATCH: PGPDSBA.O.JULY24.A
TABLE OF
CONTENTS
SERIAL PAGE
NUMBER
PROBLEM STATEMENT NUMBER
1. PROBLEM 1 3
2. PROBLEM 2 8
3. PROBLEM 3 14
4. PROBLEM 4 20
2
PROBLEM 1
A physiotherapist with a male football team is interested in studying
the relationship between foot injuries and the positions at which the
players play from the data collected.
3
1.1 What is the probability that a randomly chosen player
would suffer an injury?
The formula used to find the probability that a randomly chosen player suffers an injury is
IMPLICATIONS
A high injury rate (61.7%) suggests that injuries are a significant concern for the team.
It’s crucial to implement or enhance injury prevention programs.
This could include better warm-up routines, strength and conditioning programs, and
flexibility exercises.
4
1.2 What is the probability that a player is a forward or a
winger?
To calculate this probability, the total number of forwards and wingers should be
added and then divided by the total number of players
IMPLICATIONS
With 52% of players being either Forward or Winger, these positions are quite prevalent
on the team.
Understanding that more players are forwards or wingers might explain a larger share of
foot injuries if those positions experience more physical stress or are more prone to
certain types of injury.
Injury prevention strategies should be more targeted towards the demands placed on
forwards and wingers, such as sprinting and cutting movements, which can affect the
feet.
5
1.3 What is the probability that a randomly chosen player
plays in a striker position and has a foot injury?
Dividing the number of injured strikers by the total number of players will give the
probability that a randomly chosen player plays in a striker position and has a foot injury
IMPLICATIONS
19% of the total players being injured Strikers indicates that Strikers are notably affected
by injuries.
Strikers appear to be at higher risk for foot injuries, likely due to the physical demands of
their position, which include quick acceleration, deceleration, sharp turns, and frequent
kicking.
Strikers, due to their high-intensity playing style, should have personalized load
management plans to monitor the strain on their feet during training and matches and
Regular check-ins to assess pain levels or early signs of injury.
6
1.4 What is the probability that a randomly chosen injured
player is a striker?
Dividing the number of injured strikers by the total number of injured players will give the
probability that a randomly chosen injured player is a striker
IMPLICATIONS
31% of injured players are Strikers implies that Strikers, although they make up a smaller
proportion of the team (compared to other positions), have a significant share of the
injuries.
Strikers making up 31% of the injured group suggest that this position may be more
injury-prone compared to others, likely due to the physical demands of their role.
Strikers are essential to the team’s offensive strategy, so managing their health is critical
for performance.
Given that a large portion of injured players are strikers, the physiotherapist may need to
design position-specific rehab programs. Strikers need to regain explosive strength,
balance, and coordination to return to peak form.
7
PROBLEM 2
Quality Control Analysis: Determine the proportion of gunny bags that fail to meet
specific strength thresholds, helping to identify weaknesses in the packaging material.
Supply Chain Efficiency: Understand the potential for wastage or pilferage based on
the strength of the gunny bags, which can affect the integrity of the packaging and
lead to material loss.
Risk Management: Identify the percentage of bags that could fail under various
strength conditions, allowing the company to take preventive measures to reduce
product damage.
Performance Benchmarking: Compare the performance of gunny bags against the
expected standard (mean breaking strength of 5 kg/cm²), ensuring consistency in
packaging quality.
8
METHODOLOGY
2. Z-Score Calculation: To determine the proportion of gunny bags that fall within
specified breaking strength thresholds, z-scores are computed using the formula
This calculation converts the breaking strength values into standard normal variables,
which allows the use of cumulative distribution functions (CDF).
4. Visualization
9
2.1 What proportion of the gunny bags have a breaking
strength less than 3.17 kg per sq cm?
VISUALIZATION
10
2.2 What proportion of the gunny bags have a breaking
strength of at least 3.6 kg per sq cm.?
IMPLICATION
The proportion of gunny bags with a breaking strength of at least 3.6 kg per square cm is
82.38%.
This means that approximately 82.38% of the gunny bags have a strength greater than or
equal to 3.6 kg per sq cm.
VISUALIZATION
11
2.3 What proportion of the gunny bags have a breaking
strength between 5 and 5.5 kg per sq cm.?
IMPLICATION
Approximately 12.93% of the gunny bags have a breaking strength between 5 and 5.5 kg
per sq cm.
VISUALIZATION
12
2.4 What proportion of the gunny bags have a breaking
strength NOT between 3 and 7.5 kg per sq cm.?
Mean (μ\muμ) of breaking strength = 5 kg per sq cm
Standard deviation (σ\sigmaσ) = 1.5 kg per sq cm
We need to calculate P(X<3 or X>7.5), which is the complement of P(3≤X≤7.5).
Converting the values to Z-scores
The complement of this probability gives us the proportion of gunny bags with
a breaking strength NOT between 3 and 7.5 kg per sq cm:
P(X<3 or X>7.5)=1−0.8607=0.1393
IMPLICATION
Approximately 13.93% of the gunny bags have a breaking strength not between 3 and
7.5 kg per sq cm.
VISUALIZATION
13
PROBLEM 3
Problem 3
Zingaro stone printing is a company that specializes in printing
images or patterns on polished or unpolished stones. However,
for the optimum level of printing of the image, the stone surface
has to have a Brinell's hardness index of at least 150. Recently,
Zingaro has received a batch of polished and unpolished stones
from its clients. Use the data provided to answer the following
(assuming a 5% significance level);
14
METHODOLOGY
The average hardness for unpolished stones is slightly above 150, while
polished stones have a mean of approximately 145.
The standard deviation is higher for unpolished stones, indicating greater
variability in their hardness.
The histogram shows that most unpolished stones have hardness values slightly
above 150, while polished stones have a tighter distribution around 145.
15
BOXPLOT
Both distributions exhibit some variation, but there are no extreme outliers. Unpolished
stones show a wider range of hardness values compared to polished stones.
2. Statistical Analysis
2.1 One-Sample t-Test for Unpolished Stones:
To determine if the unpolished stones are unsuitable for printing (mean hardness < 150),
a one-sample t-test was conducted with the following hypothesis:
Null Hypothesis (H₀): The mean hardness of unpolished stones is greater than or
equal to 150.
Alternative Hypothesis (H₁): The mean hardness of unpolished stones is less than
150.
Results:
t-statistic: -0.343
p-value: 0.366
Since the p-value is greater than the significance level of 0.05, we fail to reject the null
hypothesis. This means there is not enough statistical evidence to conclude that the
unpolished stones are unsuitable for printing.
16
2.2 Two-Sample t-Test (Polished vs Unpolished Stones):
To determine if the mean hardness of polished and unpolished stones differs, we
performed a two-sample t-test with the following hypothesis:
Null Hypothesis (H₀): The mean hardness of polished stones equals the mean
hardness of unpolished stones.
Alternative Hypothesis (H₁): The mean hardness of polished stones is different from
the mean hardness of unpolished stones.
Results:
t-statistic: 3.026
p-value: 0.003
Since the p-value is less than the significance level of 0.05, we reject the null
hypothesis. This indicates that the mean hardness of polished and unpolished stones
is significantly different.
3. Conclusion
Recommendation
Zingaro should consider both types of stones for printing but may need to focus on
ensuring that polished stones meet the necessary hardness requirements for optimal
printing.
17
3.1 Zingaro has reason to believe that the unpolished
stones may not be suitable for printing. Do you think
Zingaro is justified in thinking so?
18
3.2 Is the mean hardness of the polished and unpolished
stones the same?
Results:
t-statistic: 3.026
p-value: 0.003
Since the p-value is less than 0.05, we reject the null hypothesis, meaning
the mean hardness of polished and unpolished stones is significantly
different.
19
PROBLEM 4
Problem 4
Dental implant data: The hardness of metal implants in dental
cavities depends on multiple factors, such as the method of implant,
the temperature at which the metal is treated, the alloy used as well
as the dentists who may favor one method above another and may
work better in his/her favorite method. The response is the variable
of interest.
20
4.1 How does the hardness of implants vary depending on
dentists?
Objective:
The primary objective of this analysis was to determine whether different
dentists have a significant effect on the hardness of implants for Alloy 1.
Hypotheses:
Null Hypothesis (H0): There is no significant difference in the hardness of
implants depending on the dentists.
Alternative Hypothesis (Ha): There is a significant difference in the
hardness of implants depending on the dentists.
F-statistic: 7.39
p-value: 0.0094
Given that the p-value is less than 0.05, we reject the null hypothesis. This
suggests that there is a statistically significant difference in the hardness of
implants depending on the dentist for Alloy 1.
21
ANOVA Table Breakdown:
1. Sum of Squares (sum_sq):
Dentist: The variability explained by the "Dentist" factor is 94802.68.
Residual: This is the variability not explained by the "Dentist" factor,
and it’s 551474.57.
2. Degrees of Freedom (df):
Dentist: The degrees of freedom for this factor is 1.0, meaning the
test is comparing two groups (likely two conditions or categories for
"Dentist").
Residual: The residual degrees of freedom is 43.0, representing the
number of observations minus the number of groups being
compared.
3. F-Statistic (F):
The F-value is 7.39, which compares the variance explained by the
"Dentist" factor to the unexplained variance (Residual). A higher F-
value indicates that the factor has a significant effect on the response
variable.
4. p-value (PR(>F)):
The p-value is 0.0094, which is less than the common significance
threshold of 0.05. This means there is strong evidence that the
"Dentist" factor has a statistically significant effect on the response
variable.
CONCLUSION
Since the p-value is 0.0094, we reject the null hypothesis (which typically
states that the "Dentist" factor has no effect). There is significant evidence
that the "Dentist" factor has an effect on the outcome related to Alloy 1.
VISUALIZATION
22
ANOVA Results of Alloy 2
CONCLUSIONS
Given that the p-value (0.14879) is greater than 0.05, we fail to reject the
null hypothesis. This suggests that there is no statistically significant
difference in the hardness of implants depending on the dentist for Alloy 2.
23
4.2 How does the hardness of implants vary depending on
methods?
OBJECTIVE
To determine if there are significant interactions and main effects impacting the hardness
of the implants.
This report presents the results of a two-way ANOVA analysis conducted to assess the
effects of different dentists and methods on the hardness of dental implants made from
two types of alloys.
Methodology
A two-way ANOVA was performed to evaluate the following factors:
Dentist: The different dentists involved in the procedure.
Method: The various methods employed for implant placement.
Hypotheses:
Null Hypothesis (H₀): There is no significant difference in the hardness of implants
based on the dentists and methods used. (All group means are equal.)
Alternative Hypothesis (Ha): There is a significant difference in the hardness of
implants based on the dentists and methods used. (At least one group mean is
different.)
Results
Analysis for Alloy 1:
The results of the two-way ANOVA for Alloy 1 are summarized below:
24
Analysis for Alloy 2:
The results of the two-way ANOVA for Alloy 2 are summarized below:
Significant Findings:
The p-value for the Method (0.000086) is significantly less than 0.05, indicating that
the method used significantly affects implant hardness. However, the p-values for
Dentist (0.082695) and the interaction (0.092577) do not meet the significance
threshold, suggesting that there is not enough evidence to conclude that these
factors have a significant impact on hardness for Alloy 2.
VISUALIZATION
CONCLUSIONS
The analysis reveals that for Alloy 1, both the dentist and method, along with their
interaction, significantly influence the hardness of the implants. Conversely, for Alloy
2, only the method has a statistically significant effect, while the influence of the
dentist and the interaction with the method were not statistically significant.
Recommendations
Further investigation is needed to explore the underlying reasons for these
differences, particularly for Alloy 1, where multiple factors were significant.
Additionally, expanding the sample size and including more variables such as
temperature and alloy composition could provide deeper insights into factors
affecting implant hardness. 25
4.3 What is the interaction effect between the dentist and
method on the hardness of dental implants for each type of
alloy?
Objective
The aim of this analysis was to investigate the interaction effect between the dentist
and method used on the hardness of dental implants for two different alloys (Alloy 1
and Alloy 2). Understanding these interactions can provide insights into the
effectiveness of different methods as performed by various dentists.
Methodology
To analyze the interaction effect, interactive box plots were created using Plotly. These
plots allow for a clear visualization of the hardness of implants as influenced by the
dentist and method. Each box plot represents the distribution of hardness values for
the different methods across dentists, enabling a comprehensive view of the
interaction effects.
Results
1. Alloy 1:
Box Plot Insights: The interactive box plot for Alloy 1 shows variations in
implant hardness among different dentists and methods.
Observations:
Some dentists demonstrate consistently higher hardness levels with
specific methods, suggesting a potential preference or proficiency in using
those methods.
The spread of hardness values indicates variability, which may warrant
further investigation into the underlying factors affecting these differences.
2. Alloy 2:
Box Plot Insights: The box plot for Alloy 2 reveals distinct trends as well.
Observations:
Similar to Alloy 1, certain methods yield higher hardness values when
performed by specific dentists.
The overlap of box plots suggests that while some methods work well
across dentists, there are also noticeable differences that highlight the
importance of individual skill and technique.
Conclusion
The interactive box plots effectively illustrate the interaction effects between dentists
and methods on the hardness of dental implants. The analysis reveals that both the
choice of method and the dentist's proficiency significantly influence the hardness
outcomes for dental implants in both alloys.
26
VISUALIZATION
27
4.3 What is the interaction effect between the dentist and
method on the hardness of dental implants for each type of
alloy?
Objective
This analysis aimed to investigate how the hardness of dental implants varies
depending on the interaction between dentists and methods for two types of alloys
(Alloy 1 and Alloy 2). Understanding these interactions is crucial for optimizing dental
procedures and improving implant outcomes.
Methodology
A two-way ANOVA was conducted separately for each alloy to assess the interaction
effects of dentists and methods on the hardness of implants. The following steps were
undertaken:
1.Hypotheses Formulation:
Null Hypothesis (H₀): There is no significant interaction effect between
dentists and methods on the hardness of implants.
Alternate Hypothesis (Ha): There is a significant interaction effect between
dentists and methods on the hardness of implants.
Based on the ANOVA results, analyzing the interaction effect of the dentist and
method on the hardness of dental implants for both Alloy 1 and Alloy 2.
28
Interpretation for Alloy 1:
Dentist: The p-value of 0.0024 is less than 0.05, which means the dentist has a
significant effect on the hardness of implants.
Method: The p-value of 0.0009 is also less than 0.05, indicating that the method
has a significant impact on implant hardness.
Interaction (Dentist): The p-value of 0.0119 is less than 0.05, suggesting that there
is a significant interaction between dentist and method in influencing the hardness
of Alloy 1 implants.
Method: The p-value of 0.0001 is less than 0.05, meaning the method has a
significant effect on implant hardness.
Interaction (Dentist): The p-value of 0.0926 is greater than 0.05, suggesting that
there is no significant interaction between dentist and method in Alloy 2.
Summary:
Alloy 1: Both the dentist, method, and their interaction have significant effects on
implant hardness.
Alloy 2: Only the method significantly impacts hardness, while the dentist and the
interaction do not.
30