Objective of a Business Report
Objective of a Business Report
To assess the impact of the new landing page, the Data Science team
conducted an experiment using A/B testing methodology. They randomly
divided 100 users into two groups: the control group, which was shown the
existing landing page, and the treatment group, which was presented with
the new landing page. Data regarding user interactions with both versions of
the landing page was collected, including the time spent on the page,
whether the user converted to a subscriber, and the preferred language of
the user.
1. Do users spend more time on the new landing page compared to the existing
landing page?
2. Does the conversion rate (subscriber acquisition) depend on the preferred
language of the user?
3. Is the mean time spent on the new landing page the same for users with
different preferred languages?
Solution
Let:
Given:
P(A∣B)=P(B)P(A∩B)
So, the probabilities of fire, mechanical failure, and human error respectively are
approximately 0.999, 0.9985, and 0.9988.
Solution
To find the probability of a radiation leak (denoted as P(R)), we can use the law of
total probability, which states that the probability of an event can be found by
considering all possible ways the event could occur.
P(R) can occur in three mutually exclusive ways: through a fire ( F), a mechanical
failure (M), or a human error (H).
So, we have:
P(R)=P(R∣F)×P(F)+P(R∣M)×P(M)+P(R∣H)×P(H)
Given:
P(R∣F) = 0.20
P(R∣M) = 0.50
P(R∣H) = 0.10
P(F) = 0.999
P(M) = 0.9985
P(H) = 0.9988
P(R)=(0.20×0.999)+(0.50×0.9985)+(0.10×0.9988)
P(R)=0.1998+0.49925+0.09988
P(R)=0.79993
1.3 Suppose there has been a radiation leak in the reactor for which the
definite cause is not known. What is the probability that it has been
caused by a) a fire b) a mechanical failure c) a human error?
Solution
To determine the probability that the radiation leak was caused by a specific type of
accident (fire, mechanical failure, or human error) given that a radiation leak has
occurred, we can use Bayes' theorem. Bayes' theorem states:
P(A∣B)=P(B)P(B∣A)×P(A)
Where:
In this case, we want to find the probability of each type of accident (fire,
mechanical failure, human error) given that a radiation leak has occurred.
Let:
We already know:
P(A∣BF)=P(BF)P(BF∣A)×P(A)
P(A∣BM)=P(BM)P(BM∣A)×P(A)
P(A∣BH)=P(BH)P(BH∣A)×P(A)
Given that:
For P(BF∣A):
(BF∣A)=P(A)P(A∣BF)×P(BF)=0.20×0.999/0.799930 =0.1998/0.79993
P(BF∣A)= 0.1998/0.79993≈0.24975P(BF∣A)≈0.24975
P(BF∣A):≈ 0.24975
P(BM∣A)=0.49925/0.799930
/
P(BM∣A)≈0.6247
P(BH∣A)=0.09988/0.79993≈0.12445
P(BH∣A)≈0.12445
So, the probabilities that the radiation leak was caused by a) a fire, b) a mechanical
failure, c) a human error are approximately: a) 24.975% b) 62.47% c) 12.445%
2.1 What is the probability that a randomly chosen student gets a grade
below 85 on this exam?
Solution
Given that the grades are normally distributed with a mean ( μ) of 77 and a
standard deviation (σ) of 8.5, we can use the Z-score formula to standardize the
score and then look up the corresponding cumulative probability in the standard
normal distribution table.
Z=(X−μ)/ σ
Where:
X is the score we want to find the probability for (85 in this case),
μ is the mean (77),
σ is the standard deviation (8.5).
=(85−77)/8.5
≈0.9412
From the table or using a calculator or software, we find that the cumulative
probability for Z=0.9412 is approximately 0.8257.
Therefore, the probability that a randomly chosen student gets a grade below 85 on
this exam is approximately 0.8257, or 82.57%.
Solution
the Z-score formula to standardize the scores and then find the cumulative
probabilities for each score.
For X1=65:
(65−77)/8.5=−12/8.5≈−1.4118
Z1≈−1.4118
For X2=87:
(87−77)/8.5=10/8.5≈1.1765
Z2≈1.1765
P(65<X<87)=P(X<87)−P(X<65) =0.8790−0.0788≈0.8002
Therefore, the probability that a randomly selected student scores between 65 and
87 on the exam is approximately 0.80020.8002, or 80.02%80.02%.
2.3 What should be the passing cut-off so that 75% of the students clear
the exam?
Solution
Since the grades are normally distributed with a mean ( μ) of 77 and a standard
deviation ( σ) of 8.5, we need to find the score X such that 75% of the scores fall
below it.
We'll use the Z-score formula to find the Z-score corresponding to the 75th
percentile and then use it to find the corresponding score X.
Now, we use the Z-score formula to find the score X corresponding to this Z-score:
Z=(X−μ)/ σ
Solving for X:
X−77=0.6745×8.5
X−77=5.73325
X≈82.73325
Therefore, the passing cut-off score so that 75% of the students clear the exam
should be approximately 82.7382.73.
Solution
Problem Definition:
The problem at hand is to determine the effectiveness of the new landing page in
gathering new subscribers for the E-news Express portal. This is done through an
A/B test where users are randomly assigned to either the control group (exposed to
the old landing page) or the treatment group (exposed to the new landing page).
The key question to answer is whether the new landing page leads to a higher rate
of subscription compared to the old landing page.
Questions to be Answered:
1. What is the subscription rate for each group?
2. Is there a statistically significant difference in subscription rates between the control
and treatment groups?
3. What insights can be gained from the exploratory analysis of user interactions with
the landing pages?
(100, 6)
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 100 entries, 0 to 99
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 user_id 100 non-null int64
1 group 100 non-null object
2 landing_page 100 non-null object
3 time_spent_on_the_page 100 non-null float64
4 converted 100 non-null object
5 language_preferred 100 non-null object
dtypes: float64(1), int64(1), object(4)
memory usage: 4.8+ KB
# Univariate Analysis
# Group
#Landing page
# Converted
# Language preferred
Bivariate Analysis
The sample standard deviation of the time spent on the new page is: 1.82
The sample standard deviation of the time spent on the new page is: 2.58
Based on the results of the hypothesis test, which yielded a p-value of approximately 0.000139,
we reject the null hypothesis.
Therefore, we have sufficient evidence to conclude that the mean time spent on the new landing
page is significantly greater than the mean time spent on the old landing page.
This inference suggests that the changes made to the landing page design or content have led
to an increase in user engagement, as reflected by the increased time spent on the page.
Null Hypothesis (H0): The converted status does not depend on the preferred language. Alternate
Hypothesis (Ha): The converted status depends on the preferred language. Step 2: Conduct the
hypothesis test and compute the p-value
converted
no 11 19 16
yes 21 15 18
Draw inference
Based on the results of the chi-square test of independence with a p-value of approximately
0.213, we fail to reject the null hypothesis. This suggests that there is not enough evidence to
conclude that the converted status depends on the preferred language.
Therefore, we do not find a significant association between the converted status and the
preferred language.
3.5 Is the mean time spent on the new page same for the
different language users?
- State the null and alternate hypotheses - Check the assumptions of the
hypothesis test. - Conduct the hypothesis test and compute the p-value -
Write down conclusions from the test results
Draw Inference
Based on the p-value obtained from the one-way ANOVA test, which is approximately 0.432,
and considering a significance level of α = 0.05:
Since the p-value is greater than α, we fail to reject the null hypothesis. There is insufficient
evidence to conclude that there is a significant difference in the mean time spent on the new
page among English, French, and Spanish language users. Therefore, we cannot conclude that
the mean time spent on the new page varies significantly across different language groups.
Conclusions from the one-way ANOVA test comparing the mean time spent on the new page
among English, French, and Spanish language users:
The p-value obtained from the test is approximately 0.432. With a significance level of α = 0.05,
we fail to reject the null hypothesis. There is insufficient evidence to conclude that there is a
significant difference in the mean time spent on the new page among English, French, and
Spanish language users. Therefore, we cannot conclude that the mean time spent on the new
page varies significantly across different language groups.