Question 1 (5 Marks) : Part 3 Hypothesis Testing (5 Marks)

This document contains 4 questions related to statistical concepts and simulations. Question 1 involves setting up the theoretical expectations for the central limit theorem (CLT) simulation. Question 2 involves simulating the CLT results for different sample sizes and probability mass functions without using libraries. Question 3 involves performing hypothesis tests on two scenarios. Question 4 provides definitions for terms used in the CLT simulation questions and asks the student to state the theoretical distributions expected from the CLT for different sample sizes and a given probability mass function.

Uploaded by

Moreen


Question 1 (5 marks)

The SETU (https://www.monash.edu/ups/setu) score of FIT units is known to follow a Gaussian distribution (https://en.wikipedia.org/wiki/Normal_distribution) with a variance of 0.25.
Suppose you wish to estimate the mean SETU score μ for all units by taking a sample of n units and checking their SETU scores from last semester. How many units must the sample
contain for a 95% confidence interval for μ to have a width of 0.1?

ANSWER
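
One way to approach this, sketched in Python under the assumption that the interval is the usual known-variance z-interval x̄ ± z·σ/√n, whose width is 2·z·σ/√n. The function name `sample_size_for_width` is illustrative and not part of the assignment.

```python
import math
from statistics import NormalDist


def sample_size_for_width(variance, width, confidence=0.95):
    """Smallest n for which a known-variance z-interval is no wider than `width`."""
    sigma = math.sqrt(variance)
    # Two-sided critical value (about 1.96 for 95% confidence).
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    # width = 2 * z * sigma / sqrt(n)  =>  n = (2 * z * sigma / width) ** 2
    return math.ceil((2 * z * sigma / width) ** 2)


n_units = sample_size_for_width(variance=0.25, width=0.1)
print(n_units)
```

Rounding up with `math.ceil` keeps the interval at or below the requested width, since n must be a whole number of units.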

Question 2 (5 marks)
You run a poll to estimate the fraction p of students who participated in the FIT5197 SETU survey, and you take the observed proportion among the surveyed students as the estimate p̂ of p.
You need at least 95% certainty that the difference between the surveyed rate p̂ and the actual rate p is no more than 10%. At least how many people should take the survey?

ANSWER
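
The question does not say which bound to use, so the sketch below shows two common options side by side. Both assume the worst case p(1 − p) ≤ 1/4 because p is unknown, and both read "10%" as an absolute margin of 0.1. The function names are illustrative.

```python
import math
from fractions import Fraction
from statistics import NormalDist


def poll_size_normal(margin, confidence=0.95):
    """Sample size from the normal (CLT) approximation to p_hat."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    # z * sqrt(0.25 / n) <= margin  =>  n >= (z / (2 * margin)) ** 2
    return math.ceil((z / (2 * margin)) ** 2)


def poll_size_chebyshev(margin, confidence):
    """Sample size from Chebyshev's inequality; pass Fractions for exact arithmetic."""
    # P(|p_hat - p| >= margin) <= 0.25 / (n * margin**2) <= 1 - confidence
    return math.ceil(Fraction(1, 4) / (margin**2 * (1 - confidence)))


n_normal = poll_size_normal(0.1)
n_chebyshev = poll_size_chebyshev(Fraction(1, 10), Fraction(95, 100))
print(n_normal, n_chebyshev)
```

Chebyshev's inequality makes no distributional assumption and therefore asks for a much larger sample than the normal approximation.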

Question 3 (5 marks)
Suppose you repeated the above polling process multiple times and obtained 40 confidence intervals, each with a confidence level of 90%. About how many of them would you expect to
be "wrong"? That is, how many of them would not actually contain the parameter being estimated? Should you be surprised if 12 of them are wrong?

ANSWER
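
A minimal sketch of the arithmetic, assuming the 40 intervals are independent so that the number of "wrong" intervals is Binomial(40, 0.1). The helper `binomial_upper_tail` is a hypothetical name introduced here.

```python
from math import comb


def binomial_upper_tail(n, p, k):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k, n + 1))


n_intervals = 40
miss_probability = 0.10  # 1 minus the 90% confidence level

# Expected number of intervals that miss the parameter.
expected_misses = n_intervals * miss_probability
# Probability of seeing 12 or more misses if each interval really misses 10% of the time.
tail_probability = binomial_upper_tail(n_intervals, miss_probability, 12)
print(expected_misses, tail_probability)
```

A tail probability far below common significance levels would suggest that 12 misses is surprising under the stated 90% coverage.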

Question 4 (5 marks)
In lecture 3 (https://d3cgwrxphz0fqu.cloudfront.net/81/8c/818c7ed4d0cd856607bf4a5347fb10a6f9dcea50?response-content-disposition=inline%3Bfilename%3D%22FIT5197_L3.pdf%22&response-content-type=application%2Fpdf&Expires=1649953740&Signature=JqqTutDRrQhBB6QLX9pCb58FlEcx4WdmvWt6fOdki83rImO0cY8z5~VM1G8xyXBa81U9ffBzCivE5eoZCGB8LulfUuiuUlPaY7fIBlEqW1k41YRZzwdlgmL~UCbMKHmFCOwfw2aoD1MgC2hE-2-iPCFesIXUrdY9oWUsjx6XaDjEAdRylr30SQGV93JdqehV46MvsU-YW8Miq6BfeMWLPT2gvIjz7sz0Dqwp~6PRMGuJWNf6GfiAPW6-mjnAx91AKBKopIG4LRjkvL98oEgh~dSmPS4Hg__&Key-Pair-Id=APKAJRIEZFHR4FGFTJHA), we mentioned the use
of the weak law of large numbers which tells us that the sample estimator will converge to the population parameter if we have a sufficiently large number of observations (or sample
size). In this question, we would like to see how big the sample size should be in order to get the approximation error down to a certain level.

Continuing from Question 3, let the random variable X indicate whether a confidence interval covers the unknown parameter. Thus, X follows a Bernoulli
distribution with parameter θ, i.e., X ∼ Be(θ), where θ = 0.9 was given in Question 3. Suppose you collect n random variables X1, X2, …, Xn. Calculate the smallest
number of confidence intervals, n, you have to observe to guarantee that

$$P\left(\left|\frac{\sum_{i=1}^{n} X_i}{n} - \theta\right| > 0.01\right) < 0.1.$$

ANSWER
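
Since the question is framed around the weak law of large numbers, one plausible route is Chebyshev's inequality, P(|X̄ − θ| ≥ ε) ≤ θ(1 − θ)/(n·ε²). The sketch below assumes that route and uses exact fractions to avoid floating-point rounding at the boundary. The variable names are illustrative.

```python
import math
from fractions import Fraction

theta = Fraction(9, 10)     # coverage probability from Question 3
epsilon = Fraction(1, 100)  # allowed deviation, 0.01
delta = Fraction(1, 10)     # required upper limit on the probability, 0.1

# Variance of a single Bernoulli(theta) observation.
variance_single = theta * (1 - theta)

# Chebyshev bound: variance_single / (n * epsilon**2) < delta
#   =>  n > variance_single / (epsilon**2 * delta)
threshold = variance_single / (epsilon**2 * delta)

# Smallest integer strictly greater than the threshold.
smallest_n = math.floor(threshold) + 1
print(threshold, smallest_n)
```

At n equal to the threshold the Chebyshev bound equals 0.1 exactly, so the sketch takes the next integer to satisfy the strict inequality. If the bound is allowed to equal 0.1, the threshold itself would be the answer.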

Part 3 Hypothesis Testing (5 marks)

Question 1 (2.5 marks)

As a motivation for students to attend the tutorial, Levin is offering a lot of hampers this semester. He has designed a spinning wheel (an example: https://spinnerwheel.com/)
with four choices on it: "Hamper A", "Hamper B", "Hamper C", and "Better Luck Next Time". These choices are evenly distributed on the wheel.
If a student completes the attendance form for one of the tutorials, they will get a chance to spin the wheel.

As a hard-working student yourself, you have earned 12 chances by the end of the semester. When you finished your spins, the result showed {"N", "A", "N", "N", "B", "C", "N", "N", "N",
"A", "A", "N"} ("A", "B" and "C" denote the three hampers respectively, while "N" denotes "Better Luck Next Time"). You are shocked by the result and feel the game might be faulty. Before
questioning Levin, you would like to perform a hypothesis test to check whether you were simply unlucky or whether Levin has secretly done something to influence the probability of
winning. State your hypothesis, perform the test and interpret the result.

ANSWER
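
A minimal sketch of one possible test, assuming the hypotheses H0: P("N") = 0.25 against H1: P("N") > 0.25 and independent spins. It uses an exact one-sided binomial test, and also computes the chi-square goodness-of-fit statistic over all four outcomes for comparison. Neither choice is prescribed by the question.

```python
from collections import Counter
from math import comb

spins = ["N", "A", "N", "N", "B", "C", "N", "N", "N", "A", "A", "N"]
counts = Counter(spins)
n_spins = len(spins)
p_lose = 0.25  # probability of "N" on a fair wheel with four equal segments

# Exact one-sided binomial p-value: P(X >= observed number of "N" | H0).
n_lose = counts["N"]
p_value = sum(
    comb(n_spins, k) * p_lose**k * (1 - p_lose) ** (n_spins - k)
    for k in range(n_lose, n_spins + 1)
)

# Chi-square goodness-of-fit statistic with equal expected counts (3 degrees of freedom).
# Expected counts of 3 are small, so the chi-square approximation is rough here.
expected = n_spins / 4
chi_square = sum((counts[c] - expected) ** 2 / expected for c in "ABCN")

print(n_lose, round(p_value, 5), chi_square)
```

The p-value is compared with the chosen significance level (for example 0.05). The chi-square statistic is compared with the critical value for 3 degrees of freedom, which is about 7.815 at the 5% level.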
Question 2 (2.5 marks)
The operation team of a retailer is about to report the performance of year 2022. As the data analyst, your job entails reviewing the reports provided by the team. One of the reports
regarding membership subscription looks suspicious to you. In this report, they compared the amount of money spent by the members against the non-members over the year. The
methodology is that they randomly selected 20 customers and compared their spending before and after becoming a member.

The average spending before becoming a member is $88.5 per week with a standard deviation of $11.2. The average after becoming a member is $105 per week with a standard
deviation of $15. In the report, the retailer claimed that after becoming a member, customers tend to spend 10% more than before on average.

As a statistician, you decide to perform a hypothesis test to verify the veracity of this claim. State your hypothesis, perform the test and interpret the result. Additionally, please suggest
another methodology to compare member vs non-member.

ANSWER
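
The data are paired (the same 20 customers before and after), but the report gives no standard deviation for the paired differences. The sketch below therefore makes a loudly stated assumption: it treats the two sets of measurements as independent and tests H0: μ_after = 1.1·μ_before. This is one possible formulation, not the only one, and the variable names are illustrative.

```python
import math

n_customers = 20
mean_before, sd_before = 88.5, 11.2
mean_after, sd_after = 105.0, 15.0
claimed_ratio = 1.10  # "10% more than before"

# Observed gap between the after-mean and 110% of the before-mean.
difference = mean_after - claimed_ratio * mean_before

# Standard error under the independence assumption (NOT a paired analysis).
standard_error = math.sqrt(
    sd_after**2 / n_customers + (claimed_ratio * sd_before) ** 2 / n_customers
)

t_statistic = difference / standard_error
print(round(difference, 2), round(t_statistic, 4))
```

The statistic would then be compared with a t critical value from a table. With the paired differences available, a paired t-test on those differences would be the more appropriate analysis.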

Part 4 Simulation (10 marks)

Consider the following experimental design definitions:

simulations : Number of samples you repeatedly take - for all Part 4, Q2 we set this number equal to 10000 , i.e., you have 10000 samples. If you have trouble understanding
this, perhaps it is time to rewatch the lecture recordings/materials.

n : Number of observations per sample, this will be given in the question as we will experiment with different values of n .

PMF(Y): Is the probability mass function that the random variable Y follows (please check Lecture 2 and Tutorial 2). Similar to n , we can experiment with different settings for
PMF(Y).

Random Variables (RVs) Y1, Y2, …, Yn ∼ PMF(Y) : All the random variables in the sample (observation RVs) will follow the distribution set out by the PMF. Again, the
number of observations n as well as the distribution PMF(Y) have not been set here but will be given in the questions.

Question 1: Theoretical Set-up for the CLT (No Coding or Simulation here!) (2 Marks)
Before simulating CLT, we must first establish what we would want to see from the simulation, i.e., what the theory tells us. Thus, we are going to set up the experiment here as well as
set up our expectation for the (1) Summation Distribution $\sum_{i=1}^{n} Y_i$, and (2) Mean Distribution $\bar{Y} \equiv \frac{\sum_{i=1}^{n} Y_i}{n}$.

We will consider one of the possible set-ups for the distribution PMF(Y) as shown below. Additionally, we will also consider three different values for n , namely nSmall = 5 ,
nMedium = 30 , nBig = 100 .

Simply, we would like to obtain the distribution for (1) and (2) with each pair of n , and PMF(Y) that we set here. Again, please revisit the lecture materials if you have any doubts
since we have done a live presentation of this in our unit. Please put down your results up to five decimal places as we would like to compare this result with the simulation results later.

y            1     2     3     4     5
Pr(Y = y)    0.35  0.05  0.15  0.05  0.4

ANSWER
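
A sketch of the theoretical set-up, assuming the standard CLT approximations: the sum is approximately N(n·μ, n·σ²) and the mean is approximately N(μ, σ²/n), where μ and σ² are the mean and variance of the given PMF. The function and constant names are illustrative.

```python
PMF = {1: 0.35, 2: 0.05, 3: 0.15, 4: 0.05, 5: 0.4}
SAMPLE_SIZES = {"nSmall": 5, "nMedium": 30, "nBig": 100}


def pmf_mean_variance(pmf):
    """Mean and variance of a discrete distribution given as {value: probability}."""
    mean = sum(y * p for y, p in pmf.items())
    variance = sum((y - mean) ** 2 * p for y, p in pmf.items())
    return mean, variance


def clt_parameters(pmf, n):
    """(mean, variance) of the approximating normal for the sum and for the mean."""
    mean, variance = pmf_mean_variance(pmf)
    return {"sum": (n * mean, n * variance), "mean": (mean, variance / n)}


for label, n in SAMPLE_SIZES.items():
    for kind, (mu, var) in clt_parameters(PMF, n).items():
        print(f"{label} (n = {n}), {kind}: N({mu:.5f}, {var:.5f})")
```

The second argument printed for each normal distribution is a variance, not a standard deviation.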

Question 2: Simulating the CLT result (NO LIBRARIES ALLOWED) (8 Marks)

After finishing Question 1, you should have collected the theoretical results. In this question, you will use these theoretical results to compare with the simulation results and verify
the CLT. As you should know by now, the CLT is based on the idea of repeated sampling. Thus, please simulate your results accordingly under the given PMF(Y) and the three
sample sizes n for the two distributions (1) and (2). The number of pairings is the same as in Question 1, since we would like to compare the simulations with the theoretical values.

For each pair of n, PMF(Y) under each distribution (1) and (2), you are required to display a histogram to represent the results of repeated sampling, and a curve to display the
theoretical results from Question 1. Explain your findings and results (no more than 150 words).

Instructions for plots (MUST FOLLOW) : The marking for this question also includes the cleanliness of your plots (proper labels for axes, name of the plot must include
the type of sampling distribution, and the sample size that you are using, e.g. Mean Distribution: n = 30 ). The theoretical values and simulated values need to be presented
accordingly for ease of comparison - you must put these values in the legends.

Instructions for code (MUST FOLLOW) : The code needs to be elegant (do not hard code) with enough comments describing what you want to do. Furthermore,
the naming of the variables needs to make sense. If you need to use a chunk of code more than once, please write a function for it; we will deduct marks if you copy and
paste your code here and there. As specified from the beginning, please report your results to 5 decimal places so we can compare and assess the theoretical results of the CLT and its
simulation.

ANSWER
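
A minimal sketch of the repeated-sampling step in Python. It assumes that "no libraries" still permits the standard-library `random` module. It only compares simulated and theoretical moments, because drawing the required histograms would need a plotting library. All names are illustrative.

```python
import random

PMF = {1: 0.35, 2: 0.05, 3: 0.15, 4: 0.05, 5: 0.4}
SIMULATIONS = 10000


def draw(pmf, rng):
    """Draw one value from the PMF by inverting its cumulative distribution."""
    u = rng.random()
    cumulative = 0.0
    for value, probability in pmf.items():
        cumulative += probability
        if u < cumulative:
            return value
    return value  # guards against floating-point round-off in the last step


def simulate(pmf, n, simulations, seed=0):
    """Return the sample sums and sample means of `simulations` samples of size n."""
    rng = random.Random(seed)
    sums = [sum(draw(pmf, rng) for _ in range(n)) for _ in range(simulations)]
    means = [total / n for total in sums]
    return sums, means


def mean_and_variance(values):
    """Sample mean and unbiased sample variance."""
    m = sum(values) / len(values)
    v = sum((x - m) ** 2 for x in values) / (len(values) - 1)
    return m, v


# Theoretical moments of a single observation.
mu = sum(y * p for y, p in PMF.items())
sigma2 = sum((y - mu) ** 2 * p for y, p in PMF.items())

for n in (5, 30, 100):
    sums, means = simulate(PMF, n, SIMULATIONS)
    sum_mean, sum_var = mean_and_variance(sums)
    mean_mean, mean_var = mean_and_variance(means)
    print(f"n = {n}")
    print(f"  sum : simulated ({sum_mean:.5f}, {sum_var:.5f})"
          f" vs theory ({n * mu:.5f}, {n * sigma2:.5f})")
    print(f"  mean: simulated ({mean_mean:.5f}, {mean_var:.5f})"
          f" vs theory ({mu:.5f}, {sigma2 / n:.5f})")
```

Fixing the seed makes the run reproducible. A different seed gives slightly different simulated values that should still sit close to the theoretical ones.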

Part 5 Linear Regression - The Consciousness Metre Challenge (45 Marks)
