0% found this document useful (0 votes)

8 views

Analysing Data of Stats

Uploaded by

spheyenangobese22

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Analysing Data of Stats

Uploaded by

spheyenangobese22

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Activity #14: Sampling distributions and the Central Limit Theorem

So far, this unit has focused on distributions of discrete and continuous random variables.
In this activity, we’ll investigate sampling distributions — distributions of statistics.

Scenario: We want to know the average age of all cloned sheep that exist right now.
We don’t know how many cloned sheep exist, but we are able to get samples of sheep delivered to us.

Unbeknownst to us, the entire population consists of 5 cloned sheep with ages 10, 11, 12, 13, 14 months.

1. Using R, I input the ages of the sheep with the code: sheep <-‐ c(10, 11, 12, 13, 14).
I then calculated population parameters that we do not know in this scenario: μ = 12 and σ = 1.414.

To estimate μ, the unknown average age of all cloned sheep, we decide to do the following:
• Take a sample of n sheep from the population
• Calculate the average from each sample
• Repeat this process many times and sketch a distribution of the averages we calculate from our samples

Suppose we go through this process with a sample size of n=1 sheep. We first sample one sheep and then
calculate it’s “average” age. We then take another sample (possibly getting the same sheep) and calculate an
average.

a) What possible averages could we get? Sketch a dotplot of those averages: (n=1)
10 11 12 13 14

b) Suppose we sample n=1 sheep 25,000 times. What would the distribution of all those sample means look like?

c) The mean of all 25,000 sample means should be: ________________________________________________________

d) Suppose we decide to sample n = 2 sheep.

How many different averages could we get from samples of two sheep? ________________________________

e) Sketch a dotplot of all possible averages we could get from n=2 sheep: (n=2)
10 11 12 13 14

f) Suppose we sample n=2 sheep 25,000 times. What would the distribution of all those sample means look like?

g) The mean of all 25,000 sample means should be: ________________________________________________________

2. I simulated this process 25,000 times for sample sizes of n=1, 2, 3, 4, and 5 sheep. Fill-in-the-blanks:

Mean of sample Std. Deviation of Probability of a Probability of an

Sample Distribution of 25,000 means sample means usual event: unusual event:
Size sample means

n=1 11.989 1.4197

__________ __________

n=2 11.994 0.8678

__________ __________

n=3 12.001 0.5774

__________ __________

n=4 12.001 0.3526

__________ __________

n=5 12.000 0.000

__________ __________

3. From these simulations, let’s generalize. If we repeatedly take samples of size n from a population and calculate
the mean of each sample:

a) The expected value of our sample means (i.e., the mean of our means) = ________________________________

b) The standard deviation of our sample means is called the standard error.
If we take a larger sample, the size of our standard error…………………….… DECREASES INCREASES

c) If we take a larger sample, the probability of an unusual sample mean……… DECREASES INCREASES

d) If we take a larger sample, the probability of an usual sample mean……..….. DECREASES INCREASES
If we repeatedly take samples of size n from a population with an unknown distribution and
calculate the mean of each sample,

• The mean of the sample means will equal the population mean

• The standard deviation of the sample means (the standard error) shrinks
as the sample size increases

• We still don’t know what shape the distribution of sample means will have (although, in this
example, it looks like the distribution becomes unimodal and symmetric)

4. Suppose body temperatures for a population of interest follow a normal distribution with μ = 98.5 and σ = 0.75.

a) Suppose we randomly select a single individual from this population. Use a

computer (R, Wolfram Alpha, or the normal distribution applet*) to calculate:

P ( 97.75 < X < 99.25 ) = P (−1 < Z < +1) = ____________________

P ( X < 98 ) = ____________________

b) Suppose we randomly select a sample of n=100 individuals from this population. Circle the correct symbol.

P ( 97.75 < X < 99.25 ) < = > 0.683

P ( X < 98 ) < = > 0.252

c) Sketch the distribution of sample averages we’d get if we repeatedly sampled n=100 individuals.

d) Now calculate the probabilities for a sample of n=100 individuals:

P ( 97.75 < X < 99.25 ) = ____________________

P ( X < 98 ) = ____________________

e) To calculate those probabilities, we assumed the sampling distribution had what kind of shape? ______________

Applet: http://lock5stat.com/statkey/theoretical_distribution/theoretical_distribution.html#normal
To calculate the previous 2 probabilities, we needed to assume the sampling distribution was approximately normal.
Is there a way we can know the shape of the distribution of sample means?

Scenario: Researchers collected data from 4,390 babies born to

mothers in Georgia from 1980-1992. The birth weights of
these babies approximated a normal distribution with a
mean of 3156.3 grams and a standard deviation of 570.44.

I had a computer randomly sample babies from this dataset

and calculate the average weight for each sample.

Data: Adams MM, et. al. The relationship of interpregnancy interval to

I repeated this process 10,000 times to plot the sampling infant birthweight and length of gestation among low-risk women,
distributions. Georgia. Paediatr Perinat Epidemiol. 1997 Jan;11 Suppl 1:48-62

5. Below, I’ve pasted results from my computer simulations. Fill-in-the-blanks to see if these simulated sampling
distributions agree with the theory we’ve derived. Explain why the simulated results do not match the theory
perfectly.

Standard
Sample Mean of sample Theoretical Theoretical
Sampling distribution deviation of
Size means Mean standard error
sample means

2 3161.37 398.382

16 3154.85 141.975

100 3156.30 3156.3 56.933 57.044

It looks like these sampling distributions are approximately normal, but that might be because the population
distribution was approximately normal. What happens if we start with a population that is not normally distributed?
Scenario: The high school GPAs of 556 St. Ambrose freshmen in 2012
are displayed to the right. These GPAs are obviously not
normally distributed (they have a negative skew).

The mean is 3.27 with a standard deviation of 0.5527.

I had a computer randomly sample GPAs from this dataset

and calculate an average. I repeated this 10,000 times and
graphed all 10,000 mean GPAs.

6. Fill-in-the-blanks. Do our theoretical results hold for populations that are not normally distributed?

Standard
Sample Mean of sample Theoretical Theoretical
Sampling distribution deviation of
Size means Mean standard error
sample means

2 3.273 0.391

16 3.269 3.27 0.1371 0.1382

100 3.270 3.27 0.0504 0.0553

7. Under what conditions does it appear as though the distribution of the sample mean will be approximately
normal?

8. Use the following applet to predict the distribution of various sample statistics under various conditions: http://
www.onlinestatbook.com/stat_sim/sampling_dist/index.html.
Central Limit Theorem:
If we repeatedly take samples of size n from a population with an unknown distribution and
calculate the mean of each sample,

• The mean of the sample means will equal the population mean

• The standard deviation of the sample means (the standard error)

shrinks as the sample size increases

• The sampling distribution of sample means will approximate a normal distribution if:
a) The population follows a normal distribution, or
b) We repeatedly take large sample sizes (how large?)

Scenario: A (hypothetical) statistics professor often continues lecturing after

the class period should have ended. Let X = the amount of time
the professor lectures after class should have ended. Suppose
students recorded X each day for several years and found X has a
mean of 5 minutes and a standard deviation of 1.8 minutes.
0 2 4 6 8 10
time

9. Suppose we sample 1, 5, or 25 class days at random. Calculate the following probabilities:

Sample 1 day: Sample 5 days Sample 25 days

μ = and σ = μ = and σ = μ = and σ = ______

P ( X < 5.5 ) = P ( X < 5.5 ) = P ( X < 5.5 ) = __

P ( X > 7 ) = P ( X > 7 ) = P ( X > 7 ) = __

10. Suppose we repeatedly sample 25 days and calculate the average time lecturing. What average represents the
10th percentile of this distribution?

11. Complete the following:

If you sample one day, 0.95 = P ( _ ≤ X ≤ _ )

If you sample 100 days, 0.95 = P ( _ ≤ X ≤ _ )

12. What sample size would we need in order for 0.95 = P ( 4.5 ≤ X ≤ 5.5 )

EFSAS 6. Developed Slides. 04
100% (1)
EFSAS 6. Developed Slides. 04
49 pages
Detailed Project Report: Suraj Product Limited
100% (1)
Detailed Project Report: Suraj Product Limited
162 pages
Use of Robotics and Automation in Construction
No ratings yet
Use of Robotics and Automation in Construction
5 pages
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
From Everand
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
Joseph George Caldwell
No ratings yet
How To Write A Character Sketch
No ratings yet
How To Write A Character Sketch
2 pages
An Introduction To Translation Studies. NQN
100% (1)
An Introduction To Translation Studies. NQN
128 pages
Sampling Distributions of Sample Means
No ratings yet
Sampling Distributions of Sample Means
7 pages
Lecture 9
No ratings yet
Lecture 9
14 pages
Lecture Transcript 3 (Sampling and Sampling Distribution)
No ratings yet
Lecture Transcript 3 (Sampling and Sampling Distribution)
5 pages
Central Limit Theorem Grade 11 Group 4
No ratings yet
Central Limit Theorem Grade 11 Group 4
7 pages
Lecture 3 - Sampling-Distribution & Central Limit Theorem
No ratings yet
Lecture 3 - Sampling-Distribution & Central Limit Theorem
5 pages
Stat - Prob 11 - Q3 - SLM - WK6-8
No ratings yet
Stat - Prob 11 - Q3 - SLM - WK6-8
34 pages
Statistics for Economists Lecture V
No ratings yet
Statistics for Economists Lecture V
37 pages
Lecture Transcript 3 (Sampling and Sampling Distribution)
No ratings yet
Lecture Transcript 3 (Sampling and Sampling Distribution)
5 pages
Sampling Distribution With CLT
No ratings yet
Sampling Distribution With CLT
22 pages
Notes STA408 - Chapter 2 PDF
No ratings yet
Notes STA408 - Chapter 2 PDF
4 pages
Chapter 1 - Comparing Normal Populations
No ratings yet
Chapter 1 - Comparing Normal Populations
39 pages
Sample: It Consists of One or More Data Drawn From The Population
No ratings yet
Sample: It Consists of One or More Data Drawn From The Population
23 pages
Central Limit Theorem
100% (3)
Central Limit Theorem
38 pages
Statistics M6
No ratings yet
Statistics M6
18 pages
Math Final
No ratings yet
Math Final
6 pages
MGMT 222 Ch. III
No ratings yet
MGMT 222 Ch. III
10 pages
And Estimation Sampling Distributions: Learning Outcomes
No ratings yet
And Estimation Sampling Distributions: Learning Outcomes
12 pages
And Estimation Sampling Distributions: Learning Outcomes
No ratings yet
And Estimation Sampling Distributions: Learning Outcomes
12 pages
Sampling: The Act of Studying Only A Segment or Subset of The Population Representing The Whole
No ratings yet
Sampling: The Act of Studying Only A Segment or Subset of The Population Representing The Whole
42 pages
Lecture 12
No ratings yet
Lecture 12
8 pages
Chapter 5
No ratings yet
Chapter 5
21 pages
Statistics Unit 6 Notes
No ratings yet
Statistics Unit 6 Notes
10 pages
Normal Prob - Sampling Distr and Estimation-2022
No ratings yet
Normal Prob - Sampling Distr and Estimation-2022
27 pages
PROBABILITY & STATISTICAL ANALYSIS
No ratings yet
PROBABILITY & STATISTICAL ANALYSIS
28 pages
Sampling Distribution
No ratings yet
Sampling Distribution
13 pages
Chapter 4
No ratings yet
Chapter 4
20 pages
Bbs14ege ch07 Sampling Distributions
No ratings yet
Bbs14ege ch07 Sampling Distributions
47 pages
Module 2 - Sample - Afterclass
No ratings yet
Module 2 - Sample - Afterclass
36 pages
SEM, Sampling Distribution, Central Limit Theorem
No ratings yet
SEM, Sampling Distribution, Central Limit Theorem
33 pages
CH I - Sampling and Sampling Distributions (6)
No ratings yet
CH I - Sampling and Sampling Distributions (6)
13 pages
STAT 206 - Chapter 7 (Sampling Distributions)
No ratings yet
STAT 206 - Chapter 7 (Sampling Distributions)
32 pages
Sampling Distribution
100% (3)
Sampling Distribution
13 pages
AdHStat1 3notes
No ratings yet
AdHStat1 3notes
10 pages
Unit 6 Notes
No ratings yet
Unit 6 Notes
13 pages
Chapter 2 : Sampling Distribution: - Sample Mean and Proportion
No ratings yet
Chapter 2 : Sampling Distribution: - Sample Mean and Proportion
18 pages
Stat T 3
100% (2)
Stat T 3
39 pages
Statistic and Probability
No ratings yet
Statistic and Probability
15 pages
Chapter 8
No ratings yet
Chapter 8
59 pages
Hypothesis Testing 23.09.2023
No ratings yet
Hypothesis Testing 23.09.2023
157 pages
STAT 410 Chapter 07 PPT Sem 231
No ratings yet
STAT 410 Chapter 07 PPT Sem 231
18 pages
central limit
No ratings yet
central limit
3 pages
Finding The Mean and Variance of The Sampling Distribution of Means
100% (1)
Finding The Mean and Variance of The Sampling Distribution of Means
25 pages
Q1W3 The Central Limit Theorem
No ratings yet
Q1W3 The Central Limit Theorem
80 pages
Sampling Distribution
No ratings yet
Sampling Distribution
20 pages
Sampling Probability Distributions
No ratings yet
Sampling Probability Distributions
5 pages
Module 6
No ratings yet
Module 6
12 pages
Lecure 5 (Sampling Distribution)
No ratings yet
Lecure 5 (Sampling Distribution)
24 pages
Math11 SP Q3 M7
No ratings yet
Math11 SP Q3 M7
16 pages
Stats Lecture 07. Sample Distribution
No ratings yet
Stats Lecture 07. Sample Distribution
36 pages
L5 Notes
No ratings yet
L5 Notes
51 pages
Sampling and Sampling Distributions (Autosaved)
0% (1)
Sampling and Sampling Distributions (Autosaved)
74 pages
Lecture #15 - Module 5
No ratings yet
Lecture #15 - Module 5
8 pages
6.1 Central Limit Theorem
No ratings yet
6.1 Central Limit Theorem
4 pages
Sampling technique and sampling distribution
No ratings yet
Sampling technique and sampling distribution
47 pages
Sampling
No ratings yet
Sampling
50 pages
Normal Distribution & CLT
No ratings yet
Normal Distribution & CLT
3 pages
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
Statistics II Essentials
From Everand
Statistics II Essentials
Emil Milewski
2.5/5 (1)
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Tutorial Questions
No ratings yet
Tutorial Questions
9 pages
FM Ch4
No ratings yet
FM Ch4
29 pages
2LAW101 Capacity to Act
No ratings yet
2LAW101 Capacity to Act
15 pages
Sec 3.7 - Odds
No ratings yet
Sec 3.7 - Odds
9 pages
Chapter 3 Solutions
No ratings yet
Chapter 3 Solutions
10 pages
Chapter 2
No ratings yet
Chapter 2
4 pages
Chapter 14 Lecture Note 1
No ratings yet
Chapter 14 Lecture Note 1
9 pages
Chapter 12 Part2
No ratings yet
Chapter 12 Part2
76 pages
Chapter 9 (Independent Means Only) UPDATED!!!
No ratings yet
Chapter 9 (Independent Means Only) UPDATED!!!
27 pages
PPE Summary 08 August 2024
No ratings yet
PPE Summary 08 August 2024
7 pages
Statistics
No ratings yet
Statistics
21 pages
Populatio N Sampl E: Parameters: Statistics
No ratings yet
Populatio N Sampl E: Parameters: Statistics
21 pages
Chapter 14 - Part 3
No ratings yet
Chapter 14 - Part 3
17 pages
Business Plan Chicken Diner
No ratings yet
Business Plan Chicken Diner
32 pages
2018 ILTexas Global Representative Agreement - Indeed
No ratings yet
2018 ILTexas Global Representative Agreement - Indeed
7 pages
Forgetabout Amps - Vincent Gallo
No ratings yet
Forgetabout Amps - Vincent Gallo
5 pages
Thermochemistry Unit Plan
No ratings yet
Thermochemistry Unit Plan
18 pages
English Form 3 Sameeco 2023
No ratings yet
English Form 3 Sameeco 2023
88 pages
Sex Harassment Complaint Reporting Form 2018
No ratings yet
Sex Harassment Complaint Reporting Form 2018
2 pages
ICSE Class X Physics Question Paper 2021 Set I
No ratings yet
ICSE Class X Physics Question Paper 2021 Set I
14 pages
Browns Summer Drinks pb2 pb1
No ratings yet
Browns Summer Drinks pb2 pb1
9 pages
EDVR1000
No ratings yet
EDVR1000
1 page
Design Audio Amplifier PDF
100% (1)
Design Audio Amplifier PDF
9 pages
STF Industrial Boilers
100% (1)
STF Industrial Boilers
20 pages
CASE STUDY 2 - Complete
No ratings yet
CASE STUDY 2 - Complete
6 pages
Bihar Tender 1
No ratings yet
Bihar Tender 1
7 pages
Visual Basic 2015 Tutorial
0% (1)
Visual Basic 2015 Tutorial
11 pages
Whey Protein Isolate Edible Films Incorporated With Essential Oils - Antimicrobial Activity and Barrier Properties
No ratings yet
Whey Protein Isolate Edible Films Incorporated With Essential Oils - Antimicrobial Activity and Barrier Properties
17 pages
Interactive Panel Cornea_Dhruv
No ratings yet
Interactive Panel Cornea_Dhruv
10 pages
Corruption in India: Click To Edit Master Subtitle Style
No ratings yet
Corruption in India: Click To Edit Master Subtitle Style
10 pages
Verbele Neregulate - X
No ratings yet
Verbele Neregulate - X
7 pages
Folder Gluer: Édition: 03.99 Anglais
No ratings yet
Folder Gluer: Édition: 03.99 Anglais
16 pages
Solutions Manual Mate Quim UPR RP
No ratings yet
Solutions Manual Mate Quim UPR RP
119 pages
Carbon Emission Control Using Electrostatic Precipitator
No ratings yet
Carbon Emission Control Using Electrostatic Precipitator
13 pages
10.4324 9781315560854 Previewpdf
No ratings yet
10.4324 9781315560854 Previewpdf
92 pages
Intervention Plan in ENGLISH6
No ratings yet
Intervention Plan in ENGLISH6
4 pages
Lostbelt 5 Intro Part 1: Mashu
No ratings yet
Lostbelt 5 Intro Part 1: Mashu
12 pages
Bill of Quantities
100% (1)
Bill of Quantities
2 pages
Strength and Stiffness Properties
No ratings yet
Strength and Stiffness Properties
36 pages

Analysing Data of Stats

Uploaded by

Analysing Data of Stats

Uploaded by

Activity #14: Sampling distributions and the Central Limit Theorem

c) The mean of all 25,000 sample means should be: ________________________________________________________

d) Suppose we decide to sample n = 2 sheep.

g) The mean of all 25,000 sample means should be: ________________________________________________________

Mean of sample Std. Deviation of Probability of a Probability of an

n=1 11.989 1.4197

n=2 11.994 0.8678

n=3 12.001 0.5774

n=4 12.001 0.3526

n=5 12.000 0.000

a) Suppose we randomly select a single individual from this population. Use a

P ( 97.75 < X < 99.25 ) = P (−1 < Z < +1) = ____________________

P ( 97.75 < X < 99.25 ) < = > 0.683

P ( X < 98 ) < = > 0.252

d) Now calculate the probabilities for a sample of n=100 individuals:

P ( 97.75 < X < 99.25 ) = ____________________

Scenario: Researchers collected data from 4,390 babies born to

I had a computer randomly sample babies from this dataset

Data: Adams MM, et. al. The relationship of interpregnancy interval to

2 3161.37 __________ 398.382 __________

16 3154.85 __________ 141.975 __________

100 3156.30 3156.3 56.933 57.044

The mean is 3.27 with a standard deviation of 0.5527.

I had a computer randomly sample GPAs from this dataset

2 3.273 __________ 0.391 __________

16 3.269 3.27 0.1371 0.1382

100 3.270 3.27 0.0504 0.0553

• The standard deviation of the sample means (the standard error)

Scenario: A (hypothetical) statistics professor often continues lecturing after

9. Suppose we sample 1, 5, or 25 class days at random. Calculate the following probabilities:

Sample 1 day: Sample 5 days Sample 25 days

μ = __________ and σ = ____________ μ = __________ and σ = ____________ μ = __________ and σ = ____________

P ( X < 5.5 ) = __________ P ( X < 5.5 ) = __________ P ( X < 5.5 ) = __________

P ( X > 7 ) = __________ P ( X > 7 ) = __________ P ( X > 7 ) = __________

11. Complete the following:

If you sample one day, 0.95 = P ( _______________ ≤ X ≤ _______________ )

If you sample 100 days, 0.95 = P ( _______________ ≤ X ≤ _______________ )

You might also like

2 3161.37 398.382

16 3154.85 141.975

2 3.273 0.391

μ = and σ = μ = and σ = μ = and σ = ______

P ( X < 5.5 ) = P ( X < 5.5 ) = P ( X < 5.5 ) = __

P ( X > 7 ) = P ( X > 7 ) = P ( X > 7 ) = __

If you sample one day, 0.95 = P ( _ ≤ X ≤ _ )

If you sample 100 days, 0.95 = P ( _ ≤ X ≤ _ )