6. Samples From Populations; Statistical Significance for the Correlation Coefficient a Practical Introduction to Statistical Inference

PSY153 Statistics in

Behavioral Sciences I
Samples from populations / Statistical significance for the correlation
coefficient: A practical introduction to statistical inference

Burak Emre Gürsoy, Ph.D.

Samples from populations
• Samples characterize modern research. Inferential statistical
techniques are required to analyze data from samples.
• A population in statistics is all the scores on a particular
variable and a sample is a smaller set of these scores.
• Random samples are systematically drawn samples in which
each score in the population has an equal likelihood of being
selected.
• Random samples tend to be like the population from which they
are drawn in terms of their mean, variability, and so forth.
• Standard error is the variation in the means of samples drawn
from a population. It is essentially the standard deviation of the
sample means.
Samples from populations
• So far, we have mainly discussed sets of data.
• This was deliberate since most things we have
discussed in previous chapters are applicable to either
samples or populations.
• The next stage is to understand how we can use a
sample of scores to make general statements or draw
general conclusions that apply beyond that sample.
• This is a branch of statistics called inferential
statistics because it is about drawing inferences about
the population from just a sample.
Samples from populations
• A sample is just a small number of scores selected from the entirety of
scores.
• A population is the entire set of scores.
• In other words, a sample is a small set, or a subset, taken from the full set or
population of scores.

Can we generalize from samples?

What can we possibly say about the population based on our knowledge of a
sample?

• The answer is quite a lot if we are prepared to infer information from our
sample. And we have little choice other than to do that since our sample is
all that we know about.
Samples from populations
• In statistical inference, it is generally assumed that samples are
drawn at random from the population.
• Such samples are called random samples from the population.
• A random sample of scores from a population entails selecting
scores in such a way that each score in the population has an
equal chance of being selected.
• In other words, a random sample favors the selection of no
particular scores in the population.
• Although it is not difficult to draw a random sample, it does
require a systematic approach.
Samples from populations
There are several ways of drawing a random sample:
• Put the information about each member of the population on a
slip of paper, put all the slips into a hat, close your eyes, give the
slips a long stir with your hand and finally bring one slip out of
the hat.
• This slip is the first member of the sample; repeat the process to
get the second, third and subsequent members of the sample.
• Technically the slip of paper should be returned to the container
after being selected so it may be selected again. However, this is
not done, largely because with a large population it would
make little difference to the outcome.
Samples from populations
There are several ways of drawing a random sample:
• Number each member of the population.
• Press the appropriate randomization button on your scientific
calculator to generate a random number.
• Apps to generate random numbers are downloadable for your PC,
tablet, or mobile to do the same thing.
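The numbering-and-random-number approach above can be sketched in a few lines of Python. This is only an illustration: the population of 100 numbered members and the seed value are assumptions made for the example, not part of the lecture material.

```python
import random

# Hypothetical population: 100 members numbered 1 to 100.
population = list(range(1, 101))

rng = random.Random(42)  # fixed seed so the example is repeatable

# Sampling without replacement: each member can be picked at most once,
# like drawing slips from the hat and not putting them back.
sample_without = rng.sample(population, k=5)

# Sampling with replacement: the slip goes back into the hat after
# each draw, so the same member may be picked more than once.
sample_with = rng.choices(population, k=5)

print("without replacement:", sample_without)
print("with replacement:   ", sample_with)
```

With a large population the two methods rarely differ in practice, which is why the slip is usually not returned.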
Figure 10.1
Conceptual steps for understanding significance
testing
Table 10.1
Population of 100 scores
There is a population of 100 scores – the mode is 2, the median is 6.00 and the mean is 5.52.
Table 10.2
Means of 40 samples each of five scores taken
at random from the population in Table 10.1

We can calculate the (estimated) standard deviation of these 40 sample means in SPSS,
which gives us a value of 1.6. The standard deviation of sample means has a technical
name, although the basic concept differs only in that it deals with means of samples and
not scores. The special term is standard error.

So, in general, it would seem that sample means are a pretty good estimate of population
means.
Table 10.3
Means of 40 samples each of size 20 taken at
random from the population in Table 10.1

Much the same trends appear with these larger samples but for the following:
● The spread of the sample means is reduced somewhat, and they appear to cluster
closer to the population mean. The minimum value is 4.25 and the maximum value is 6.85.
The overall mean of these samples is 5.33, close to the population mean of 5.52.
● The standard deviation of these means (i.e. the standard error) of larger samples is
smaller. For Table 10.3 the standard deviation is 0.60.
● The distribution of sample means is a steeper curve than for the smaller samples.
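The comparison between samples of five and samples of 20 can be reproduced as a small simulation. The sketch below is an illustration under stated assumptions: the population is a made-up set of 100 scores between 0 and 10 (a stand-in for Table 10.1, whose values are not reproduced here), and the helper name `sample_means` is hypothetical. The exact numbers will therefore differ from those in Tables 10.2 and 10.3.

```python
import random
import statistics

rng = random.Random(1)  # fixed seed so the example is repeatable

# Hypothetical population of 100 scores between 0 and 10
# (NOT the actual scores from Table 10.1).
population = [rng.randint(0, 10) for _ in range(100)]
population_mean = statistics.mean(population)


def sample_means(sample_size, n_samples=40):
    """Return the means of n_samples random samples of sample_size scores."""
    return [
        statistics.mean(rng.sample(population, sample_size))
        for _ in range(n_samples)
    ]


means_5 = sample_means(5)
means_20 = sample_means(20)

# The standard error is estimated as the standard deviation
# of the sample means.
se_5 = statistics.stdev(means_5)
se_20 = statistics.stdev(means_20)

print(f"population mean:          {population_mean:.2f}")
print(f"standard error, n = 5:    {se_5:.2f}")
print(f"standard error, n = 20:   {se_20:.2f}")
```

The larger samples should give the smaller standard error, matching the trend described for Table 10.3.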
Samples from populations
• There is another idea that is fundamental to some branches of
statistics – confidence interval of the mean.
• The smaller the margin of error, the more confident we can be
in the estimate of the population based on the sample.
• Confidence intervals (CIs) are similar in that they give a
range of values, calculated from the sample, which is likely to
contain the actual population mean.
• That is, if we repeatedly draw random samples from a population
and calculate a 95% confidence interval from each one, about 95%
of those intervals will contain the actual population mean.
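The idea of repeated sampling can be checked with a short simulation. This is a sketch under assumptions that are not in the slides: the population is a made-up set of 1,000 scores, the sample size is 30, and the interval uses the normal-curve value 1.96 rather than an exact t value, so it is only approximate for small samples. The function name `ci95_mean` is hypothetical.

```python
import math
import random
import statistics


def ci95_mean(sample):
    """Approximate 95% confidence interval for the mean (normal approximation)."""
    n = len(sample)
    mean = statistics.mean(sample)
    se = statistics.stdev(sample) / math.sqrt(n)  # estimated standard error
    margin = 1.96 * se  # 1.96 cuts off the middle 95% of a normal curve
    return mean - margin, mean + margin


rng = random.Random(7)  # fixed seed so the example is repeatable

# Hypothetical population of 1,000 scores between 0 and 10.
population = [rng.randint(0, 10) for _ in range(1000)]
true_mean = statistics.mean(population)

# Draw many samples and count how often the interval
# contains the actual population mean.
n_trials = 1000
hits = 0
for _ in range(n_trials):
    low, high = ci95_mean(rng.sample(population, 30))
    if low <= true_mean <= high:
        hits += 1

coverage = hits / n_trials
print(f"proportion of intervals containing the population mean: {coverage:.3f}")
```

The proportion should come out close to .95, which is what the "95%" in a 95% confidence interval refers to.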
Samples from populations
A little more jargon:
• The correct term for characteristics of samples such as their
means, standard deviations, ranges and so forth is statistics.
• The same characteristics of populations are called parameters.
• In other words, you use the statistics from samples to
estimate or infer the parameters of the population from
which the sample came.
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
• It is usual to report the statistical significance of correlation
coefficients and many other statistical techniques.
• Statistical significance merely indicates whether your statistical
findings are unlikely to be due to chance alone.
• Samples drawn randomly from a population usually have similar
characteristics to those of the population.
• However, some samples are unlike the population.
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
• The null hypothesis always states that there is no relation between
two variables. Significance testing assesses the validity of the
null hypothesis.
• If our sample falls in the middle 95% of the samples expected when
the null hypothesis is true, we say that our findings are not
statistically significant at the 5% level, and we prefer the null
hypothesis.
• However, if our data sample is in the extreme 5% of samples
assuming that the null hypothesis is true, our sample does not
seem to support the null hypothesis.
• In this case, we prefer the alternative hypothesis and reject the
null hypothesis. We also say that our findings are statistically
significant.
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
• Researchers have correlated two variables for a sample of 20
people.
• They obtained a correlation coefficient of .56.
• The problem is that they wish to generalize beyond this sample
and make statements about the trends in the data which apply
more widely.
• However, their analyses are based on just a small sample which
might not be characteristic of the trends in the population.
• What do they do?
Table 11.1
Imaginary population of 60 pairs of scores with
zero correlation between the pairs
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
• Table 11.1 contains the population of pairs of scores.
• Overall, the correlation between the two variables in this
population is .00.
• That is, there is absolutely no relationship between variable X
and variable Y in the population.
• What happens, though, if we draw many samples of, say, eight
pairs of scores at random from this population and calculate the
correlation coefficients for each sample?
Table 11.2
Two hundred correlation coefficients obtained by
repeatedly random sampling eight pairs of scores
from Table 11.1
• Some of the correlation coefficients are indeed more-or-less zero,
but a few are substantially different from zero.
• In the table there are correlations as large as .81 which would
delight most researchers.
• But this correlation is really due to chance and, in truth, there
is no correlation in the population.
• So even where there is zero relationship in the population, random
samples can have correlations which depart from zero.
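The repeated-sampling exercise behind Table 11.2 can be sketched as a simulation. Everything specific here is an assumption for illustration: the population is a made-up set of 60 pairs built from every combination of 10 X values and 6 Y values (which makes the population correlation exactly zero), not the pairs in Table 11.1, and `pearson_r` is a hand-written helper. The simulated correlations will therefore not match the values in Table 11.2.

```python
import random
from math import sqrt


def pearson_r(pairs):
    """Pearson correlation coefficient for a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    if sxx == 0 or syy == 0:
        # r is undefined when a variable does not vary;
        # treated as 0 in this sketch for simplicity.
        return 0.0
    return sxy / sqrt(sxx * syy)


# Hypothetical population: every combination of 10 X values and
# 6 Y values gives 60 pairs with a correlation of exactly zero.
population = [(x, y) for x in range(10) for y in range(6)]
population_r = pearson_r(population)

rng = random.Random(11)  # fixed seed so the example is repeatable

# Two hundred samples of eight pairs each, sorted from lowest to highest r.
sample_rs = sorted(pearson_r(rng.sample(population, 8)) for _ in range(200))

# 2.5% of 200 is 5, so the five lowest and the five highest
# correlations together make up the extreme 5%.
lower_cutoff = sample_rs[4]
upper_cutoff = sample_rs[-5]

print(f"population r: {population_r:.2f}")
print(f"extreme 2.5% negative: r at or below {lower_cutoff:.2f}")
print(f"extreme 2.5% positive: r at or above {upper_cutoff:.2f}")
```

Even though the population correlation is zero, the cut-offs for the extreme 5% sit well away from zero when each sample has only eight pairs.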
Figure 11.1
Distribution of correlation coefficients presented
in Table 11.2
Statistical significance
for the correlation
coefficient: A practical
introduction to
statistical inference
If the population
correlation is zero (if the
null hypothesis is true)
• The middle 95% of the
distribution of samples are
likely
• Correlations in the
extreme 5% (usually the
extreme 2.5% in each
direction) are unlikely in
these circumstances
Statistical significance
for the correlation
coefficient: A practical
introduction to
statistical inference
If the population
correlation is zero (if the
null hypothesis is true)
• The correlations .81, .76, .72, .68 and .68 and -.80, -.72,
-.71, -.70 and -.69 are in the extreme 5% of correlations
away from zero.
• This extreme 5% is usually
made up of the extreme
2.5% positive correlations
and the extreme 2.5%
negative correlations.
Statistical significance
for the correlation
coefficient: A practical
introduction to
statistical inference
If the population
correlation is zero (if the
null hypothesis is true)
• Therefore, a correlation of
between .68 and 1.00 or
-.69 and -1.00 is in the
extreme 5% of
correlations in our
example.
• This range we describe as
statistically significant.
Statistical significance
for the correlation
coefficient: A practical
introduction to
statistical inference
If the population correlation
is zero (if the null
hypothesis is true)
• Statistical significance simply
means that our sample falls
in the relatively extreme part
of the distribution of samples
obtained if the null
hypothesis of no
relationship between the two
variables is true.
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
• Hypotheses in psychological statistics are usually presented as
antithetical pairs – the null hypothesis and its corresponding
alternative hypothesis.
• The null hypothesis is essentially a statement that there is no
relationship between two variables.
The following are all examples of null hypotheses:
• There is no relationship between brain size and intelligence.
• There is no relationship between gender and income.
• There is no relationship between baldness and virility.
• There is no relationship between children’s self-esteem and that
of their parent of the same sex.
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference

The alternative hypothesis simply states that there is

a relationship between two variables.
In its simplest forms the alternative hypothesis
says things like:
• There is a relationship between the number of years of
education people have and their income.
• There is a relationship between people’s gender and
how much they talk about their emotional problems.
• There is a relationship between people’s mental
instability and their artistic creativity.
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference

In statistics, inferences are based on the characteristics

of the population as defined by the null hypothesis.

To repeat and summarize:

• The null hypothesis is used to define a population in
which there is no relationship between two variables.
• Other characteristics, especially the variability of this
population, are estimated or inferred from the known
sample.
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
• If it is unlikely that the sample comes from the null hypothesis-
based population, the possibility that the null hypothesis is true is
rejected.
• Instead, the view that the alternative hypothesis is true is
accepted.
• That is, the alternative hypothesis that there really is a
relationship is preferred (we never say proven).
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
Table 11.3
Sample of 10 pairs of scores
For Pearson’s correlation coefficient:
• The null hypothesis for research
involving the correlation
coefficient is that there is no
relationship between the two
variables.
• In other words, the null
hypothesis states that the
correlation coefficient between
the two variables is .00 in the
population (defined by the null
hypothesis).
• So, what if, in a sample of 10
pairs of scores, the correlation
is .94 as for the data in Table
11.3? Do we accept or reject the
null hypothesis?
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
Is it likely that such a correlation would occur in a sample if it actually
came from a population where the true correlation is zero?

• We need to know the distribution of correlations based on samples of ten

assuming the null hypothesis.
• This is not a simple task but was done at the time the correlation coefficient
was developed many decades ago.
• Mere mortals like us can use significance tables for the correlation
coefficient calculated way back then.
• All we really need to know is the minimum value which puts a correlation into
the extreme 5% of correlation coefficients.
• This tells us whether or not our correlation coefficient is statistically
significant.
Figure 11.2
Conceptual steps for understanding statistical
significance testing
Significance Table 11.1
5% significance values of the Pearson correlation coefficient (two-
tailed test). An extended and conventional version of this table is
given in Appendix C
• If the sample’s correlation is smaller than the critical value
required, then we accept the null hypothesis that there is no
relationship between the two variables.
• By accept, we mean that in the absence of any other information or
considerations, the null hypothesis cannot be rejected.

So, correlations which are smaller than the critical value are described as being statistically non-significant.
Significance Table 11.1
5% significance values of the Pearson correlation coefficient (two-
tailed test). An extended and conventional version of this table is
given in Appendix C
• However, if the correlation is equal to or larger than the critical
value then it is in the extreme 5% of correlations. In this case the
alternative hypothesis is accepted (that there is a relationship
between the two variables).
• Correlations equal to or larger than the critical value are
described as being statistically significant. That is, we accept the
alternative hypothesis that there is a relationship.
Statistical significance
for the correlation
coefficient: A practical
introduction to
statistical inference
• Significance Table 11.1
indicates that for a sample
size of 10, a correlation has to
be between -.63 and -1.00 or
between .63 and 1.00 to be
sufficiently large as to be in
the extreme 5% of
correlations which support the
alternative hypothesis.
• Correlations closer to .00 than
these come in the middle
95%, which supports the null
hypothesis.
• So, our correlation of .94
based on a sample of 10 is
clearly statistically significant.
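The decision rule above amounts to comparing the size of the sample correlation with a critical value. The sketch below shows one way such critical values can be obtained, assuming the standard relationship between Pearson's r and the t distribution under the null hypothesis, r = t / sqrt(t² + df) with df = n - 2. The two-tailed 5% values of t (2.306 for df = 8 and 2.101 for df = 18) are taken from standard t tables, and the names `T_CRITICAL_05`, `critical_r` and `is_significant` are hypothetical helpers, not part of the lecture material.

```python
from math import sqrt

# Two-tailed 5% critical values of t from standard tables, keyed by
# degrees of freedom (df = sample size minus 2). Only the two sample
# sizes used in this lecture are included.
T_CRITICAL_05 = {8: 2.306, 18: 2.101}


def critical_r(n):
    """Smallest |r| that is significant at the 5% level (two-tailed) for n pairs."""
    df = n - 2
    t = T_CRITICAL_05[df]
    return t / sqrt(t * t + df)


def is_significant(r, n):
    """True if r is in the extreme 5% of correlations for a sample of n pairs."""
    return abs(r) >= critical_r(n)


print(f"critical r for n = 10: {critical_r(10):.2f}")
print(f"r = .94, n = 10 significant? {is_significant(0.94, 10)}")
print(f"critical r for n = 20: {critical_r(20):.2f}")
print(f"r = .56, n = 20 significant? {is_significant(0.56, 20)}")
```

For a sample of 10 this gives the same critical value of .63 as Significance Table 11.1. Under the same assumptions, the earlier example of a correlation of .56 in a sample of 20 would also count as statistically significant at the 5% level.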
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
Figure 11.3
Type I and Type II errors
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
Significance Table 11.2
5% significance values of the Spearman correlation coefficient
(two-tailed test). An extended and conventional version of this table is
given in Appendix D
For Spearman’s
rho correlation
coefficient
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
Interpreting the results
Since our obtained value of the Spearman’s rho correlation
coefficient is in the range of significant correlations, we accept the
alternative hypothesis that mathematical and musical scores are
(inversely) related and reject the null hypothesis.
Reporting the results
We can report a significant correlation: ‘There is a negative
correlation of -.89 between mathematical and musical scores
which is statistically significant at the 5% level with a sample size
of 10.’ Alternatively, following the APA (2010) Publication Manual
recommendations we could write something like:
Statistical significance for the correlation coefficient:
A practical introduction to statistical inference
Mathematical scores were significantly negatively correlated with
musical scores, rs(8) = -.89, p < .05. The APA manual uses the
degrees of freedom, which are given in brackets. The value of the
degrees of freedom is the sample size minus 2 for Spearman’s
rho.
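The reporting format above can be produced by a small helper. This is only a convenience sketch: the function name `apa_correlation` and its parameters are hypothetical, and it simply follows the pattern shown in the example (degrees of freedom equal to the sample size minus 2, no zero before the decimal point).

```python
def apa_correlation(r, n, symbol="rs", alpha=0.05):
    """Format a significant correlation in the style 'rs(8) = -.89, p < .05'."""
    df = n - 2  # degrees of freedom: sample size minus 2

    def no_leading_zero(value):
        # Correlations and p values cannot exceed 1, so the zero
        # before the decimal point is dropped.
        return f"{value:.2f}".replace("0.", ".", 1)

    return f"{symbol}({df}) = {no_leading_zero(r)}, p < {no_leading_zero(alpha)}"


report = apa_correlation(-0.89, 10)
print(report)
```

For a Pearson correlation the same helper could be called with `symbol="r"`.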
