Chapter 1 the Nature of Probability and Statistics Updated Spring 2023-2024
Chapter 1 the Nature of Probability and Statistics Updated Spring 2023-2024
Chapter 1
The nature of probability
and statistics
2
Chapter Outline
• 1.1 Introduction
• 1.2 Descriptive and Inferential Statistics
• 1.3 Variables and Types of Data
• 1.4 Data Collection and Sampling Techniques
• 1.5 Observational and experimental Studies
3
Objectives
After completing this chapter, you should be able to:
5
What is Statistics?
9
Population size vs sample size
Population Sample
There are two types of A sample size is the
populations number of observations
1- Finite population and the size of the or units in the sample
populations usually denoted by (N) and usually denoted by
2- Infinite population ( Population size (n)
unknown)
.
10
Important Terms
Parameter Statistic
Any measure obtained from a Any measure calculated
population is called a from a sample is called a
parameter statistic
Example Example
Average gross income of all people 2002 gross income of
in the United States in 2002. people from a sample of
three states.
11
Important Terms
To gain knowledge, statisticians collect information for
variables
13
Two Branches of Statistics
Descriptive Statistics
Consists of the collection, organization,
summarization, and presentation of data
Example
The national census conducted by the U.S.
government every 10 years.
Inferential Statistics
Consists of generalizing from
samples to populations, performing
estimations and hypothesis tests,
determining relationships among variables,
and making predictions .inferential techniques
Are based on probability theory. 14
Section 1.3
Variables and
types of data
15
A variable can be classified according to
the type and the level of measurement
Type of Level of
measurement measurement
Qualitative Nominal
Data
Ordinal
Quantitative
Data
Interval
Ratio
16
Types of variables
Discrete Continuous
1- Have gaps between values (in most 1-Can assume all values between any two
cases Values are integers ) specific values .Possibilities are decimals
2-Limited Number of values 2-Have an unlimited number of possibilities
3-Results from counting 3-Results from measuring
Example: Number of kids in the house , Example: Weight , Height , Age, Distance
number of pencils on the table between Home and University , speed17
Types of Variables
Qualitative (or categorical) data consists of
• attributes
• labels
• non-numerical entries. i.e The possibilities are
names , categories (text)
Examples: The color of shirts, the type of car, the
condition of a patient.
20
21
Levels of Measurement
1- Nominal-level data ( lowest level)
22
Levels of Measurement
2- Ordinal (it is the next higher level of data)
23
Levels of Measurement
3- Interval (it is the third level of measurement)
• Quantitative data
• Data can be ordered
• Differences between data entries are meaningful
• Meaningless zero : Zero represents a position on
• a scale (not an inherent zero – zero does not imply
“none”) It does not mean the absence of the
characteristic.
25
Difference between
Interval Example: Temperature is a an interval level
variable since there is a meaningful difference of
1° F between each unit, such as 72 and 73° F.
Ranks data and precise
differences between
One property is lacking: there is no true zero :
data do exist however for example 0 ° F does not mean no temperature
there is no meaningful at all.
zero Other examples : IQ ,shoe size
29
1.4.1 Data collection
31
1- Simple random sample
32
2- Systematic sample
• Subjects are selected by using every kth number after the first subject
is randomly selected through 1 to k .
33
3- Stratified sampling
• Samples are obtained by dividing the population into groups according to
some characteristic that is important to the study, then sampling from each
group.
Example 1: You could stratify (group) your college population by department
and then choose a proportionate simple random sample from each stratum
(each department) to get a stratified random sample. To choose a simple
random sample from each department, number each member of the first
department, number each member of the second department, and do the same
for the remaining departments. Then use simple random sampling to choose
proportionate numbers from the first department and do the same for each of
the remaining departments.
Example 2: To collect a stratified sample of the people who live in Ajman ,you
could divide the household in to socioeconomic levels and then randomly
select households
34
4- Cluster sampling
• Samples are obtained by using intact groups (already-
formed group) called clusters.
• Intact meaning :choose all members from selected clusters
Example: If you randomly sample four departments from your
college population, the four departments make up the cluster
sample. Divide your college faculty by department. The
departments are the clusters. Number each department, and
then choose four different numbers using simple random
sampling. All members of the four departments with those
numbers are the cluster sample.
35
Section 1.5
Observational and
experimental Studies
36
Statistical studies are classified as:
Observational and Experimental studies
Observational study
• The researcher observes what is happening or what has happened in
the past and makes conclusions based on these observations.
Example: Data were collected on incomes and ages of motorcycle
owners for the years 1980 and 1998 and then compared ,the findings
showed considerable changes in ages and incomes of motorcycle
owners. Conclusion: motorcycle owners are getting older and richer.
Experimental study
• The researcher manipulates one of the variables and tries to find how
the manipulation affects other variables
Example: applying treatment to individuals and attempts to study the
effect of this treatment on the response (
37
Summary
Statistics, Why?
• Definitions: Statistics , Variable, Sample, Population, Data.
• Classification of statistics: Descriptive and Inferential
Statistics.
• Types of variables: Qualitative and Quantitative (discrete
and continuous).
• Measurement scale (nominal, ordinal, interval, ratio)
• Methods of data collection: Telephone surveys, Mailed
surveys, Personal interview , Surveying records, Direct
observation.
• Sampling methods: Random, systematic, stratified, clustered
38