Ae 9 - Chapter 1-2
Ae 9 - Chapter 1-2
INTRODUCTION TO THE
STATISTICAL
CONCEPTS
Objectives:
• Define statistics.
limitations of statistics
statistics.
continuous variables.
measurement of a variable.
Statistics plays a major role in many aspects of our
in any conclusions.
evidence.
making.
definition?
Field of Statistics
in the abstract.
Statistics is indispensable to almost all sciences - social, physical and natural. It is very
often used in most of the spheres of human activity. In spite of the wide scope of the
subject it has certain limitations. Some important limitations of statistics are the
following:
Limitation of Statistics:
qualitative phenomenon.
studying a problem.
sample.
result.
population
You are walking down the street and notice that a person walking in front of you drops
PHP100. Nobody seems to notice the PHP100 except you. Since you could keep the
money without anyone knowing, would you keep the money or return it to the owner?
Suppose you wanted to use this scenario as a gauge of the morality of students at your
school by determining the percent of students who would return the money. How might
you do this? You could attempt to present the scenario to every student at the school,
but this would be difficult or impossible if the student body is large. A second possibility
is to present the scenario to 50 students and use the results to make a statement about
all the students at the school.
In the PHP100 study presented, the population is all the students at the school. Each
student is an individual. The sample is the 50 students selected to participate in the
study. Suppose 39 of the 50 students stated that they would return the money to the
owner. We could present this result by saying that the percent of students in the survey
who would return the money to the owner is 78%. This is an example of a descriptive
statistic because it describes the results of the sample without making any general
conclusions about the population. So 78% is a statistic because it is a numerical
summary based on a sample. Descriptive statistics make it easier to get an overview of
what the data are telling us.
If we extend the results of our sample to the population, we are performing inferential
statistics. The generalization contains uncertainty because a sample cannot tell us
everything about a population. Therefore, inferential statistics includes a level of
confidence in the results. So rather than saying that 78% of all students would return the
money, we might say that we are 95% confident that between 74% and 82% of all
students would return the money. Notice how this inferential statement includes a level
of confidence (measure of reliability) in our results. It also includes a range of values to
account for the variability in our results. One goal of inferential statistics is to use
statistics to estimate parameters.
PROCESS OF STATISTICS:
the questions.
Example:
each plant.
of the result.
Take Note!
population.
Example:
inferential statistics.
(Descriptive Statistics)
Statistics)
h e r s i x e x am s c o r e s i n A l g e b r a.
(Descriptive Statistics)
tallysheetofelectoralreturns.
(Descriptive Statistics)
For example:
If variables did not vary, they would be constants, and statistical inference would not be
necessary. Think about it this way: If each tomato had the same weight, then knowing
the weight of one tomato would allow us to determine the weights of all tomatoes.
However, the weights of the tomatoes vary.
One goal of research is to learn the causes of the variability so that we can learn to
grow plants that yield the best tomatoes.
It is helpful to divide variables into different types, as different statistical methods are
applicable to each. The main division is into qualitative (or categorical) or quantitative
(or numerical variables).
class or category.
amount or quantity.
Example:
qualitative or quantitative.
1. Haircolor (Qualitative)
2. Temperature (Quantitative)
discrete.
Example:
gas. (Continuous)
LEVELS OF MEASUREMENT
ration scales.
Example:
credit card)
Example:
- Food Preferences
- Severity of Pain
not onlyclassifiesandordersthe
in Sampling DESIGN
Objectives:
secondary data.
sample size.
techniques.
sampling.
Analysis of data can lead to powerful results. Data
from.
Data collection is the process of gathering and measuring information that enables one
to answer stated research questions, test hypotheses, and evaluate outcomes.
set objectives.
5. Collect data.
SOURCES OF DATA
of original research.
Example:
Letters
Diaries
Original Photographs
Autobiographies
p r im a r y s o u r c e. T h e d at a c o l l e ct o r s
persuasive.Theyofteninvolve
Example:
Newspaper article
Biography
TV Documentaries
of the collection of the data by the primary agency may have been different from the
purpose of the user of these secondary data.
i nt e r v i ew e e. T h e r e s e a r c h e r g at h e r s
interviewee.
tools:
analysis of records?
• Of whom will we ask questions and what
t h e t o p i c s uff i c i e nt l y t o d e s i g n a
ourselves?
option.
questions.
Questionnaire
introduction.
respondents.
survey. (Pre-test)
Example:
labor started?
members?
objective data.
Example:
yesterday?
• Eggs. Yes No
variable of interest.
- Radiographic
- biochemical
- X-ray machines
- Microscope
- Clinical examinations
- Microbiological examinations
machines.
periodicals.
SAMPLE SIZE
survey”?
accurate result.
Take Note!
important consideration.
considerations.
Definitions:
population.
Enumeration
- Less Labor
- Reduced Cost
- Greater Speed
- Convenience
1. Probability Sample
c h a n c e m e c h a n i sm, t h u s i n v o l v i n g
sampling frame.
samples.
Sampling Procedure
the population.
- Ask the question, can I generalize to the
population?
sampling.
sample.
• Cluster Sampling
clusters.
• Multi - Stage Sampling
Take Note!
under study
Probability Sampling
a n d f a st e st w a y t o g et r e a ct i o n s
certain issue.
Useful
subjects
1. Non-sampling Error
sample-to-sample variability.
1. Non-responses
2. Interviewer Error
3. Misrepresented Answers
5. Questionnaire Design
6. Wording of Questions
7. Selection Bias
• Slovin’s Formula
Slovin’s formula is used to calculate the sample size n given the population size and
error. It is computed as
Where:
ü Level of Precision
Also called sampling error, the level of precision, is the range in which the true value of
the population is estimated to be. The degree of sampling errors in survey results.
Example:
sample of size.
Example:
Solution:
every kth unit subsequently. Suppose the random number i is 6, then we select 15, 25,
35, 45, .. .
- The individuals within each stratum should be homogeneous (or similar) in some way.
Example:
Solution:
Given:
The sample sizes are 20 from A and 30 from B. Then the units from each institution are
to be selected by simple random sampling