Chap 1
Chap 1
1. Introduction
Statistics is the science of conducting studies to collect, organize, summarize, analyze, and draw
conclusions from data. Or statistics is the methodology for collecting, analyzing, interpreting and drawing
conclusions from information.
Putting it in other words, statistics is the methodology which scientists and mathematicians have developed
for interpreting and drawing conclusions from collected data.
Terminology
Population - the entire group of objects that we are interested in.
Sample - the subset that we actually measure.
Characteristic - the particular measurement we are interested in.
Parameter – a value which summarizes the characteristic of the population.
Statistic – the estimate of the parameter that we make from the sample.
Page 1 of 5
Instructor: Yohannes D (PhD. Cand.)
Continuous variables, by comparison, can assume an infinite number of values in an
interval between any two specific values.
Temperature, for example, is a continuous variable, since the variable can assume an
infinite number of values between any two given
temperatures.
There are two major types of statistics depending on how data are used.
1. Descriptive statistics
2. Inferential statistics
The branch of statistics devoted to the summarization and description of data is called descriptive
statistics and the branch of statistics concerned with using sample data to make an inference
about a population of data is called inferential statistics.
Descriptive statistics consists of the collection, organization, summarization, and presentation of
data.
Descriptive statistics includes the construction of graphs, charts, and tables, and the calculation
of various descriptive measures such as averages, measures of variation, and percentiles.
Descriptive statistics involves methods of organizing, picturing and summarizing information
from data.
1. Descriptive Statistics
Descriptive statistics is the type of statistics that probably springs to most people’s minds when
they hear the word “statistics.” Here the goal is to describe.
Numerical measures are used to tell about features of a set of data. There are a number of items
that belong in this portion of statistics, such as:
The average, or measure of center, consisting of the mean, median, mode or midrange.
The spread of a data set, which can be measured with the range or standard deviation.
Overall descriptions of data such as the five number summary.
Other measurements such as skewness and kurtosis.
The exploration of relationships and correlation between paired data.
The presentation of statistical results in graphical form.
2. Inferential Statistics
For the area of inferential statistics we begin by differentiating between two groups. The
population is the entire collection of individuals that we are interested in studying. It is typically
impossible or infeasible to examine each member of the population individually. So we choose a
representative subset of the population, called a sample.
Inferential statistics studies a statistical sample, and from this analysis is able to say something
about the population from which the sample came.
Page 2 of 5
Instructor: Yohannes D (PhD. Cand.)
Inferential statistics involves methods of using information from a sample to draw conclusions
about the population. Inferential statistics includes methods like point estimation, interval
estimation and hypothesis testing which are all based on probability theory.
Descriptive and inferential statistics are interrelated. It is almost always necessary to use methods
of descriptive statistics to organize and summarize the information obtained from a sample before
methods of inferential statistics can be used to make more thorough analysis of the subject under
investigation. Furthermore, the preliminary descriptive analysis of a sample often reveals features
that lead to the choice of the appropriate inferential method to be later used.
1. Clarifying the problem and formulating questions or hypotheses that can be answered with the
data,
2. Designing or creating an appropriate experiment that can collect the data required,
3. Finding and using the appropriate techniques needed to accurately analyze the collected data,
and
4. Interpreting the collected data and results so as to answer the questions and hypotheses that
were proposed in the first place.
The main steps in a statistical investigation are thought to be a cylindrical process that helps ensure
that all procedures are completed logically. The aim of opening a statistical investigation is to
answer the many questions that are present in the world, and is a technique that has been commonly
applied by statisticians.
a). Condensation:
Generally speaking by the word ‘ to condense’ , we mean to reduce or to lessen. Condensation is
mainly applied at embracing the understanding of a huge mass of data by providing only few
observations. Thus, Statistical measures help to reduce the complexity of the data and consequently
to understand any huge mass of data.
b). Comparison:
Classification and tabulation are the two methods that are used to condense the data. They help us
to compare data collected from different sources. Grand totals, measures of central tendency
measures of dispersion, graphs and diagrams, coefficient of correlation etc provide ample scope
for comparison.
As statistics is an aggregate of facts and figures, comparison is always possible and in fact
comparison helps us to understand the data in a better way.
Page 3 of 5
Instructor: Yohannes D (PhD. Cand.)
c). Forecasting:
By the word forecasting, we mean to predict or to estimate beforehand. In business also forecasting
plays a dominant role in connection with production, sales, profits etc. The analysis of time series
and regression analysis plays an important role in forecasting.
d).Estimation:
One of the main objectives of statistics is drawn inference about a population from the analysis for
the sample drawn from that population.
In estimation theory, we estimate the unknown value of the population parameter based on the
sample observations.
Suppose we are given a sample of heights of hundred students in a school, based upon the heights
of these 100 students, it is possible to estimate the average height of all students in that school.
Statistics with all its wide application in every sphere of human activity has its own limitations.
Some of them are given below.
Statistics does not give any specific importance to the individual items, in fact it deals with an
aggregate of objects. Individual items, when they are taken individually do not constitute any
statistical data and do not serve any purpose for any statistical enquiry.
Page 4 of 5
Instructor: Yohannes D (PhD. Cand.)
3. Statistical laws are not exact:
It is well known that mathematical and physical sciences are exact. But statistical laws are not
exact and statistical laws are only approximations. Statistical conclusions are not universally true.
They are true only on an average.
Page 5 of 5
Instructor: Yohannes D (PhD. Cand.)