1 Unnamed 04 01 2024

Statistics is the science of collecting, organizing, summarizing, and interpreting data. It involves gathering raw data, organizing it into tables or diagrams, numerically summarizing the data using measures like the mean, median and mode, analyzing patterns in the data using mathematical formulas, and making inferences about the broader population based on the sample data. Key steps in any statistical study are collecting raw data, tabulating it, representing it pictorially or graphically, summarizing it numerically using measures of central tendency and dispersion, analyzing it mathematically, and drawing a conclusion.

Uploaded by

vanchagarg


Statistics:

Statistics is the branch of science in which we plan, gather and
analyze information about a particular collection of individuals or
objects under investigation.

Statistics has been defined differently by different authors over time.

• "Statistics are numerical statements of facts in any department of
enquiry placed in relation to each other."
- A.L. Bowley

• "Statistics may be defined as the science of collection, presentation,
analysis and interpretation of numerical data."
- Croxton and Cowden

The definition by Croxton and Cowden is generally regarded as the most
scientific and realistic one. According to this definition there are
four stages: collection of data, presentation of data, analysis of data
and interpretation of data.

Basic Steps in a Statistical Study:
For any statistical study, some basic steps are followed once a sample
has been drawn:

• Step 1: Gather first-hand information from the sample. This is called
the raw data.
• Step 2: Tabular representation: organize the raw data in a table.
• Step 3: Pictorial representation: draw diagrams from the tabulated data.
• Step 4: Numerical summary: describe the entire data set with a few
key numbers.
• Step 5: Analyze the data using mathematical formulae.
• Step 6: Draw the final inference or conclusion about the population
under study.

Data Analysis:

• Data can be collected with reference to time, to geographical
location, or to both time and location.

• Any statistical data can be classified into two categories,
depending on the source used:

    • Primary data
    • Secondary data

Primary Data:

Primary data are data collected by the investigator personally for the
purpose of a specific inquiry or study. Such data are original in
character and are generated by surveys conducted by individuals,
research institutions or other organisations.

Example:
If a researcher wants to know the impact of the noon meal scheme on
school children, the researcher has to undertake a survey and collect
the opinions of parents and children by asking relevant questions.
Data collected in this way, for this purpose, are called primary data.

Methods for Collecting Primary Data:

Primary data can be collected by the following five methods:

1. Direct personal interviews
2. Indirect oral interviews
3. Information from correspondents
4. Mailed questionnaire method
5. Schedules sent through enumerators

Secondary Data:

Secondary data are those data which have been already collected
and analyzed by some earlier agency for its own use; and later
the same data are used by a different agency.
Frequency Distribution:
A frequency distribution is an arrangement in which observations with
similar or closely related values are put into separate groups, the
groups being listed in order of magnitude. It is simply a table in
which the data are grouped into classes and the number of cases falling
in each class is recorded. It shows the frequency of occurrence of the
different values of a single phenomenon.

A frequency distribution is constructed for three main reasons:

1) To facilitate the analysis of data.

2) To estimate frequencies of the unknown population distribution
from the distribution of sample data.
3) To facilitate the computation of various statistical measures.
Raw Data or Ungrouped Data:

The statistical data collected are generally raw data or ungrouped data.

Example:
Let us consider the daily wages (in Rs.) of 30 laborers in a factory.

800, 700, 550, 500, 600, 650, 400, 300, 800, 900, 750, 450, 350, 650,
700, 800, 820, 550, 650, 800, 600, 550, 380, 650, 750, 850, 900, 650,
450, 750.
Given a raw data set, we can rearrange it in two different ways.

• Frequency distribution or discrete frequency distribution:
Each distinct value of the variable is listed together with its
frequency. This representation of the data is known as a frequency
distribution.

• Grouped frequency distribution / continuous frequency distribution:
The values are grouped into class intervals, and the frequency of each
class is recorded. This representation is called a grouped frequency
distribution of the variable.

Examples:

(i) Frequency distribution, or discrete frequency distribution
(ii) Grouped frequency distribution
(iii) Continuous frequency distribution

       (i)                (ii)                 (iii)
    x      f          x        f           x        f
    15     2        1-9        3         5-10       2
    17     3       10-19       5        10-15       3
    18     5       20-29      10        15-20       5
    20     4       30-39       4        20-25       4
    22     7       40-49       7        25-30       7
    25     9       50-59       6        30-35       9
    30     3       60-69       3        35-40       3

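
As a rough illustration, the 30 daily wages listed in the raw-data example can be arranged both ways in Python. This is only a sketch: the class width of 100 and the starting point of 300 are assumptions chosen for the example, and each class is taken to include its lower limit and exclude its upper limit.

```python
from collections import Counter

# Daily wages (in Rs.) of the 30 laborers from the raw-data example.
wages = [
    800, 700, 550, 500, 600, 650, 400, 300, 800, 900, 750, 450, 350, 650,
    700, 800, 820, 550, 650, 800, 600, 550, 380, 650, 750, 850, 900, 650,
    450, 750,
]

# Discrete frequency distribution: each distinct value with its frequency.
discrete = sorted(Counter(wages).items())

# Grouped frequency distribution (assumed classes: 300-400, 400-500, ...).
START, WIDTH = 300, 100
counts = Counter((w - START) // WIDTH for w in wages)
grouped = [
    (START + k * WIDTH, START + (k + 1) * WIDTH, counts.get(k, 0))
    for k in range(max(counts) + 1)
]

for value, freq in discrete:
    print(f"{value:>4}  {freq}")
for lower, upper, freq in grouped:
    print(f"{lower}-{upper}  {freq}")
```

Both tables account for all 30 observations; only the level of detail differs.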
Special case in grouped frequency distribution:

If $d$ is the gap between the upper limit of any class and the lower limit
of the succeeding class, the class boundaries for any class are given by:

$$\text{Upper class boundary} = \text{Upper class limit} + \frac{d}{2}$$

$$\text{Lower class boundary} = \text{Lower class limit} - \frac{d}{2}$$
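
The boundary rule can be checked with a few lines of Python. This is a minimal sketch; the function name `class_boundaries` and the sample classes (10-19, 20-29, 30-39, taken from the grouped example table) are chosen only for illustration, and the gap $d$ is assumed to be the same between every pair of adjacent classes.

```python
def class_boundaries(limits):
    """Convert (lower limit, upper limit) pairs into class boundaries.

    Assumes the classes are in ascending order with a constant gap d
    between one class's upper limit and the next class's lower limit.
    """
    d = limits[1][0] - limits[0][1]  # gap between adjacent classes
    return [(lower - d / 2, upper + d / 2) for lower, upper in limits]


classes = [(10, 19), (20, 29), (30, 39)]
boundaries = class_boundaries(classes)
print(boundaries)  # [(9.5, 19.5), (19.5, 29.5), (29.5, 39.5)]
```

Here $d = 20 - 19 = 1$, so each limit moves by 0.5 and adjacent classes now meet at a common boundary.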
Summarizing a raw data set or an organized data set
There are two basic properties of a quantitative data set that are
commonly studied. These are central tendency and variability (or
dispersion).

Central Tendency: Quite often the entries in a data set cluster around
a central (or middle) value. This behavior of the data set is called
the central tendency. The main challenge is to locate the central value
around which the clustering takes place.

Three standard measures of the location of central tendency are:
* Mean
* Median
* Mode

Variability or Dispersion: The variability or dispersion of a data set
is the extent to which the data entries differ from one another.
Several measures of dispersion are in common use:

* Range
* Quartile deviation
* Variance
* Standard deviation
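
To make these measures concrete, the sketch below computes them for the 30 daily wages from the raw-data example using Python's standard `statistics` module. Two choices here are assumptions, not something the notes specify: the population formulas (`pvariance`, `pstdev`) are used for variance and standard deviation, and the quartiles come from `statistics.quantiles` with its default method, which is only one of several conventions. The quartile deviation is taken as $(Q_3 - Q_1)/2$.

```python
import statistics

# Daily wages (in Rs.) of the 30 laborers from the raw-data example.
wages = [
    800, 700, 550, 500, 600, 650, 400, 300, 800, 900, 750, 450, 350, 650,
    700, 800, 820, 550, 650, 800, 600, 550, 380, 650, 750, 850, 900, 650,
    450, 750,
]

# Measures of central tendency.
mean = statistics.fmean(wages)
median = statistics.median(wages)
mode = statistics.mode(wages)

# Measures of dispersion.
data_range = max(wages) - min(wages)
q1, q2, q3 = statistics.quantiles(wages, n=4)  # default quartile convention
quartile_deviation = (q3 - q1) / 2
variance = statistics.pvariance(wages)  # population variance (assumed)
std_dev = statistics.pstdev(wages)      # population standard deviation

print(f"mean={mean:.2f} median={median} mode={mode}")
print(f"range={data_range} QD={quartile_deviation:.2f}")
print(f"variance={variance:.2f} sd={std_dev:.2f}")
```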
1. Arithmetic Mean or Average

• The mean of $n$ observations $x_1, x_2, \dots, x_n$ is given by

$$\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i .$$

• In the case of a discrete frequency distribution:

If the $f_i$ are the frequencies of the values $x_i$, then the mean is

$$\bar{x} = \frac{1}{n} \sum_{i} f_i x_i , \quad \text{where } n = \sum_{i} f_i .$$

• In the case of a continuous frequency distribution:

If the $f_i$ are the frequencies of the class intervals, then the mean is

$$\bar{x} = \frac{\sum_{i} (\text{mid-value of class } i) \times f_i}{\sum_{i} f_i} .$$
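
The three formulas can be sketched in Python as follows. The function names are hypothetical, and the sample data are the frequencies from example tables (i) and (iii) of the frequency distribution examples; nothing here is meant as more than a check of the arithmetic.

```python
def mean_raw(xs):
    """Mean of ungrouped observations: sum of x_i divided by n."""
    return sum(xs) / len(xs)


def mean_discrete(xs, fs):
    """Mean of a discrete frequency distribution: sum(f_i * x_i) / sum(f_i)."""
    return sum(f * x for x, f in zip(xs, fs)) / sum(fs)


def mean_continuous(classes, fs):
    """Mean of a continuous distribution, using class mid-values as x_i."""
    mids = [(lower + upper) / 2 for lower, upper in classes]
    return mean_discrete(mids, fs)


# Table (i): discrete values with their frequencies.
x = [15, 17, 18, 20, 22, 25, 30]
f = [2, 3, 5, 4, 7, 9, 3]

# Table (iii): continuous classes with the same frequencies.
classes = [(5, 10), (10, 15), (15, 20), (20, 25), (25, 30), (30, 35), (35, 40)]

print(mean_raw([4, 7, 10, 13, 16, 19, 22]))  # 13.0
print(round(mean_discrete(x, f), 4))         # 720 / 33
print(round(mean_continuous(classes, f), 4)) # 827.5 / 33
```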
Calculation of mean by using deviation concept:
Sometimes the values of the variable ($x$), the frequencies ($f$), or both
are large, and calculating the mean by the previous formulas is quite
time-consuming. In such cases the mean is calculated from the deviations
of the given values from an arbitrary point $A$, as explained below.

Discrete frequency distribution:

$$\bar{x} = A + \frac{1}{N} \sum_{i} f_i d_i ,$$

where $A$ is an arbitrary point, $d_i = x_i - A$ and $N = \sum_{i} f_i$.

Continuous frequency distribution:

$$\bar{x} = A + \frac{h}{N} \sum_{i} f_i d_i ,$$

where $A$ is an arbitrary point, $h$ is the magnitude (width) of the class
interval, $N = \sum_{i} f_i$ and $d_i = \frac{x_i - A}{h}$, the $x_i$ being
the mid-values of the classes.
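
The sketch below applies the deviation method to the continuous example table (classes 5-10 through 35-40) and compares it with the direct mid-value formula. The choices $A = 22.5$ and $h = 5$ are assumptions made for the illustration; any other $A$ should give the same mean.

```python
classes = [(5, 10), (10, 15), (15, 20), (20, 25), (25, 30), (30, 35), (35, 40)]
freqs = [2, 3, 5, 4, 7, 9, 3]


def mean_step_deviation(classes, freqs, A, h):
    """Mean via x_bar = A + (h / N) * sum(f_i * d_i), d_i = (x_i - A) / h."""
    N = sum(freqs)
    d = [((lower + upper) / 2 - A) / h for lower, upper in classes]
    return A + (h / N) * sum(f * di for f, di in zip(freqs, d))


def mean_direct(classes, freqs):
    """Mean via sum(mid-value * f_i) / sum(f_i), for comparison."""
    mids = [(lower + upper) / 2 for lower, upper in classes]
    return sum(f * m for f, m in zip(freqs, mids)) / sum(freqs)


shortcut = mean_step_deviation(classes, freqs, A=22.5, h=5)
direct = mean_direct(classes, freqs)
print(round(shortcut, 4), round(direct, 4))
```

With $A = 22.5$ the deviations $d_i$ are the small integers $-3, -2, \dots, 3$, which is the whole point of the method when working by hand.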
Karl Pearson Relationship:

Sometimes the mode is estimated from the mean and the median.

For a symmetrical distribution, the mean, median and mode coincide.
If the distribution is moderately asymmetrical, the mean, median and
mode obey the following empirical relationship (due to Karl Pearson):

The distance between the mean and the median is about one-third of
the distance between the mean and the mode:

Mean - Mode = 3 (Mean - Median)

which gives Mode = 3 Median - 2 Mean.

Relation between Mean, Median, Mode:

1. In symmetrical distribution Mean = Median = Mode.

2. In positively skewed distribution Mode < Median < Mean.

3. In negatively skewed distribution Mean < Median < Mode.
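
A small Python sketch of the empirical relationship follows. The function names and the sample values (mean 25, median 24) are hypothetical, and the result is only an estimate that applies to moderately asymmetrical distributions.

```python
def empirical_mode(mean, median):
    """Estimate the mode from Karl Pearson's relation: 3 * Median - 2 * Mean."""
    return 3 * median - 2 * mean


def skew_direction(mean, median):
    """Classify skewness from the ordering of mean and median."""
    if mean > median:
        return "positive"   # Mode < Median < Mean
    if mean < median:
        return "negative"   # Mean < Median < Mode
    return "symmetrical"    # Mean = Median = Mode


mode_estimate = empirical_mode(mean=25.0, median=24.0)
print(mode_estimate)               # 22.0
print(skew_direction(25.0, 24.0))  # positive
```

In this example Mean - Mode = 3 and Mean - Median = 1, so the one-third relationship holds by construction.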

Partitions:
These are the values which divide the series into a number of equal parts.
The common partitions are quartiles, deciles and percentiles.

Quartiles: The three points which divide the series into four equal parts
are called quartiles. They are denoted by $Q_1, Q_2, Q_3$.

Deciles: The nine points which divide the series into ten equal parts
are called deciles. They are denoted by $D_1, D_2, \dots, D_9$.

Percentiles: The ninety-nine points which divide the series into a hundred
equal parts are called percentiles. They are denoted by $P_1, P_2, \dots, P_{99}$.
For discrete frequency distribution:

Here $N$ is the total frequency and $cf$ denotes cumulative frequency.

Quartiles: For $Q_k$ ($k = 1, 2, 3$), compute $\frac{kN}{4}$. Identify the
same value in the $cf$ list; otherwise find the $cf$ just greater than
$\frac{kN}{4}$. The corresponding value of the variable is the quartile.

Deciles: For $D_k$ ($k = 1, 2, \dots, 9$), compute $\frac{kN}{10}$. Identify
the same value in the $cf$ list; otherwise find the $cf$ just greater than
$\frac{kN}{10}$. The corresponding value of the variable is the decile.

Percentiles: For $P_k$ ($k = 1, 2, \dots, 99$), compute $\frac{kN}{100}$.
Identify the same value in the $cf$ list; otherwise find the $cf$ just
greater than $\frac{kN}{100}$. The corresponding value of the variable is
the percentile.
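
The lookup rule for the discrete case can be sketched in Python as below. The function name `partition_value` is hypothetical, and the data are the values and frequencies from the discrete example table (15, 17, ..., 30). The comparison is done with integers (`cf * parts >= k * N`) to avoid rounding issues when $kN/\text{parts}$ falls exactly on a cumulative frequency.

```python
from itertools import accumulate


def partition_value(xs, fs, k, parts):
    """Return the k-th partition value (parts=4, 10 or 100).

    Finds the first cumulative frequency that equals or exceeds
    k * N / parts and returns the corresponding value of the variable.
    xs must be sorted in ascending order.
    """
    N = sum(fs)
    for x, cf in zip(xs, accumulate(fs)):
        if cf * parts >= k * N:
            return x
    raise ValueError("k must satisfy 1 <= k < parts")


x = [15, 17, 18, 20, 22, 25, 30]
f = [2, 3, 5, 4, 7, 9, 3]   # N = 33, cf = 2, 5, 10, 14, 21, 30, 33

quartiles = [partition_value(x, f, k, 4) for k in (1, 2, 3)]
d5 = partition_value(x, f, 5, 10)
p90 = partition_value(x, f, 90, 100)
print(quartiles, d5, p90)  # [18, 22, 25] 22 25
```

For $Q_1$, $kN/4 = 8.25$; the first cumulative frequency at or above it is 10, which belongs to $x = 18$.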
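For continuous frequency distribution:

Quartiles:
Step 1: Find the quartile class. Compute $\frac{kN}{4}$; identify the same
value in the $cf$ list, otherwise find the $cf$ just greater than
$\frac{kN}{4}$. The class with that $cf$ is the quartile class.
Step 2: Use the formula

$$Q_k = l + \frac{h}{f}\left(\frac{kN}{4} - c\right), \quad k = 1, 2, 3.$$

Deciles:
Step 1: Find the decile class. Compute $\frac{kN}{10}$; identify the same
value in the $cf$ list, otherwise find the $cf$ just greater than
$\frac{kN}{10}$. The class with that $cf$ is the decile class.
Step 2: Use the formula

$$D_k = l + \frac{h}{f}\left(\frac{kN}{10} - c\right), \quad k = 1, 2, \dots, 9.$$

Percentiles:
Step 1: Find the percentile class. Compute $\frac{kN}{100}$; identify the
same value in the $cf$ list, otherwise find the $cf$ just greater than
$\frac{kN}{100}$. The class with that $cf$ is the percentile class.
Step 2: Use the formula

$$P_k = l + \frac{h}{f}\left(\frac{kN}{100} - c\right), \quad k = 1, 2, \dots, 99.$$

In each formula, $l$ is the lower boundary of the class found in Step 1,
$h$ is its width, $f$ is its frequency, and $c$ is the cumulative
frequency of the class preceding it.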
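
The two steps for the continuous case can be sketched in Python as follows. The function name `grouped_partition` is hypothetical and the data are the classes and frequencies of the continuous example table (5-10 through 35-40). The symbols are read in the usual way: `lower` is the lower boundary $l$ of the partition class, `upper - lower` its width $h$, `f` its frequency and `c` the cumulative frequency before it.

```python
def grouped_partition(classes, freqs, k, parts):
    """k-th partition value for a continuous frequency distribution.

    Step 1: locate the first class whose cumulative frequency equals or
            exceeds k * N / parts.
    Step 2: apply  l + (h / f) * (k * N / parts - c).
    """
    N = sum(freqs)
    c = 0  # cumulative frequency of the classes before the current one
    for (lower, upper), f in zip(classes, freqs):
        if (c + f) * parts >= k * N:
            h = upper - lower
            return lower + (h / f) * (k * N / parts - c)
        c += f
    raise ValueError("k must satisfy 1 <= k < parts")


classes = [(5, 10), (10, 15), (15, 20), (20, 25), (25, 30), (30, 35), (35, 40)]
freqs = [2, 3, 5, 4, 7, 9, 3]   # N = 33, cf = 2, 5, 10, 14, 21, 30, 33

q1 = grouped_partition(classes, freqs, 1, 4)
q2 = grouped_partition(classes, freqs, 2, 4)
q3 = grouped_partition(classes, freqs, 3, 4)
print(q1, round(q2, 4), round(q3, 4))
```

For $Q_1$, $kN/4 = 8.25$ falls in the class 15-20 (with $c = 5$, $f = 5$, $h = 5$), giving $15 + (5/5)(8.25 - 5) = 18.25$.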
Problems based on the Moments, Skewness and Kurtosis concepts:

Q1) Find the first four moments about 𝑥 = 10 for the series 4, 7, 10, 13, 16, 19, 22.

Q2) Calculate the first four moments about the mean for the series 4, 7, 10, 13, 16,
19, 22.

Q3) The first four moments of a distribution about 𝑥 = 4 are 1, 4, 10 and 45.
Comment upon the nature of the distribution.

Q4) In a certain distribution, the first four moments about x=5 are 2, 20, 40 and 50.
Calculate and state whether the distribution is leptokurtic or platykurtic.

Q5) The first four central moments of a distribution are 0, 2.5, 0.7 and 18.75. Test the
skewness and kurtosis of the distribution.
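
As a way to check answers to problems of this kind, the sketch below computes moments in Python for the series used in Q1 and Q2. The definitions are not stated in this excerpt, so the standard ones are assumed: the $r$-th moment about a point $A$ is $\mu'_r = \frac{1}{n}\sum (x_i - A)^r$, central moments are moments about the mean, $\beta_1 = \mu_3^2 / \mu_2^3$ and $\beta_2 = \mu_4 / \mu_2^2$. The function names are hypothetical.

```python
def moments_about(xs, A, order=4):
    """First `order` moments of the series xs about the point A."""
    n = len(xs)
    return [sum((x - A) ** r for x in xs) / n for r in range(1, order + 1)]


def central_from_raw(m1, m2, m3, m4):
    """Convert the first four moments about any point into central moments."""
    mu2 = m2 - m1 ** 2
    mu3 = m3 - 3 * m2 * m1 + 2 * m1 ** 3
    mu4 = m4 - 4 * m3 * m1 + 6 * m2 * m1 ** 2 - 3 * m1 ** 4
    return [0.0, mu2, mu3, mu4]


series = [4, 7, 10, 13, 16, 19, 22]

raw = moments_about(series, A=10)           # moments about x = 10
mean = sum(series) / len(series)
central = moments_about(series, A=mean)     # moments about the mean
converted = central_from_raw(*raw)          # same result via conversion

beta1 = central[2] ** 2 / central[1] ** 3   # skewness coefficient
beta2 = central[3] / central[1] ** 2        # kurtosis coefficient
print(raw, central, beta1, beta2)
```

Under the usual convention, $\beta_2 < 3$ indicates a platykurtic distribution and $\beta_2 > 3$ a leptokurtic one, while $\beta_1 = 0$ is consistent with symmetry.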
