0% found this document useful (0 votes)

43 views35 pages

Introduction To Statistics

Uploaded by

Dee soni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views35 pages

Introduction To Statistics

Uploaded by

Dee soni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 35

STATISTIC

S
STATISTICS
 Statistics is the study of the collection,
analysis, interpretation, presentation, and
organization of data. In other words, it is a
mathematical discipline to collect, summarize
data.
APPLICATIONS OF
STATISTICS

Statistics in Statistics in Statistics in Statistics in Statistics in social

Economics industry insurance astronomy sciences

Statistics in Statistics in
Statistics in
Biology and Psychology and Statistics in war
Physical Science
Medical Science Education
LIMITATIONS OF STATISTICS
It is not concerned
It does not It does not reveal
with the
recognize the the entire story of
qualitative
individual items: a phenomenon:
phenomena:

Its results are true

It laws are not It is likely to be
only on an
exact: misused:
average:
STATISTIC VS. PARAMETER

A statistic is a characteristic of a sample.

• It is a numerical or graphic way to summarize data obtained from a
sample
A parameter is a characteristic of a population.
• It is a numerical or graphic way to summarize data obtained from
the population
TYPES OF NUMERICAL DATA
 There are two fundamental types of numerical data:
 Categorical data: obtained by determining the frequency of occurrences in each of several
categories
 Quantitative data: obtained by determining placement on a scale that indicates amount or degree
TYPES OF ENQUIRY

OFFICIAL, SEMI- INITIAL OR CONFIDENTIAL DIRECT OR REGULAR OR CENSUS OR

OFFICIAL OR UN- REPETITIVE OR NON- INDIRECT AD-HOC SAMPLE
OFFICIAL CONFIDENTIAL

PRIMARY OR
SECONDARY
CLASSIFICATI
ON OF DATA
Classification is the process of arranging data into
sequences and groups according to their common
characteristics or separating them into different but
related parts.
FUNCTIONS OF
CLASSIFICATION
It condenses the data

It facilities comparisons

It helps to study the relationships

It facilitates the statistical treatment of the data:

TECHNIQUES FOR SUMMARIZING
QUANTITATIVE DATA

Frequency Histograms Stem and Leaf Distribution Averages Variability

Distributions Plots curves
DISCRETE VS CONTINUOUS FREQUENCY
DISTRIBUTIONS
Continuous Frq. Dis.
Raw Data Arranged Data Discrete Frq.Dis. Exclusive Classes
Marks Marks Marks Freqency
Marks Frequency
76 32 32 1
0 - 25 0
93 39 39 2
25 - 50 3
39 39 50 2
50 - 75 3
50 50 66 1
75 – 100 5
76 50 76 2
81 66 81 1 Inclusive Classes
66 76 90 1 Marks Frequency
50 76 93 1 0 - 25 0
39 81 26 - 50 5
90 90 51 - 75 1
32 93 76 – 100 5
Histogram, Stem and Leaf Diagram
Histogram Stem and Leaf Plot

Stem Leaf
6 78
9 018
SUMMARY MEASURES
Summary Measures

Central Tendency Quartile Variation

Arithmetic Median Mode
Mean
Range Coefficient of
Variation
Variance
Geometric Mean
Standard Deviation
MEASURES OF CENTRAL
TENDENCY
Central Tendency

Average (Mean) Median Mode

X i
X  i 1

n
N

X i
 i 1

N
MEAN (ARITHMETIC MEAN)
 Mean (arithmetic mean) of data values
 Sample mean

Sample Size
n

X i
X1  X 2    X n
X i 1

 Population mean
n n
Population Size
N

X i
X1  X 2    X N
 i 1

N N
The most common measure of central tendency
Affected by extreme values (outliers)

0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 12 14

Mean = 5 Mean = 6

MEAN
WEIGHTED MEAN
A form of mean obtained from groups of data in


which the different sizes of the groups are
accounted for or weighted. f ( x)
xw 
N total
MEDIAN
 Robust measure of central tendency
 Not affected by extreme values

0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 12 14


Median = 5
In an Ordered array, median is the “middle” number
Median = 5
 If n or N is odd, median is the middle number
 If n or N is even, median is the average of the two middle numbers
MODE
 A measure of central tendency
 Value that occurs most often
 Not affected by extreme values
 Used for either numerical or categorical data
 There may be no mode
 There may be several modes

0 1 2 3 4 5 6
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

No Mode
Mode = 9
QUARTILES
 Split Ordered Data into 4 Quarters

25% 25% 25% 25%

Q 
1 Q2  Q3 
i  n  1
 Position of i-th Quartile

Qi  
4
Data in Ordered Array: 11 12 13 16 16 17 18 21 22

19  1 12  13 
Position of Q1   2.5 Q1   12.5
4 2
DIFFERENCES IN MEASURES
OF CENTRAL TENDENCY
 Mode, median and mean could be three different numbers in asymmetrical distributions of
data.
 For any data set there is only one mean and median but there may be many modes.
 Median is less influenced by the extreme values than mean.
 Mean is almost never observed, median is observed in only odd numbered data sets and mode
is always observed in the data set.
MEASURES OF VARIABILITY
 Measures of variability show how spread out the distribution of scores is from the mean,
or how much dispersion or scatter exists in the distribution. If there is a large degree of
dispersion, that is, if the scores are very dissimilar, we say the distribution has a large or high
variability, or variance. If the scores are very similar, there is a small degree of dispersion and
a small variance.
MEASURES OF VARIATION
Variation

Variance Standard Deviation Coefficient

of Variation
Range
Population Population Standard
Variance (σ2) deviation (σ)
Inter-
quartile Sample
Range Variance (S2)
Sample Standard
deviation (S)
RANGE
The range is simply the numerical
difference between the highest and lowest
scores in the distribution.
INTERQUARTILE RANGE

Can eliminate some outlier problems by using the interquartile range

Eliminate high- and low-valued observations and calculate the range of

the middle 50% of the data

Interquartile range = 3rd quartile – 1st quartile

• IQR = Q3 – Q1
STANDARD
DEVIATION
 The measure of variability used most often in
research is the standard deviation, a statistic
that indicates the average distance of the scores
from the mean of the distribution.
COMPARING STANDARD
DEVIATIONS
Data A Mean = 15.5
S = 3.338
11 12 13 14 15 16 17 18 19 20 21

Data B
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 S = .9258

Data C
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 S = 4.57
STANDARD DEVIATION
WITH A MEAN OF 62 AND A SD OF
3, 95% OF SCORES SHOULD FALL
BETWEEN 62-2*3 AND 62+2*3. I.E
56 AND 68
VARIANCE
 The Variance, s2, represents the amount of variability of the data
relative to their mean
 As shown below, the variance is the “average” of the squared
deviations of the observations about their mean

s 2

 ( x  x)
i
2

n 1
► The Variance, s2, is the sample variance, and is used to
estimate the actual population variance, s 2

 2

 (x  )
i
2

N
COEFFICIENT OF VARIATION
 Measures relative variation

 Always in percentage (%)

 Shows variation relative to mean

 Is used to compare two or more sets of data measured in different units

S 
CV   100%
X 
SKEWNESS
When graphing the mean, median and mode of
a distribution, roughly speaking, a distribution
has positive skew if the right tail is longer and
negative skew if the left tail is longer.
SHAPE OF A DISTRIBUTION
 Describes how data is distributed

 Measures of shape
 Symmetric or skewed

Left-Skewed Symmetric Right-Skewed

Mean < Median < Mode Mean = Median =Mode Mode < Median < Mean
POSITIVELY SKEWED
 This distribution has a positive skew. Note that the mean is larger than the median.
NEGATIVELY SKEWED
 This distribution has a negative skew. The median is larger than the mean.

DevOps Engineer Learning Path - Kodekloud
No ratings yet
DevOps Engineer Learning Path - Kodekloud
10 pages
Statistics: a QuickStudy Laminated Reference Guide
From Everand
Statistics: a QuickStudy Laminated Reference Guide
BarCharts Publishing, Inc.
No ratings yet
Introduction To Descriptive Statistics
No ratings yet
Introduction To Descriptive Statistics
73 pages
Descriptive Statistics 1
No ratings yet
Descriptive Statistics 1
63 pages
RMBS BPT402
No ratings yet
RMBS BPT402
103 pages
EDA W3 Obtaining-Data
No ratings yet
EDA W3 Obtaining-Data
57 pages
2 Basic Statistics Unit-II Class PPT
No ratings yet
2 Basic Statistics Unit-II Class PPT
28 pages
Measures of Location and VARIATION For 1 Variable
No ratings yet
Measures of Location and VARIATION For 1 Variable
44 pages
Unit 1 - Business Statistics & Analytics
No ratings yet
Unit 1 - Business Statistics & Analytics
25 pages
f592b059 1643454320549
No ratings yet
f592b059 1643454320549
39 pages
Biostats Lesson 3
No ratings yet
Biostats Lesson 3
6 pages
Interpreting Test Score: Online Workshop 8602 Aiou
100% (1)
Interpreting Test Score: Online Workshop 8602 Aiou
39 pages
Psychology Project
No ratings yet
Psychology Project
14 pages
Math in The Modern World Stat Lecture
No ratings yet
Math in The Modern World Stat Lecture
3 pages
Lecture 06-Describing Data Visual Information
No ratings yet
Lecture 06-Describing Data Visual Information
49 pages
St130: Basic Statistics Week 3: Lecture: School of Computing Information and Mathematical Sciences
No ratings yet
St130: Basic Statistics Week 3: Lecture: School of Computing Information and Mathematical Sciences
62 pages
Dsbda Unit 2
No ratings yet
Dsbda Unit 2
155 pages
Lesson 02 Probability and Statistics
No ratings yet
Lesson 02 Probability and Statistics
127 pages
Measures of Central Tendency
100% (15)
Measures of Central Tendency
15 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
38 pages
Chapter 3
No ratings yet
Chapter 3
17 pages
Jerome Statistics
No ratings yet
Jerome Statistics
12 pages
Stats Prac 1
No ratings yet
Stats Prac 1
10 pages
Statistical Analysis - Descriptive Stat
No ratings yet
Statistical Analysis - Descriptive Stat
6 pages
Lecture Notes 2 - Descriptive Statistics-1720598791715
No ratings yet
Lecture Notes 2 - Descriptive Statistics-1720598791715
21 pages
City Uni of New York
No ratings yet
City Uni of New York
33 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
4 pages
03 Numerical Description
No ratings yet
03 Numerical Description
52 pages
Descriptive Stat
No ratings yet
Descriptive Stat
13 pages
Data Management (1) (1) - Compressed
No ratings yet
Data Management (1) (1) - Compressed
46 pages
Hns 2321 Biostatistics Descritive Statistics
No ratings yet
Hns 2321 Biostatistics Descritive Statistics
35 pages
Stat Chapter 5-9
No ratings yet
Stat Chapter 5-9
32 pages
2 - Introduction To Statistics
No ratings yet
2 - Introduction To Statistics
97 pages
WEEK 3 - Central-Tendency-Variation-And-Shape
No ratings yet
WEEK 3 - Central-Tendency-Variation-And-Shape
39 pages
Bioepi Lesson 6. Descriptive Statistics
No ratings yet
Bioepi Lesson 6. Descriptive Statistics
38 pages
Statistics I Chapter 2: Univariate Data Analysis
No ratings yet
Statistics I Chapter 2: Univariate Data Analysis
27 pages
2nd Unit - Statistics
No ratings yet
2nd Unit - Statistics
15 pages
Chapter 2 Descriptive Statistics
No ratings yet
Chapter 2 Descriptive Statistics
12 pages
Module 10 Introduction To Data and Statistics
No ratings yet
Module 10 Introduction To Data and Statistics
63 pages
8614.educational Statitics Unit 4
No ratings yet
8614.educational Statitics Unit 4
34 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
ISM Session 1-8+webinar1,2 Merged
No ratings yet
ISM Session 1-8+webinar1,2 Merged
718 pages
Notes 3 Descriptive Statistics RJMurden 2021
No ratings yet
Notes 3 Descriptive Statistics RJMurden 2021
47 pages
CH 2 Lecture Notes
No ratings yet
CH 2 Lecture Notes
12 pages
Presentation 4
No ratings yet
Presentation 4
29 pages
Click To Add Text Dr. Cemre Erciyes
No ratings yet
Click To Add Text Dr. Cemre Erciyes
69 pages
2a. Describing Variables With Numbers
No ratings yet
2a. Describing Variables With Numbers
30 pages
Statistics 3: DR Taher
No ratings yet
Statistics 3: DR Taher
38 pages
Statistical Data
No ratings yet
Statistical Data
41 pages
Ch01 Intro Stat&DataAnalysis
No ratings yet
Ch01 Intro Stat&DataAnalysis
106 pages
Module 3 4 MMW
No ratings yet
Module 3 4 MMW
6 pages
Biostatistics (Descriptive Statistics)
No ratings yet
Biostatistics (Descriptive Statistics)
30 pages
NITKclass 1
No ratings yet
NITKclass 1
50 pages
MCS Lecture 3
No ratings yet
MCS Lecture 3
57 pages
STAE Lecture Notes - LU3
No ratings yet
STAE Lecture Notes - LU3
24 pages
DDDDDD 2
No ratings yet
DDDDDD 2
5 pages
Class Test 1 Revision Notes
No ratings yet
Class Test 1 Revision Notes
10 pages
DSJ BMS Unit2
No ratings yet
DSJ BMS Unit2
18 pages
Descriptive Statistic
No ratings yet
Descriptive Statistic
37 pages
TDA1
No ratings yet
TDA1
57 pages
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Ohn Oe
No ratings yet
Ohn Oe
2 pages
STEP 7 V56 - Compatibility List
No ratings yet
STEP 7 V56 - Compatibility List
31 pages
Research Paper 12 Abm Efficient Honrados Group
No ratings yet
Research Paper 12 Abm Efficient Honrados Group
34 pages
Weekly Assessment in Science
No ratings yet
Weekly Assessment in Science
1 page
Szymanowski List of Compositions
No ratings yet
Szymanowski List of Compositions
12 pages
EEE229/EEE223/GEE202 - Problem Sheet 1
No ratings yet
EEE229/EEE223/GEE202 - Problem Sheet 1
1 page
CD Lab Exam
No ratings yet
CD Lab Exam
3 pages
Ca1 Prelim
No ratings yet
Ca1 Prelim
60 pages
Unit 3 Theories and Principles in The Use and Design of Technology Driven Learning Lessons
100% (1)
Unit 3 Theories and Principles in The Use and Design of Technology Driven Learning Lessons
49 pages
9 - Class INTSO Work Sheet - 3 - Basic Concepts of Geometry
No ratings yet
9 - Class INTSO Work Sheet - 3 - Basic Concepts of Geometry
8 pages
Advanced Structural Analysis Prof. Devdas Menon Department of Civil Engineering Indian Institute of Technology, Madras
No ratings yet
Advanced Structural Analysis Prof. Devdas Menon Department of Civil Engineering Indian Institute of Technology, Madras
32 pages
(Mycology Series 16) D.H. Howard-Pathogenic Fungi in Humans and Animals-Marcel Dekker (2003)
100% (1)
(Mycology Series 16) D.H. Howard-Pathogenic Fungi in Humans and Animals-Marcel Dekker (2003)
804 pages
CHEMISTRY Exam
No ratings yet
CHEMISTRY Exam
8 pages
Alison Vidal
No ratings yet
Alison Vidal
5 pages
MyEdBC Family Portal Instructional Manual
No ratings yet
MyEdBC Family Portal Instructional Manual
6 pages
Noting and Drafting Skills
100% (2)
Noting and Drafting Skills
33 pages
The Present Continuous
No ratings yet
The Present Continuous
4 pages
Solutions To Chapter 4 Problems: Problem 4.1
No ratings yet
Solutions To Chapter 4 Problems: Problem 4.1
59 pages
Smit Vipul Kalamkar - CV
No ratings yet
Smit Vipul Kalamkar - CV
2 pages
HR Interview Questions
No ratings yet
HR Interview Questions
8 pages
Anticipation Guide-Phonics and Word Recognition
No ratings yet
Anticipation Guide-Phonics and Word Recognition
5 pages
Chi Square Test
No ratings yet
Chi Square Test
11 pages
AHP Template SCBUK
No ratings yet
AHP Template SCBUK
24 pages
Benchmarking Sox Costs, Hours and Controls
No ratings yet
Benchmarking Sox Costs, Hours and Controls
45 pages
Master Thesis - BUILDING A RISK MODEL FOR OIL & GAS - Submitted by Himanshu Singh
No ratings yet
Master Thesis - BUILDING A RISK MODEL FOR OIL & GAS - Submitted by Himanshu Singh
56 pages
Physical Education Revision
No ratings yet
Physical Education Revision
3 pages
How To Send or Receive SMS Message Via GSM Module by at Commands
100% (1)
How To Send or Receive SMS Message Via GSM Module by at Commands
6 pages
ECommerce Virtual Assistant Course
100% (1)
ECommerce Virtual Assistant Course
18 pages
Statement of Purpose (Ashok)
No ratings yet
Statement of Purpose (Ashok)
2 pages

Introduction To Statistics

Uploaded by

Introduction To Statistics

Uploaded by

STATISTIC

Statistics in Statistics in Statistics in Statistics in Statistics in social

Its results are true

A statistic is a characteristic of a sample.

OFFICIAL, SEMI- INITIAL OR CONFIDENTIAL DIRECT OR REGULAR OR CENSUS OR

It helps to study the relationships

It facilitates the statistical treatment of the data:

Frequency Histograms Stem and Leaf Distribution Averages Variability

Central Tendency Quartile Variation

Average (Mean) Median Mode

25% 25% 25% 25%

Variance Standard Deviation Coefficient

Can eliminate some outlier problems by using the interquartile range

Eliminate high- and low-valued observations and calculate the range of

Interquartile range = 3rd quartile – 1st quartile

 Always in percentage (%)

 Shows variation relative to mean

 Is used to compare two or more sets of data measured in different units

Left-Skewed Symmetric Right-Skewed

You might also like