0% found this document useful (0 votes)

35 views26 pages

Descriptive Statistics

Uploaded by

Rameir Angelo Catamora

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views26 pages

Descriptive Statistics

Uploaded by

Rameir Angelo Catamora

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

Descriptive

Statistics
MODULE 2
Data – descriptive facts and figures collected, analyzed, and summarized for presentation and
interpretation.

Data mining – analytical techniques to better understand patterns and relationships

◦ data collection,
◦ cleaning,
◦ exploratory data analysis (ex. Data query),
◦ identifying variables for further analysis,
◦ model building (regression, decision trees, etc. ),
◦ pattern discovery (predictions),
◦ presentation and integration.
4 Vs of Big Data
Volume – how do we store data?

Velocity – how do we keep real-time/up-to-date data

Variety – how to analyze different data formats? (text, audio, video)

Veracity – how much uncertainty is in the data?

Variables vs. Population vs.
Observation Sample
Quantitative vs. Categorical Data – Frequency Tables, Measures of
Location (arithmetic can be performed).
Cross-sectional vs. time-series data – same point in time vs. several time
periods.
Modifying
Data
MODULE 2
In your respective fields, how can we
utilize this function in order to make
Sorting and Filtering data analysis manageable?
Conditional Formatting

Quick Analysis tool

How can we utilize/apply
this function in your
respective fields? What
data query can you come
up with?
Creating Distributions from Data
Frequency Distribution – summary of
data that shows the number
(frequency) of observations in each of
several nonoverlapping classes (bins).

Percentage frequency distribution –

estimating the probability distribution
that characterizes its variability.
Creating Distributions from Data
SPSS – Frequency Tables

Data transformation – Variable View (bottom left) set the Values and Measure
Creating Distributions from Data
SPSS – Frequency Tables
Creating Distributions from Data
Frequency Table for a Quantitative/Numerical Data What numerical data can be
grouped into bins/categories?
◦ 1. Determine the number of nonoverlapping bins.
◦ 2. Determine the width of each bin. (largest – smallest data value / number of bins)
◦ 3. Determine the bin limits (upper and lower limit)

=COUNTIFS >=,<= or Histogram may be used

Measures of Central Tendency
Mean – Average
◦ Arithmetic Mean – simple average (additive data, linear relationship)
◦ Geometric Mean – growth rates/ratios or other variables that compounds over time (multiplicative data)
◦ Mean Rate of Change over several successive periods.
◦ Compounded Annual Growth Rate (CAGR) = Rate of Return = (Ending value/Beginning value)^(1/n)-1
Example: Annual Growth Rates for year 1,2,3 are as follow: 15%; -10%; 21%.
EXCEL - =RRI (nper,present,future)^(1/n-1)-1

Median – middle value when arranged in ascending order.

Excel: = MEDIAN (data range)
Mode – value that occurs most frequently. What is the importance of understanding measures of
Excel: = MODE (data range) central tendency/location in the context of
management?
Measures of Variability/Dispersion
Range – Max and Min values (difference); sensitive to outliers or extreme values.

Excel: =MAX(data range)-MIN(data range)

Variance- (S²) = variability based on the deviation from the mean =∑ (xi –x bar)^2 / n-1(unbiased
estimate of the population variance)

Excel: VAR.S(data range)

Standard Deviation = √S² (square root of the variance) ; measured in the same units as the
original data.

Excel: STDEV.S (data range) What is the importance of understanding variance

and deviations in the management context?
Measures of Variability/Dispersion
Z-Scores – measures the relative location of a
value in the data set. Also referred to as
“standardized value” . Z= (X−μ)/ σ
*Compute for the Mean and SD first.
EXCEL: =STANDARDIZE (data point, Mean, SD)

Z-Scores as used in management:

- Competitiveness (example: price
competitiveness)
- Consumer Behavior (seasonality, customer
satisfaction)
- Demand and Capacity (optimal z-scores for
maximizing revenue)
Empirical Rule

How do we interpret the z-values of ticket

prices with reference to the standard
deviation?
How do we interpret the Box plot presented here? What insights can we generate
from the visualization?
Other Visualizations
Data-Ink Ratio
◦ Maximize the use of ink to represent and communicate
actual data, minimizing non-data ink.
◦ = Ink Used to Display Data/Total Ink Used in the Graphic

Maximizing Data-Ink Ratio:

1. Remove non-essential Ink.
2. Simplify Label.
3. Minimize Chartjunk (decorative elements).
4. Maximize Data Density (e.g. use of small multiples
thru shared axes)
Other Visualizations
Ask yourself these questions:

•Who is my audience?

•What questions do they have?

•What answers am I finding for them?

•What am I trying to say?

•What other questions will my visualization inspire or

what conversations may result?
Other Visualizations
Scatter Chart – relationship between two quantitative variables.
- Check if trend line fits into
the data.
- R2 can be used in
assessing how well the
trendline fits the data.
- Value of .30 up explains
that 30% of the variability
observed in the target
variable is explained by the
regression model.
Other Visualizations
Line Chart
◦ Useful for time series data (e.g. sales performance over the past 12months).
◦ Can be used for comparative and trend analysis, and for numerical/quantitative data Y axis, and
sequential data.

Bar Chart – provides graphical summary of categorical data.

Bubble Chart – used for three variable visualization in a single plot. Each bubble represents
magnitude or size.
Measures of Shape
Normal Distribution

Detection of outliers.

Making probabilistic statements about population parameters.

It simplifies interpretation and makes it easier to draw meaningful conclusions from statistical
analyses.
Measures of Shape
Skewness – measure of symmetry or more precisely, the lack of symmetry. A dataset is symmetric if it
looks the same to the left and to the right of the center point.
Pearson’s correlation of Skewness = Mean-Median/Standard Deviation
- Between -0.5 and +0.5 (nearly symmetrical)
- Between -1 and -0.5 (negative skewed); +1 and 0.5 (positive skewed)
- lower than -1 and higher than 1 (extremely skewed)
-
Kurtosis – measure of whether the data are heavy-tailed or light-tailed relative to a normal distribution.
- Mesokurtic; Leptokurtic; Platykurtic
Measures of Associations
Nominal/Categorical Data
Crosstabulations/Contingency Tables
•The chi-square statistic is a measure of the
difference between the observed and
expected frequencies. A larger chi-square
value indicates a greater deviation from
expected values.
•The null hypothesis assumes independence,
and a significant chi-square test suggests that
the variables are not independent.
Measures of Associations
Continuous/numerical variables

Covariance – descriptive measure of the linear association of continuous variables.

- magnitude of the covariance is difficult to interpret. If value is >0 = they are positively related; <0
= they are negatively related; =0 not related.

=COVARIANCE.S(x datarange; y datarange)

Correlation Coefficient – relationship between 2 continuous variables, but units of measurement

does not affect the calculation. Magnitude is measured. Value is between -1 and +1.

= CORREL(xdatarange;ydatarange)
Summary Tables

It0089 Finalreviewer
100% (1)
It0089 Finalreviewer
143 pages
Utilizes The Standards (Criteria or Checklist) in Evaluating Research Paper Peer/Group/Expert Evaluation)
100% (1)
Utilizes The Standards (Criteria or Checklist) in Evaluating Research Paper Peer/Group/Expert Evaluation)
5 pages
A - 11 - Artificial Intelligence in Marketing Strategies
No ratings yet
A - 11 - Artificial Intelligence in Marketing Strategies
17 pages
MODULE 8-LESSON 3-Writing A Survey Report
No ratings yet
MODULE 8-LESSON 3-Writing A Survey Report
27 pages
1.9 Data and data analysis
No ratings yet
1.9 Data and data analysis
31 pages
Predicting results of social science experiments using large language models
No ratings yet
Predicting results of social science experiments using large language models
18 pages
Business Statistics and Computing Complete Ppts (1)
No ratings yet
Business Statistics and Computing Complete Ppts (1)
213 pages
Muhammad Syuaib Samsir, Zailawati Khalid, Norazah Attan, Goh Kai Chen & Haryati Shafii
No ratings yet
Muhammad Syuaib Samsir, Zailawati Khalid, Norazah Attan, Goh Kai Chen & Haryati Shafii
18 pages
Sample Questions Asked Thesis Defense
100% (2)
Sample Questions Asked Thesis Defense
8 pages
Lecture 5 (2)
No ratings yet
Lecture 5 (2)
33 pages
3RD QUARTER STATISTICS AND PROBABILITY (1)
No ratings yet
3RD QUARTER STATISTICS AND PROBABILITY (1)
7 pages
Get An Introduction to Scientific Research Methods in Geography and Environmental Studies 2nd Edition Daniel R. Montello free all chapters
100% (3)
Get An Introduction to Scientific Research Methods in Geography and Environmental Studies 2nd Edition Daniel R. Montello free all chapters
60 pages
DOM503 Session 1
No ratings yet
DOM503 Session 1
19 pages
DATA ANALYSIS
No ratings yet
DATA ANALYSIS
6 pages
Notes Stats
No ratings yet
Notes Stats
21 pages
Data Analysis
No ratings yet
Data Analysis
30 pages
Stat Quick Overview
No ratings yet
Stat Quick Overview
35 pages
Descriptive Statistics Analysis Part 1
No ratings yet
Descriptive Statistics Analysis Part 1
42 pages
Unit 1 - Slides
No ratings yet
Unit 1 - Slides
71 pages
Data Analytics Summary
No ratings yet
Data Analytics Summary
80 pages
Data Analytics Summary
No ratings yet
Data Analytics Summary
89 pages
RM-EBBA-class-8-CH0-11-Quatitative-analysis
No ratings yet
RM-EBBA-class-8-CH0-11-Quatitative-analysis
37 pages
Descriptive Statistics (1)
No ratings yet
Descriptive Statistics (1)
63 pages
Week 8 Quantitative Data Analysis - Descriptive Statistics
No ratings yet
Week 8 Quantitative Data Analysis - Descriptive Statistics
59 pages
g5 Quantitative Method
No ratings yet
g5 Quantitative Method
34 pages
From Data Management to Actionable Findings
No ratings yet
From Data Management to Actionable Findings
2 pages
Elective Finals 3A
No ratings yet
Elective Finals 3A
2 pages
ISOM Cheat Sheet 1
No ratings yet
ISOM Cheat Sheet 1
6 pages
Biostatistics - i
No ratings yet
Biostatistics - i
46 pages
Getting Started With Evaluating Impact: Further Reading
No ratings yet
Getting Started With Evaluating Impact: Further Reading
2 pages
chapter2-statistical analysis
No ratings yet
chapter2-statistical analysis
86 pages
Basic Statistics
No ratings yet
Basic Statistics
90 pages
MC Stat
No ratings yet
MC Stat
101 pages
Research Methodology: Result and Analysis (Part 1)
No ratings yet
Research Methodology: Result and Analysis (Part 1)
65 pages
Iba Unit - Ii
No ratings yet
Iba Unit - Ii
31 pages
101 Nonscientific Methods PDF
No ratings yet
101 Nonscientific Methods PDF
2 pages
Lecture Week 2 Statistics
No ratings yet
Lecture Week 2 Statistics
57 pages
02Data (2)
No ratings yet
02Data (2)
36 pages
02Data Edited v2
No ratings yet
02Data Edited v2
43 pages
ISDS 361A - Cheat Sheet Exam 1.pdf
No ratings yet
ISDS 361A - Cheat Sheet Exam 1.pdf
2 pages
Data Analysis Topics Discussed Getting Data Ready For Analysis 1) - Editing Data (Definition)
No ratings yet
Data Analysis Topics Discussed Getting Data Ready For Analysis 1) - Editing Data (Definition)
8 pages
Chapter 4 SAMPLE
No ratings yet
Chapter 4 SAMPLE
3 pages
Descriptive Statistics and Exploratory Data Analysis
No ratings yet
Descriptive Statistics and Exploratory Data Analysis
36 pages
DA Major Notes
No ratings yet
DA Major Notes
46 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
24 pages
Business Research Chapter 4
No ratings yet
Business Research Chapter 4
38 pages
Chapter 1
No ratings yet
Chapter 1
20 pages
Study On Purchasing Behaviour of Males & Females
50% (2)
Study On Purchasing Behaviour of Males & Females
8 pages
7u7 PDF
No ratings yet
7u7 PDF
31 pages
Topic 8 Data Processing and Analysis PDF
No ratings yet
Topic 8 Data Processing and Analysis PDF
157 pages
Estadístic A Descriptiv A: Dr. Lázaro Bustio Martínez Otoño 2023
No ratings yet
Estadístic A Descriptiv A: Dr. Lázaro Bustio Martínez Otoño 2023
42 pages
Transportation Data Mining: Chapter 2. Getting To Know Your Data
No ratings yet
Transportation Data Mining: Chapter 2. Getting To Know Your Data
77 pages
Article Review 1 Eng
No ratings yet
Article Review 1 Eng
30 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
22 pages
Biostats Lesson 3
No ratings yet
Biostats Lesson 3
6 pages
Educational Assessment Mcqs For PSC Headmaster
67% (133)
Educational Assessment Mcqs For PSC Headmaster
24 pages
ge8 statistics
No ratings yet
ge8 statistics
2 pages
Effects of Digital Game-Based STEM Education On Students' Learning Achievement: A Meta-Analysis
No ratings yet
Effects of Digital Game-Based STEM Education On Students' Learning Achievement: A Meta-Analysis
13 pages
Identification of Problem in Action
No ratings yet
Identification of Problem in Action
11 pages
SUF ESG Score Methodology 2022-06
No ratings yet
SUF ESG Score Methodology 2022-06
15 pages
Research Methodology Notes: Section:A
No ratings yet
Research Methodology Notes: Section:A
25 pages
02 Data
No ratings yet
02 Data
64 pages
Quantitative AnalysisJD
No ratings yet
Quantitative AnalysisJD
64 pages
Chapter 2 - Understand Data
No ratings yet
Chapter 2 - Understand Data
63 pages
Day 01-Basic Statistics
No ratings yet
Day 01-Basic Statistics
36 pages
Six Sigma: Statistics: By: - Hakeem-Ur-Rehman
No ratings yet
Six Sigma: Statistics: By: - Hakeem-Ur-Rehman
44 pages
Problem Solving and Formulation
No ratings yet
Problem Solving and Formulation
10 pages
Heidegger and The Hermeneutic Turn of Philosophy
No ratings yet
Heidegger and The Hermeneutic Turn of Philosophy
16 pages
How Much Data Does Google Handle?
No ratings yet
How Much Data Does Google Handle?
132 pages
What Is Psychological Research
No ratings yet
What Is Psychological Research
3 pages
Unit .......
No ratings yet
Unit .......
45 pages
ES031 M1 DataCollection&Presentation
No ratings yet
ES031 M1 DataCollection&Presentation
64 pages
Introduction To Descriptive Statistics I: Sanju Rusara Seneviratne Mbpss
No ratings yet
Introduction To Descriptive Statistics I: Sanju Rusara Seneviratne Mbpss
35 pages
Concepts and Techniques: - Chapter 2
No ratings yet
Concepts and Techniques: - Chapter 2
29 pages
Algebra 1 Unit 6 Describing Data Notes
No ratings yet
Algebra 1 Unit 6 Describing Data Notes
13 pages
Calculation of Measurement Uncertainty in Environnemental Laboratories PDF
No ratings yet
Calculation of Measurement Uncertainty in Environnemental Laboratories PDF
55 pages
Business Statistics I BBA 1303: Muktasha Deena Chowdhury Assistant Professor, Statistics, AUB
100% (1)
Business Statistics I BBA 1303: Muktasha Deena Chowdhury Assistant Professor, Statistics, AUB
54 pages
MCA TIP Science Fair Rubric Quarter 1 - MS Revised 2
No ratings yet
MCA TIP Science Fair Rubric Quarter 1 - MS Revised 2
7 pages
Risk Analysis
No ratings yet
Risk Analysis
9 pages
Systematic Reviews
No ratings yet
Systematic Reviews
21 pages
Class Test 1 Revision Notes
No ratings yet
Class Test 1 Revision Notes
10 pages
Lesson 10 Simple Test of Hypothesis
No ratings yet
Lesson 10 Simple Test of Hypothesis
15 pages
Business Analytics (MIS171) Summary Notes
No ratings yet
Business Analytics (MIS171) Summary Notes
6 pages
WK 1b Biostat
No ratings yet
WK 1b Biostat
38 pages
f592b059 1643454320549
No ratings yet
f592b059 1643454320549
39 pages
It0089 Finalreviewer
No ratings yet
It0089 Finalreviewer
143 pages
Bustat Reviewer
No ratings yet
Bustat Reviewer
6 pages
Chap 3 Self Confidence 123
No ratings yet
Chap 3 Self Confidence 123
8 pages
Illuminating Data: A hands on guide to data visualization in R
From Everand
Illuminating Data: A hands on guide to data visualization in R
Eman Ahmad
No ratings yet
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet

Descriptive Statistics

Uploaded by

Descriptive Statistics

Uploaded by

Descriptive

Data mining – analytical techniques to better understand patterns and relationships

Velocity – how do we keep real-time/up-to-date data

Variety – how to analyze different data formats? (text, audio, video)

Veracity – how much uncertainty is in the data?

Quick Analysis tool

Percentage frequency distribution –

=COUNTIFS >=,<= or Histogram may be used

Median – middle value when arranged in ascending order.

Excel: =MAX(data range)-MIN(data range)

Excel: VAR.S(data range)

Excel: STDEV.S (data range) What is the importance of understanding variance

Z-Scores as used in management:

How do we interpret the z-values of ticket

Maximizing Data-Ink Ratio:

•What questions do they have?

•What answers am I finding for them?

•What am I trying to say?

•What other questions will my visualization inspire or

Bar Chart – provides graphical summary of categorical data.

Making probabilistic statements about population parameters.

Covariance – descriptive measure of the linear association of continuous variables.

=COVARIANCE.S(x datarange; y datarange)

Correlation Coefficient – relationship between 2 continuous variables, but units of measurement

You might also like