0% found this document useful (0 votes)
14 views36 pages

CH 01

This document discusses descriptive analysis and data visualization. It aims to develop skills in exploring and descriptively analyzing data through univariate and bivariate techniques. Additionally, it teaches using graphs and visualizations to identify patterns and relationships in data. Various types of graphs and charts are covered for descriptive analysis, including bar charts, histograms, scatter plots, box plots, pie charts and line graphs. Microsoft Excel is used for the module.

Uploaded by

Kelly Aguilar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views36 pages

CH 01

This document discusses descriptive analysis and data visualization. It aims to develop skills in exploring and descriptively analyzing data through univariate and bivariate techniques. Additionally, it teaches using graphs and visualizations to identify patterns and relationships in data. Various types of graphs and charts are covered for descriptive analysis, including bar charts, histograms, scatter plots, box plots, pie charts and line graphs. Microsoft Excel is used for the module.

Uploaded by

Kelly Aguilar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 36

Análisis descriptivo y visualización

de datos
• Objetivo: Desarrollar habilidades en la
exploración de datos y el análisis
descriptivo mediante técnicas univariadas y
bivariadas.
• Además, enseñar el uso de gráficas y
visualizaciones para identificar patrones y
relaciones en los datos.
• Total: 20 horas
Índice
• Exploración de datos y medidas de resumen:
• Análisis univariado y bivariado.
• Medidas de tendencia central, dispersión y posición.
• Tablas de frecuencia y distribuciones de frecuencia.
• Gráficas y visualizaciones para el análisis descriptivo:
• Gráficos de barras, histogramas y diagramas de
dispersión.
• Gráficos de caja y bigotes, gráficos de torta y gráficos
de línea.
• Representación visual de relaciones y patrones en los
datos.
• Nota. Para este módulo se utilizará Microsoft Excel.
Slides Prepared by
JOHN S. LOUCKS
St. Edward’s University

© 2002 South-Western /Thomson LearningTM Slide


3
Chapter 1
Data and Statistics
 Applications in Business and Economics
 Data
 Data Sources
 Descriptive Statistics
 Statistical Inference

Slide
4
Applications in
Business and Economics
 Accounting
Public accounting firms use statistical sampling
procedures when conducting audits for their clients.
 Finance
Financial advisors use a variety of statistical
information, including price-earnings ratios and
dividend yields, to guide their investment
recommendations.
 Marketing
Electronic point-of-sale scanners at retail checkout
counters are being used to collect data for a variety of
marketing research applications.

Slide
5
Applications in
Business and Economics
 Production
A variety of statistical quality control charts are used
to monitor the output of a production process.
 Economics
Economists use statistical information in making
forecasts about the future of the economy or some
aspect of it.

Slide
6
Data

 Elements, Variables, and Observations


 Scales of Measurement
 Qualitative and Quantitative Data
 Cross-Sectional and Time Series Data

Slide
7
Data and Data Sets

 Data are the facts and figures that are collected,


summarized, analyzed, and interpreted.
 The data collected in a particular study are referred to
as the data set.

Slide
8
Elements, Variables, and Observations

 The elements are the entities on which data are


collected.
 A variable is a characteristic of interest for the
elements.
 The set of measurements collected for a particular
element is called an observation.
 The total number of data values in a data set is the
number of elements multiplied by the number of
variables.

Slide
9
Data, Data Sets,
Elements, Variables, and Observations
Observation
Variables
Stock Annual Earn/
Company Exchange Sales($M) Sh.($)
Dataram AMEX 73.10 0.86
EnergySouth OTC 74.00 1.67
Keystone NYSE 365.70 0.86
LandCare NYSE 111.40 0.33
Psychemedics AMEX 17.60 0.13

Elements Data Set Datum

Slide
10
Scales of Measurement

 Scales of measurement include:


• Nominal
• Ordinal
• Interval
• Ratio
 The scale determines the amount of information
contained in the data.
 The scale indicates the data summarization and
statistical analyses that are most appropriate.

Slide
11
Scales of Measurement

 Nominal
• Data are labels or names used to identify an
attribute of the element.
• A nonnumeric label or a numeric code may be
used.

Slide
12
Scales of Measurement

 Nominal
• Example:
Students of a university are classified by the
school in which they are enrolled using a
nonnumeric label such as Business,
Humanities, Education, and so on.
Alternatively, a numeric code could be used for
the school variable (e.g. 1 denotes Business, 2
denotes Humanities, 3 denotes Education, and
so on).

Slide
13
Scales of Measurement

 Ordinal
• The data have the properties of nominal data and
the order or rank of the data is meaningful.
• A nonnumeric label or a numeric code may be
used.

Slide
14
Scales of Measurement

 Ordinal
• Example:
Students of a university are classified by their
class standing using a nonnumeric label such as
Freshman, Sophomore, Junior, or Senior.
Alternatively, a numeric code could be used for
the class standing variable (e.g. 1 denotes
Freshman, 2 denotes Sophomore, and so on).

Slide
15
Scales of Measurement

 Interval
• The data have the properties of ordinal data and
the interval between observations is expressed in
terms of a fixed unit of measure.
• Interval data are always numeric.

Slide
16
Scales of Measurement

 Interval
• Example:
Melissa has an SAT score of 1205, while Kevin
has an SAT score of 1090. Melissa scored 115
points more than Kevin.

Slide
17
Scales of Measurement

 Ratio
• The data have all the properties of interval data
and the ratio of two values is meaningful.
• Variables such as distance, height, weight, and
time use the ratio scale.
• This scale must contain a zero value that indicates
that nothing exists for the variable at the zero
point.

Slide
18
Scales of Measurement

 Ratio
• Example:
Melissa’s college record shows 36 credit hours
earned, while Kevin’s record shows 72 credit
hours earned. Kevin has twice as many credit
hours earned as Melissa.

Slide
19
Qualitative and Quantitative Data

 Data can be further classified as being qualitative or


quantitative.
 The statistical analysis that is appropriate depends on
whether the data for the variable are qualitative or
quantitative.
 In general, there are more alternatives for statistical
analysis when the data are quantitative.

Slide
20
Qualitative Data

 Qualitative data are labels or names used to identify


an attribute of each element.
 Qualitative data use either the nominal or ordinal
scale of measurement.
 Qualitative data can be either numeric or
nonnumeric.
 The statistical analysis for qualitative data are rather
limited.

Slide
21
Quantitative Data

 Quantitative data indicate either how many or how


much.
• Quantitative data that measure how many are
discrete.
• Quantitative data that measure how much are
continuous because there is no separation between
the possible values for the data..
 Quantitative data are always numeric.
 Ordinary arithmetic operations are meaningful only
with quantitative data.

Slide
22
Cross-Sectional and Time Series Data

 Cross-sectional data are collected at the same or


approximately the same point in time.
• Example: data detailing the number of building
permits issued in June 2000 in each of the counties
of Texas
 Time series data are collected over several time
periods.
• Example: data detailing the number of building
permits issued in Travis County, Texas in each of
the last 36 months

Slide
23
Data Sources

 Existing Sources
• Data needed for a particular application might
already exist within a firm. Detailed information
is often kept on customers, suppliers, and
employees for example.
• Substantial amounts of business and economic
data are available from organizations that
specialize in collecting and maintaining data.

Slide
24
Data Sources

 Existing Sources
• Government agencies are another important
source of data.
• Data are also available from a variety of industry
associations and special-interest organizations.

Slide
25
Data Sources

 Internet
• The Internet has become an important source of
data.
• Most government agencies, like the Bureau of the
Census (www.census.gov), make their data
available through a web site.
• More and more companies are creating web sites
and providing public access to them.
• A number of companies now specialize in making
information available over the Internet.

Slide
26
Data Sources

 Statistical Studies
• Statistical studies can be classified as either
experimental or observational.
• In experimental studies the variables of interest
are first identified. Then one or more factors are
controlled so that data can be obtained about how
the factors influence the variables.
• In observational (nonexperimental) studies no
attempt is made to control or influence the
variables of interest.
• A survey is perhaps the most common type of
observational study.

Slide
27
Data Acquisition Considerations

 Time Requirement
• Searching for information can be time consuming.
• Information might no longer be useful by the time
it is available.
 Cost of Acquisition
• Organizations often charge for information even
when it is not their primary business activity.
 Data Errors
• Using any data that happens to be available or
that were acquired with little care can lead to poor
and misleading information.

Slide
28
Descriptive Statistics

 Descriptive statistics are the tabular, graphical, and


numerical methods used to summarize data.

Slide
29
Example: Hudson Auto Repair

The manager of Hudson Auto would like to


have
a better understanding of the cost of parts used in the
engine tune-ups performed in the shop. She examines
50 customer invoices for tune-ups. The costs of parts,
rounded to the nearest dollar, are listed below.
91 78 93 57 75 52 99 80 97 62
71 69 72 89 66 75 79 75 72 76
104 74 62 68 97 105 77 65 80 109
85 97 88 68 83 68 71 69 67 74
62 82 98 101 79 105 79 69 62 73

Slide
30
Example: Hudson Auto Repair

 Tabular Summary (Frequencies and Percent


Frequencies)
Parts Percent
Cost ($) Frequency Frequency
50-59 2 4
60-69 13 26
70-79 16 32
80-89 7 14
90-99 7 14
100-109 5 10
Total 50 100

Slide
31
Example: Hudson Auto Repair

 Graphical Summary (Histogram)


18
16
14
Frequency

12
10
8
6
4
2
Parts
50 60 70 80 90 100 110 Cost ($)

Slide
32
Example: Hudson Auto Repair

 Numerical Descriptive Statistics


• The most common numerical descriptive statistic
is the average (or mean).
• Hudson’s average cost of parts, based on the 50
tune-ups studied, is $79 (found by summing the
50 cost values and then dividing by 50).

Slide
33
Statistical Inference

 Statistical inference is the process of using data


obtained from a small group of elements (the sample)
to make estimates and test hypotheses about the
characteristics of a larger group of elements (the
population).

Slide
34
Example: Hudson Auto Repair

 Process of Statistical Inference

1. Population
consists of all 2. A sample of 50
tune-ups. Average engine tune-ups
cost of parts is is examined.
unknown.

4. The value of the 3. The sample data


sample average is used provide a sample
to make an estimate of average cost of
the population average. $79 per tune-up.

Slide
35
End of Chapter 1

Slide
36

You might also like