CH 01
CH 01
de datos
• Objetivo: Desarrollar habilidades en la
exploración de datos y el análisis
descriptivo mediante técnicas univariadas y
bivariadas.
• Además, enseñar el uso de gráficas y
visualizaciones para identificar patrones y
relaciones en los datos.
• Total: 20 horas
Índice
• Exploración de datos y medidas de resumen:
• Análisis univariado y bivariado.
• Medidas de tendencia central, dispersión y posición.
• Tablas de frecuencia y distribuciones de frecuencia.
• Gráficas y visualizaciones para el análisis descriptivo:
• Gráficos de barras, histogramas y diagramas de
dispersión.
• Gráficos de caja y bigotes, gráficos de torta y gráficos
de línea.
• Representación visual de relaciones y patrones en los
datos.
• Nota. Para este módulo se utilizará Microsoft Excel.
Slides Prepared by
JOHN S. LOUCKS
St. Edward’s University
Slide
4
Applications in
Business and Economics
Accounting
Public accounting firms use statistical sampling
procedures when conducting audits for their clients.
Finance
Financial advisors use a variety of statistical
information, including price-earnings ratios and
dividend yields, to guide their investment
recommendations.
Marketing
Electronic point-of-sale scanners at retail checkout
counters are being used to collect data for a variety of
marketing research applications.
Slide
5
Applications in
Business and Economics
Production
A variety of statistical quality control charts are used
to monitor the output of a production process.
Economics
Economists use statistical information in making
forecasts about the future of the economy or some
aspect of it.
Slide
6
Data
Slide
7
Data and Data Sets
Slide
8
Elements, Variables, and Observations
Slide
9
Data, Data Sets,
Elements, Variables, and Observations
Observation
Variables
Stock Annual Earn/
Company Exchange Sales($M) Sh.($)
Dataram AMEX 73.10 0.86
EnergySouth OTC 74.00 1.67
Keystone NYSE 365.70 0.86
LandCare NYSE 111.40 0.33
Psychemedics AMEX 17.60 0.13
Slide
10
Scales of Measurement
Slide
11
Scales of Measurement
Nominal
• Data are labels or names used to identify an
attribute of the element.
• A nonnumeric label or a numeric code may be
used.
Slide
12
Scales of Measurement
Nominal
• Example:
Students of a university are classified by the
school in which they are enrolled using a
nonnumeric label such as Business,
Humanities, Education, and so on.
Alternatively, a numeric code could be used for
the school variable (e.g. 1 denotes Business, 2
denotes Humanities, 3 denotes Education, and
so on).
Slide
13
Scales of Measurement
Ordinal
• The data have the properties of nominal data and
the order or rank of the data is meaningful.
• A nonnumeric label or a numeric code may be
used.
Slide
14
Scales of Measurement
Ordinal
• Example:
Students of a university are classified by their
class standing using a nonnumeric label such as
Freshman, Sophomore, Junior, or Senior.
Alternatively, a numeric code could be used for
the class standing variable (e.g. 1 denotes
Freshman, 2 denotes Sophomore, and so on).
Slide
15
Scales of Measurement
Interval
• The data have the properties of ordinal data and
the interval between observations is expressed in
terms of a fixed unit of measure.
• Interval data are always numeric.
Slide
16
Scales of Measurement
Interval
• Example:
Melissa has an SAT score of 1205, while Kevin
has an SAT score of 1090. Melissa scored 115
points more than Kevin.
Slide
17
Scales of Measurement
Ratio
• The data have all the properties of interval data
and the ratio of two values is meaningful.
• Variables such as distance, height, weight, and
time use the ratio scale.
• This scale must contain a zero value that indicates
that nothing exists for the variable at the zero
point.
Slide
18
Scales of Measurement
Ratio
• Example:
Melissa’s college record shows 36 credit hours
earned, while Kevin’s record shows 72 credit
hours earned. Kevin has twice as many credit
hours earned as Melissa.
Slide
19
Qualitative and Quantitative Data
Slide
20
Qualitative Data
Slide
21
Quantitative Data
Slide
22
Cross-Sectional and Time Series Data
Slide
23
Data Sources
Existing Sources
• Data needed for a particular application might
already exist within a firm. Detailed information
is often kept on customers, suppliers, and
employees for example.
• Substantial amounts of business and economic
data are available from organizations that
specialize in collecting and maintaining data.
Slide
24
Data Sources
Existing Sources
• Government agencies are another important
source of data.
• Data are also available from a variety of industry
associations and special-interest organizations.
Slide
25
Data Sources
Internet
• The Internet has become an important source of
data.
• Most government agencies, like the Bureau of the
Census (www.census.gov), make their data
available through a web site.
• More and more companies are creating web sites
and providing public access to them.
• A number of companies now specialize in making
information available over the Internet.
Slide
26
Data Sources
Statistical Studies
• Statistical studies can be classified as either
experimental or observational.
• In experimental studies the variables of interest
are first identified. Then one or more factors are
controlled so that data can be obtained about how
the factors influence the variables.
• In observational (nonexperimental) studies no
attempt is made to control or influence the
variables of interest.
• A survey is perhaps the most common type of
observational study.
Slide
27
Data Acquisition Considerations
Time Requirement
• Searching for information can be time consuming.
• Information might no longer be useful by the time
it is available.
Cost of Acquisition
• Organizations often charge for information even
when it is not their primary business activity.
Data Errors
• Using any data that happens to be available or
that were acquired with little care can lead to poor
and misleading information.
Slide
28
Descriptive Statistics
Slide
29
Example: Hudson Auto Repair
Slide
30
Example: Hudson Auto Repair
Slide
31
Example: Hudson Auto Repair
12
10
8
6
4
2
Parts
50 60 70 80 90 100 110 Cost ($)
Slide
32
Example: Hudson Auto Repair
Slide
33
Statistical Inference
Slide
34
Example: Hudson Auto Repair
1. Population
consists of all 2. A sample of 50
tune-ups. Average engine tune-ups
cost of parts is is examined.
unknown.
Slide
35
End of Chapter 1
Slide
36