0% found this document useful (0 votes)
1 views

Basic Ideas of Data Management

Uploaded by

reylene12814
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
1 views

Basic Ideas of Data Management

Uploaded by

reylene12814
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 32

GED 102 - MATHEMATICS IN THE MODERN WORLD

AIZELLE KEI D. CAMILON


DATA MANAGEMENT or
STATISTICS?

No part of this material may be reproduced, stored in a retrieval system or transmitted in any form or by any means – electronic, mechanical,
photocopying, recording or otherwise, without permission in writing from the author or lecturer.
• The science of collecting, organizing, presenting,
analyzing and interpreting numerical data.

• Sampling Methods – covers how data will be


collected.
Descriptive Statistics
• Descriptive statistics are used to describe the basic
features of the data in a study. They provide simple
summaries about the sample and the measures.

• The main focus of descriptive statistics is to summarize


and present data.
Inferential Statistics
• Inferential statistics also called Statistical Inference or
Inductive Statistics, is concerned with analyzing the organized
data leading to prediction or inferences.

• It implies that before carrying out an inference, appropriate


and correct descriptive measures or methods are employed
to bring out good results.
Variable
• The characteristic that is being studied.
• A variable is a characteristic that is observable or measurable
in every unit of the universe.
• It varies across individuals or objects.
• It includes age, race, gender, intelligence, personality type,
attitudes, political or religious affiliation, height, weight,
marital status, eye color, etc.
Qualitative Variables
• Qualitative variables express a categorical attribute, such
as sex (male or female),
religion, marital status, region of residence, highest
educational attainment.

• Qualitative variables do not strictly take on numeric values


(although we can have numeric codes for them, e.g., for
sex variable, 1 and 2 may refer to male, and female,
respectively).
Qualitative Variables
• Qualitative data answer questions “what kind.” Sometimes,
there is a sense of ordering in qualitative data, e.g., income
data grouped into high, middle and low-income status.
Data on sex or religion do not have the sense of ordering,
as there is no such thing as a weaker or stronger sex, and
a better or worse religion.

• Qualitative variables are sometimes referred to as


categorical variables.
Quantitative Variables
• Quantitative variables(otherwise called numerical) data,
whose sizes are meaningful, answer questions such as
“how much” or “how many”.

• Quantitative variables have actual units of measure.


Examples of quantitative variables include the height,
weight, number of registered cars, household size, and
total household expenditures/income of survey
respondents.
Quantitative data may be further classified into:
a. Discrete data are those data that can be counted, e.g.,
the number of days for cellphones to fail, the ages of
survey respondents measured to the nearest year, and
the number of patients in a hospital. These data assume
only (a finite or infinitely) countable number of values.

b. Continuous data are those that can be measured, e.g.


the exact height of a survey respondent and the exact
volume of some liquid substance. The possible values
are uncountably infinite.
Dependent and Independent Variable

a. Dependent Variable – the variable whose value is being


predicted.
b. Independent Variable – the predictor.

Examples:
• To predict the amount of sunlight on the growth of a
certain plant.
• To evaluate the effect of using computer to the
performance of students.
Data
• A collection of object on one or more variables.

• Factual information such as measurements or statistics


used as a basis for reasoning, discussion, or calculation.

• Information in numerical form that can be digitally


transmitted or processed (Merriam-Webster Dictionary)

• The raw material which the statistician works. It can be


found through surveys, experiments, numerical records,
and other modes of research.
Data Set
Data on the Profile of College Students in terms of Residence, Year level,
College, Religion, Family Income and Student Category
Methods of Data Collection
Variables were observed or measured using any of the three
methods of data collection:
1. Objective - uses any or combination of the five senses
(sense of sight, touch, hearing, taste and smell) to
measure the variable

2. Subjective - obtains data by getting responses through a


questionnaire.

3. Use of existing records.


• The objective and subjective methods obtained the data
directly from the source. The resulting data from Objective
and Subjective methods of data collection is referred to as
primary data.

• On the other hand, secondary data are obtained through


the use of existing records or data collected by other
entities for certain purposes.

• For example, when we use data gathered by the Philippine


Statistics Authority, we are using secondary data and the
method we employ to get the data is the use of existing
records. Other data sources include administrative records,
news articles, internet, and the like.
Univariate Analysis of Data
• Examination across cases of one variable at a time.

• Three major characteristics of a single variable that you tend to


look at:
1) The distribution,

2) The measures of central tendency and,

3) The measures of dispersion.


The Distribution
• Summary of the frequency of individual values or ranges of
values for a variable.

• Various techniques commonly used to describe the distribution:


1) Textual

2) Tabular

3) Graphical
Common Ways to Describe a Single Variable Using
Tabular Form

Frequency distribution

Cumulative Frequency distribution

Relative frequency distribution


Frequency Distribution
Cumulative Frequency Distribution
Relative Frequency Distribution
Graphical Methods Used for Interval Data
1. Histograms- a graph in which the classes are marked on
the horizontal axis (x-axis) and the frequencies on the
vertical axis (y-axis). No gap exist between columns because
the data is continuous. Continuous data can assume any
value in an interval.

2. Bar graph- graph similar to histogram, there are spaces


between bars.
Graphical Methods Used for Interval Data

3. Pie charts offer a variety of ways to display categorical data.


Data organized in discrete groups.

4. Frequency polygon – a line graph that displays data points


which are connected by lines.

5. Cumulative Frequency polygon (Ogives). A graph that displays


the cumulative frequencies for the classes in a frequency
distribution.
Scale of Measurement of Data

1. Nominal
2. Ordinal
3. Interval
4. Ratio
Scale of Measurement of Data
1. Nominal level of measurement arises when we have variables that
are categorical and nonnumeric or where the numbers have no sense
of ordering.
▪ As an example, consider the numbers on the uniforms of basketball
players. Is the player wearing a number 7 a worse player than the
player wearing number 10? Maybe, or maybe not, but the number
on the uniform does not have anything to do with their performance.
The numbers on the uniform merely help identify the basketball
player.
▪ Other examples of the variables measured at the nominal level
include sex, marital status, religious affiliation.
Scale of Measurement of Data
2. Ordinal level also deals with categorical variables like the nominal
level, but in this level ordering is important, that is the values of the
variable could be ranked.

▪ Examples of the ordinal scale include socio economic status (A to E,


where A is wealthy, E is poor), difficulty of questions in an exam
(easy, medium difficult), rank in a contest (first place, second place,
etc.), and perceptions in Likert scales.
Scale of Measurement of Data
3. Interval level does not only include “greater than” or “less than”
relationships but also has a limit of measurement that permits us how to
describe how much more or less one subject possesses than another.

It has no true zero which means zero is not really nothing.

Examples:
▪ Temperature (0℃)
▪ Scores on a test as a measure of knowledge (a score of 5 is better
than a 0 score)
Scale of Measurement of Data
4. Ratio Level also tells us that one unit has so many times as much of the property as
does another unit. The ratio level possesses a meaningful (unique and non-arbitrary)
absolute, fixed zero point and allows all arithmetic operations. The existence of the zero
point is the only difference between ratio and interval level of measurement.

▪ Examples of the ratio scale include mass, heights, weights, energy and electric
charge. With mass as an example, the difference between 120 grams and 135 grams
is 15 grams, and this is the same difference between 380 grams and 395 grams. The
level at any given point is constant, and a measurement of 0 reflects a complete lack
of mass.

▪ Amount of money is also at the ratio level. We can say that 2000 pesos is twice more
than 1,000 pesos. In addition, money has a true zero point: if you have zero money,
this implies the absence of money.
No part of this material may be reproduced, stored in a retrieval system or transmitted in any form or by any means – electronic, mechanical,
photocopying, recording or otherwise, without permission in writing from the author or lecturer.

You might also like