0% found this document useful (0 votes)
8 views

DS_UNIT 3

Chapter 3 of the Grade IX Data Science curriculum focuses on data visualization, emphasizing its importance in simplifying complex data and uncovering insights through graphical representations. It covers various visualization techniques, including histograms, bar graphs, and dot plots, while also discussing single and multivariable plots. The chapter includes practical applications, exercises, and worksheets to reinforce understanding of the concepts presented.

Uploaded by

Diya Koul
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views

DS_UNIT 3

Chapter 3 of the Grade IX Data Science curriculum focuses on data visualization, emphasizing its importance in simplifying complex data and uncovering insights through graphical representations. It covers various visualization techniques, including histograms, bar graphs, and dot plots, while also discussing single and multivariable plots. The chapter includes practical applications, exercises, and worksheets to reinforce understanding of the concepts presented.

Uploaded by

Diya Koul
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 37

Grade IX

Data
Science

Version 1.0
Chapter 3:

Data Visualizations
LEARNING OUTCOME
• The importance of visualization
• Plotting data
• Histograms
• Use of shapes
• Use of single and multivariable
plots
Data Visualization

Data visualization is the


mechanism of representing raw
data in the form of graphical
representations that allow users
to explore the data and uncover
quick insights.
Importance of Data Visualization
• With so much information around us, it
is challenging to view the data and
derive insights from it.

• Representing data through


visualizations like graphs, charts, maps,
etc., gives us a visual context of the
data.

• It also makes complex data simple and


enables the human mind to understand
its significance.
Visualizations allow us to recognize trends, patterns and
outliers from seemingly meaningless records of data.
Data visualization techniques use visual data in a universal,
fast, and powerful way to communicate information. This
approach enables viewers, mostly business analysts and
company executives to determine which areas need to be
changed, which factors determine customer satisfaction
and customer dissatisfaction.
Visualized data gives a more precise prediction of revenue
volumes and potential development for customers,
company owners, and decision-makers.

Let us now look at a few real-life uses of data


visualizations.
1. Tracking student progress with
scorecards
Recording student scores over a period helps
analyze the progress of the students. Also helps
understand their strengths and weaknesses
helping teachers and parents provide better
assessment for the students.

2. Identifying usage trend of a website


Imagine you are a website administrator. You want
to identify the pattern when most people visit your
website. To do this, you need to track user visits
and the time when they used your website. If you
plot this in a bar chart, you will quickly identify at
what time usage of your website peaks.
3.Monitoring goals and results of a sales
executive
Most sales executives in organizations have
goals. Visualizing their sales records in charts
and graphs helps to easily picture how close
they are to their goals and what steps they
need to take.

4.Visualizing spread and impact of


pandemics
Pandemics like COVID 19 have impacted the
entire world. Data visualization techniques help
to identify the most affected countries or
regions. They also show a trend if the spread
increases or decreases, allowing governments
and other bodies to take necessary actions.
Plotting data
There are different ways to visualize data depending
on the data being modeled and its purpose. Different
kinds of graphs and tables can be used to visualize
the data.

Students will learn about the following in this


section:
• Dot plots

• Bar graphs

• Maximum and Minimum

• Frequency
Here, we shall be using
spreadsheets from the
open office to plot the
data. Open office can be
downloaded and installed
using the URL:
https://www.openoffice.org
Dot Plot
• A dot plot is a graphical display
of data using dots.
• Dots are used in dot plots to
illustrate the quantitative values
associated with the categorical
values.
For Example, Minutes to reach
school.
This data shows how long does it
take five people to reach the
school.
Minutes: 6, 2, 4, 8, 5
Person: A, B, C, D, E
Bar Graphs

• A bar graph is a graphical


display of data using bars
of different heights. It is
possible to plots the bars
vertically or horizontally.
• In a bar graph, the bars
are presented to show
elements so that they do
not touch each other.
Column Chart

A bar graph is called a


column chart or graph.
Maximum and Minimum

The minimum of the data


The maximum of the data
is less than or equal to any
is greater than or equal to
other values in our data
all other values. If we had
set. If we had to order all
to order all our data in
our data in ascending
ascending order, so the
order, so the first number
last number listed would
in our list would be the
be the maximum.
minimum.
Frequency

The frequency of a data value is the number of times the data value
occurs/repeats.

For Example, if five students have a score of 85 in English, the score of


85 is said to have a frequency of 5. The frequency of a data value is
often represented by f.
Histograms and Use of
shapes
• A histogram is a graphical display of data using
bars of different heights. It is used to
summarize discrete or continuous data.
• In other words, a histogram displays the
number of data points that fall within a given
set of values (called 'bins') to provide a visual
representation of numeric data. Unlike a
vertical bar graph, a histogram shows no gaps
between the bars.
• A histogram also gives the value of the mode
of the frequency distribution graphically.
• Bin Widths - Simply stated, Bin widths are the
range size.
Activity
The below data set contains the height of some students
for a class.
101,105,102,107,104,108,101,102,109,104,103,109,106,
101
To draw a histogram from this, we first need to organize
the data into intervals. These intervals are also called
logical ranges or bins.
Use of shapes
In this module, we will understand the different shapes of a histogram and what they
mean.
Normal Distribution
Data points in a normal distribution are as likely to occur on one side of the average as on
the other side of the average.
Right Skewed Distribution
Many data points occur on the left side, with fewer data points on the right side in a right-
skewed distribution. A right-skewed distribution occurs when the data has a range
boundary on the left-hand side of the histogram. A right-skewed distribution is also known
as a positively skewed distribution
Left Skewed Distribution
Many data points in a left-skewed distribution occur on the right side with a scarcer
number of data points on the left side.
A left-skewed distribution usually occurs when the data has a range boundary on the
histogram's right-hand side. A left-skewed distribution is also known as a negatively
skewed distribution.
Bimodal Distribution
A bimodal distribution has two peaks. In a
bimodal distribution, the data should be
separated and analyzed as separate normal
distributions.

Random Distribution
A random distribution lacks an apparent
pattern and has several peaks.
Use of single and
multivariable plots
Single Variable plots
To visualize one variable, the type of
graphs to use depends on the type of
the variable:
For categorical variables (or grouping
variables). You can visualize the count of
categories using a bar plot or a pie chart
to show each type's proportion.
You can visualize the variable's
distribution for a continuous variable
using density plots, histograms, etc.
Use of single and multivariable plots

Multi-Variable plots

Multi-variable plots are used to


display relationship among several
variables Example: We have a
survey of how many students
enrolled in schools 1 and 2 from
1995-2006. This is the data:
Example of a multi variable plot
50

45

40

35

30

25

20

15

10

0
Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

Company A Company B Company C


Exercises
Objective Type Questions
1. Data can be visualized using:
a. Graphs
b. Maps
c. Charts
d. All of the above
Answer: d
2. Which of the following statements is false?
a. Data visualization can absorb information quickly.
b. Data visualization decreases the insights and takes slower decisions.
c. Data visualization is a type of visual art.
d. None of the above
Answer: b
3. Which of the following is a use case of data visualization?
a. Healthcare
b. Sales and Marketing
c. Politics/Campaigning
d. All of the above
Answer: d
4. Bar Graph is a​
a. One-dimensional graph​
b. Two-dimensional graph​
c. Graph with no dimension​
d. None of the above​
Answer: a​
5. The data represented through a histogram can help in finding graphically the
a. Median
b. Mean
c. Mode
d. All of the above
Answer: c
6. Pie Chart is a
a. One-dimensional graph
b. Two-dimensional graph
c. Graph with no dimension
d. None of the above
Answer: a
7. Can a Line chart be used to plot multiple variables?
a. True
b. False
Answer: a
8. The height of your classmates is recorded and arranged in ascending order. The
data is represented as a histogram. What type of shape does the histogram have
a. Right-skewed Distribution
b. Left-skewed Distribution
c. Bimodal Distribution
d. Random Distribution
Answer: b
CHAPTER 3 – DATA VISUALISATION
WORKSHEET-3.1
FILL IN THE BLANKS

1.A ____________________ is a graphical display of data using dots.


2. _____________________ of a data value is the number of times the data
value occurs/repeats.
3.____________ defines the range size.
4. Data points in a ______________________ occur on one side of the
average as on the other side of the average.
CHAPTER 3 – DATA VISUALISATION
WORKSHEET-3.1
FILL IN THE BLANKS

1.A DOT PLOT is a graphical display of data using dots.


2.FREQUENCY of a data value is the number of times the data value
occurs/repeats.
3.BIN WIDTH defines the range size.
4. Data points in a NORMAL DISTRIBUTION occur on one side of the
average as on the other side of the average.
CHAPTER 3 – DATA VISUALISATION

WORKSHEET-3.1

Ques 2. Answer the following questions:

1. Define Data Visualization.

2. Give three points stating the importance of data visualization.

3. What is a histogram?

4. State the difference between a single-variable plots and multi-variable plots.

5. Differentiate between Line chart and pie chart.


WORKSHEET-3.2
Q1. MULTIPLE CHOICE QUESTIONS
1. Column graph is a
a) One-dimensional graph
b) two-dimensional graph
c) Graph with no dimension
d)None of the above
2. Which amongst the following are the chart elements:
a) Title
b) Legend
c) Grid
d)All of the above
WORKSHEET-3.2
Q1. MULTIPLE CHOICE QUESTIONS
1. Column graph is a
a) One-dimensional graph
b) two-dimensional graph
c) Graph with no dimension
d)None of the above
2. Which amongst the following are the chart elements:
a) Title
b) Legend
c) Grid
d)All of the above
3. Select the type of graph shown below:​​

a) Bar graph​
B) Line Graph​
c) Column Graph​
d)Pie Chart​
4. The data represented through a histogram can help in finding graphically the:​
a) Mean​
b) Median​
c) Mode​
d) All of the above​
5.A _____________________ is a tabular representation that summarizes the raw
categorical data.​
a) Frequency Table​
b) Data Range​
c) Bin Widths​
d) None of the above​

3. Select the type of graph shown below:​​

a) Bar graph​
B) Line Graph​
c) Column Graph​
d)Pie Chart​
4. The data represented through a histogram can help in finding graphically the:​
a) Mean​
b) Median​
c) Mode​
d) All of the above​
5.A _____________________ is a tabular representation that summarizes the raw
categorical data.​
a) Frequency Table​
b) Data Range​
c) Bin Widths​
d) None of the above​
Thank You

You might also like