0% found this document useful (0 votes)
103 views4 pages

Data Handling and Visualization Midterm Assessment (20pts)

This document provides a midterm assessment on data handling and visualization. It asks students to define primary and foreign keys, describe how to address duplicate data values using TRIM in Excel, compare delivering a live presentation versus a written report, suggest improvements to a sample data visualization as a line chart, and identify appropriate visualizations and analytics types for different business questions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
103 views4 pages

Data Handling and Visualization Midterm Assessment (20pts)

This document provides a midterm assessment on data handling and visualization. It asks students to define primary and foreign keys, describe how to address duplicate data values using TRIM in Excel, compare delivering a live presentation versus a written report, suggest improvements to a sample data visualization as a line chart, and identify appropriate visualizations and analytics types for different business questions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Data Handling and Visualization Midterm Assessment (20pts)

1. Please define the concepts of a primary key and a foreign key as components of a relational
database (2pts):

A primary key is used to ensure data in the specific column is unique. It is


a column cannot have null values. It is either an existing table column or a
column that is specifically generated by the database according to a defined
sequence.
A foreign key is a column or group of columns in a relational database
table that provides a link between data in two tables. It is a column (or columns)
that references a column (most often the primary key) of another table.

2. You have been given an Excel file that contains information about sales for a company that
manufactures cabinets. They operate all over the United States. The client that gave you the
file asks you to run a report that summarizes sales by state. You decide that you need to
clean the file first. You run a pivot table on the data but notice that the state VA appears
three times. What issue has more than likely occurred here, and how would you remediate it
using an Excel function? (2pts)

The issue here might be of white spaces. The state name VA might be
having one or more white spaces before or after it, thus making the three
observations unique to excel and hence they appear thrice.
We would use the excel function, TRIM(), to remove the white
spaces appearing before and after the name, thus combining the three
observations into one.

3. You have been hired to consult with an organization pertaining to their new sales initiative.
You have done a lot of analysis, and now its time to present your findings. You are debating
between doing a live presentation for the client and writing a report and sending it to the
client. Compare and contrast delivering a live presentation vs. delivering a written report.
(2pts)

Delivering a live presentation is preferable when we want to control


what we want to convey to the audience. We can put forward the information we
want to share in a concise manner and explain the rest, the way we want to.
Sending a written report is beneficial when we want to convey the
facts and figure and express the data as it is. So, we can type out all the information
in detail explaining everything, which is to be conveyed to the employees of the
organization.

4. Review the below table, and note two items you could change to place focus on the data:
(2pts)

Metric Metric Metric


Team Location 1 2 3
Hornets NC 11.5 85.44 1.1
Cowboys TX 15.2 81.21 1.3
Chargers NV 17.1 87.98 1.3
Jets NY 11.2 86.01 1.1

The changes that can be made are:

1. ) Remove the borders, or keep it really light.

2. )Keeping a uniform white background across the table.

3. )Making the grid lines less bold.

5. Jackson has been asked to create a data visualization that shows how expenses have trended
for past 18 months. Which type of visualization would be optimal to show this? (1pt)

A simple bar graph or a line chart

6. Amber needs to show the relationship between the cost of supplies and the number of hours
worked by employees. Which type of visualization would show this relationship the best?
(1pt)

A scatter plot
7. Review the below visualizations. Name two ways in which you could improve this
visualization, if it must remain a line chart (4pts):

This visualization can be improved by:


1. )Remove the chart border.
2. )Remove grid lines.
3. )Remove data markers.
4. )Keeping the x-axis labels vertical.
8. )Please list four types of data analytics, and write a business question that could be addressed
by that analytic (6pts):

The four types of Analytics are:

1.) Descriptive analytics: These are procedures that summarize existing data to
determine what has happened in the past.

Q: Which product was the most profitable in the last five years?

2.)Diagnostic analytics: These are procedures that explore the current data to determine
why something has happened the way it has, typically comparing the data to a benchmark.

Q: Why did the business lose money in the last financial year even though the sales were
better than the previous years?

3.)Predictive analytics: These are procedures used to generate a model that can be used
to determine what is likely to happen in the future.

Q: What is the maximum duration for which a business can maintain an outstanding
payment from a sale?

4.)Prescriptive analytics: These are procedures that model data to enable


recommendations for what should be done in the future.

Q: What would be less costly, having a local manufacturer for the business or
outsourcing it to a Chinese factory?

You might also like