Data Handling and Visualization Midterm Assessment (20pts)
1. Please define the concepts of a primary key and a foreign key as components of a relational
database (2pts):
2. You have been given an Excel file that contains information about sales for a company that
manufactures cabinets. They operate all over the United States. The client that gave you the
file asks you to run a report that summarizes sales by state. You decide that you need to
clean the file first. You run a pivot table on the data but notice that the state VA appears
three times. What issue has more than likely occurred here, and how would you remediate it
using an Excel function? (2pts)
The most likely issue is stray whitespace. Some of the VA entries probably
have one or more spaces before or after the state code, so Excel treats the
three variants as distinct values and the pivot table lists VA three times.
To fix this, apply the Excel function TRIM() to the state column (for example,
in a helper column) to strip the leading and trailing spaces, then rebuild the
pivot table on the cleaned values so that the three entries combine into one.
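The same idea can be sketched outside Excel. The Python snippet below is only an illustration, not part of the client's workbook: the sales rows are made-up sample data, and trim() and total_by_state() are hypothetical helper names. trim() is a rough analogue of Excel's TRIM() that handles ordinary space characters only.

```python
from collections import defaultdict


def trim(text: str) -> str:
    """Rough analogue of Excel's TRIM(): drop leading/trailing spaces
    and collapse runs of internal spaces into a single space."""
    return " ".join(part for part in text.split(" ") if part)


# Hypothetical sales rows (state, amount); the VA entries differ only by stray spaces.
rows = [("VA", 100.0), (" VA", 50.0), ("VA ", 25.0), ("MD", 40.0)]


def total_by_state(rows, clean=False):
    """Sum amounts per state, optionally trimming the state code first."""
    totals = defaultdict(float)
    for state, amount in rows:
        key = trim(state) if clean else state
        totals[key] += amount
    return dict(totals)


raw_totals = total_by_state(rows)                # VA shows up under three different keys
clean_totals = total_by_state(rows, clean=True)  # VA shows up under one key

print(raw_totals)
print(clean_totals)
```

Before cleaning, the summary has three separate VA groups, which is what the pivot table showed; after trimming, they collapse into a single VA total.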
3. You have been hired to consult with an organization pertaining to their new sales initiative.
You have done a lot of analysis, and now it's time to present your findings. You are debating
between doing a live presentation for the client and writing a report and sending it to the
client. Compare and contrast delivering a live presentation vs. delivering a written report.
(2pts)
4. Review the below table, and note two items you could change to place focus on the data:
(2pts)
5. Jackson has been asked to create a data visualization that shows how expenses have trended
for the past 18 months. Which type of visualization would be optimal to show this? (1pt)
6. Amber needs to show the relationship between the cost of supplies and the number of hours
worked by employees. Which type of visualization would show this relationship the best?
(1pt)
A scatter plot
7. Review the below visualizations. Name two ways in which you could improve this
visualization, if it must remain a line chart (4pts):
1.) Descriptive analytics: These are procedures that summarize existing data to
determine what has happened in the past.
Q: Which product was the most profitable in the last five years?
2.) Diagnostic analytics: These are procedures that explore the current data to determine
why something happened the way it did, typically by comparing the data to a benchmark.
Q: Why did the business lose money in the last financial year even though sales were
higher than in previous years?
3.) Predictive analytics: These are procedures used to generate a model that can be used
to determine what is likely to happen in the future.
Q: What is the maximum duration for which a business can maintain an outstanding
payment from a sale?
Q: What would be less costly, having a local manufacturer for the business or
outsourcing it to a Chinese factory?