Data Transformation Assignment Instructions
Data Transformation Assignment Instructions
A focus area of this class is data analytics. Data analytics are the foundation for decision-making
in business. Often in your job, you will be asked to analyze data that is provided for you in the
form of files or spreadsheets. In most instances, these files will not contain all of the data fields
(columns) that you need to create high-quality analyzes. In this assignment, you will be
provided with a file of data. Excel will be utilized to derive new fields of data to improve the
value of the file for future analysis.
The purpose of the assignment is to transform and prepare data in a spreadsheet for data
analytics that we will perform in a future assignment. The intent of this assignment is NOT to
do the analytics but prepare the spreadsheet for analytics.
Before you attempt this assignment, I strongly recommend that you watch the “Data
Transformation Helpful Hints” video and perform the steps shown in the video using the “Data
Transformation Helpful Hints – Spreadsheet”.
You will perform the data transformations, identified below, utilizing Excel functions. To
successfully complete this assignment, you must use the following Excel functions at least once:
• TEXT
• VLOOKUP
• IF
• LEFT
• RIGHT
• PROPER
• CONCATENATE
• DAYS
• TRIM
• MATH FUNCTIONS (*, /, -)
1. Fill in all the columns that are highlighted in yellow using a function. Do not use the
flash fill capability. For this assignment we want you to learn to use the functions listed
above.
2. Selling Plant can be derived by using the first character of Sales Org. There is a
worksheet titled “Sales Org” that tells you the selling plant based on the sales org. Use a
VLOOKUP to derive the Selling Plant.
3. Customer Name is simply the Customer Description without the numbers and in the
proper case syntax. For example, “26003-WALMART” would be “Walmart”.
4. Order Month and Order Day should be in their full format (February, Tuesday) based on
the Order Date.
5. Region can be derived from Region Code and should be North, South or West. Use the
“If” function for this.
6. Region-City-State is to be the region, city and state columns with a “-“ between them.
For example, “North-Anklam-Mecklenburg”.
7. Material Description can be derived by using Material ID and the Cereals worksheet.
Use VLOOKUP to derive this column. Hint Material ID may have extra spaces in it.
8. Material Size – once you have material description this column is to either be 1KG or
500G based on the Material Description.
9. Delivery Time is number of days between Order Date and Delivery Date.
10. Delivery Month and Delivery Day should be in their full format (February, Tuesday)
based on the Delivery Date.
11. Price Per Unit Euro is the Revenue Euro / Quantity.
12. Quantity can be derived from the QTY/Unit field and must be formatted to be a number
to do calculations.
13. Gross Profit EURO can be calculated by subtracting COGS Euro from Revenue Euro.
14. Revenue USD and COGS USD can be calculated by multiplying the EURO number by 1.09.
15. Gross Profit USD can be calculated by subtracting COGS USD from Revenue USD.
16. You should format all Euro columns in accounting format using the € symbol and all USD
columns in accounting format using the $ sign.
Place the resulting Excel spreadsheet document in the Assignment4: Data Transformation
Dropbox.
Grading Rubric
• -5 for each function not used in the list above
• -6 for each column output that is incorrect
• -2 for each currency column not formatted correctly.