Tableau Assignment
For this exercise, submit a WORD DOCUMENT WITH A SCREENSHOT OF YOUR VISUALS AND a “.twb” file. To create the “.twb”
file, choose File | Save As | Tableau Workbook (*.twb).
Note that the flight data in exercises 1 and 3 consists of 7.1 million rows. Excel is limited to 1,048,576 rows per
worksheet (about 1 million). This is one advantage of Tableau over Excel: it can handle much larger files.
1. One aspect of flying that people dislike is having their flight canceled. Is there a certain day of the
week on which flights are more likely to be canceled? Which day is it, and how many flights were canceled on that
day? (Think of the day of the week, such as Wednesday, not a particular date such as 1/11.)
Wednesday (21,779)
You will need to filter the “cancelled” variable to True so that only cancelled flights are shown.
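Outside Tableau, the same logic (keep only cancelled flights, then count them by weekday name) can be sketched in Python. This is only an illustration: the records and the field names `date` and `cancelled` are made up and are not taken from the flight dataset.

```python
from collections import Counter
from datetime import date

# Fixed English names so the result does not depend on the system locale.
WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]

# Hypothetical toy records; the real dataset's field names may differ.
flights = [
    {"date": date(2011, 1, 5), "cancelled": True},   # Wednesday
    {"date": date(2011, 1, 12), "cancelled": True},  # Wednesday
    {"date": date(2011, 1, 6), "cancelled": False},  # Thursday
    {"date": date(2011, 1, 7), "cancelled": True},   # Friday
]


def cancellations_by_weekday(records):
    """Count cancelled flights per weekday name, ignoring the calendar date."""
    return Counter(
        WEEKDAYS[r["date"].weekday()] for r in records if r["cancelled"]
    )


counts = cancellations_by_weekday(flights)
worst_day, worst_count = counts.most_common(1)[0]
print(worst_day, worst_count)
```

The key point is that the grouping is by weekday name, so two different Wednesdays land in the same bucket.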
2. You are interested in identifying days that are particularly heavy in terms of travel each year. Identify the
day(s) that seem to be the busiest, and show on the chart the number of flights for those days over the range of data
provided (May 8, 2011).
• When looking at data “over time”, line charts tend to be a good choice. Consider the level of detail (i.e., quarters,
months, weeks, days, etc.) being requested. If you start with years and break the data down into finer periods such as
weeks, you can always remove the unneeded units of time from the chart to leave just the specific time period
needed.
• Look at Exercise 3 to get an idea of what some of the charts in Exercise 1 should look like.
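To see why the level of detail matters, here is a minimal Python sketch that counts the same flights at two granularities. The dates are invented for illustration, and `count_by` is a hypothetical helper, not part of Tableau or the dataset.

```python
from collections import Counter
from datetime import date

# Hypothetical flight dates, one entry per flight (not from the real data).
flight_dates = [
    date(2011, 3, 1), date(2011, 3, 1), date(2011, 3, 1),
    date(2011, 3, 2),
    date(2011, 4, 1), date(2011, 4, 1),
]


def count_by(dates, key):
    """Count flights after mapping each date to a time period."""
    return Counter(key(d) for d in dates)


# Coarse level of detail: one bucket per (year, month).
per_month = count_by(flight_dates, lambda d: (d.year, d.month))
# Fine level of detail: one bucket per calendar day.
per_day = count_by(flight_dates, lambda d: d)

busiest_day, n_flights = per_day.most_common(1)[0]
print(per_month)
print(busiest_day, n_flights)
```

At the monthly level the busiest individual day is hidden inside the month total, so it only becomes visible once the data is broken down to days.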
3. Next, you want to know who operates the most flights. Is there one airline that operated more flights to Texas than
any other single airline for each month in this dataset?
Hint: Use just a single visual for this exercise as well. Consider using color or shape to distinguish the different
airlines.
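The question amounts to counting flights per month and airline, then checking whether the same airline leads every month. A rough Python sketch of that logic follows; the airline names, months, and record layout are placeholders chosen for illustration only.

```python
from collections import Counter, defaultdict

# Hypothetical (month, airline, destination_state) records for illustration.
flights = [
    ("2011-01", "Airline A", "TX"),
    ("2011-01", "Airline A", "TX"),
    ("2011-01", "Airline B", "TX"),
    ("2011-01", "Airline B", "CA"),
    ("2011-02", "Airline B", "TX"),
    ("2011-02", "Airline B", "TX"),
    ("2011-02", "Airline A", "TX"),
]


def top_airline_per_month(records, state):
    """Return the airline with the most flights to `state` for each month."""
    counts = defaultdict(Counter)
    for month, airline, dest in records:
        if dest == state:
            counts[month][airline] += 1
    return {m: c.most_common(1)[0][0] for m, c in counts.items()}


leaders = top_airline_per_month(flights, "TX")
# True only if a single airline leads in every month.
same_leader_every_month = len(set(leaders.values())) == 1
print(leaders, same_leader_every_month)
```

In a single Tableau visual, the same comparison is what color or shape per airline lets you read off month by month.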
4. You would also like to know which states tend to experience the longest arrival delays. Create a single map
that visualizes the arrival delay in minutes that you can expect in each origin state for Southwest Airlines. Allow the
user to filter by airline. Colors should range from dark red for states with the longest arrival delays to dark green
for states with the shortest arrival delays.
Hints:
• When looking at Arrival Delay, consider how to consolidate this data. What makes sense from a usability
perspective? For example, suppose you were looking at Flight Times from Indianapolis to Orlando. Would you
care about the total (i.e., sum) of the Flight Times of all the flights in one month, or would a consolidation
other than Sum, such as Average, be more useful?
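The difference between Sum and Average can be seen in a small Python sketch. The delay values and state codes below are invented for illustration and do not come from the flight data.

```python
from statistics import mean

# Hypothetical arrival delays in minutes, keyed by origin state.
delays_by_state = {
    "TX": [10, 20, 30, 40],  # many flights, modest delays
    "VT": [45, 50],          # few flights, long delays
}


def summarize(delays):
    """Compute both the total and the average delay for each state."""
    return {
        state: {"sum": sum(values), "avg": mean(values)}
        for state, values in delays.items()
    }


summary = summarize(delays_by_state)
worst_by_sum = max(summary, key=lambda s: summary[s]["sum"])
worst_by_avg = max(summary, key=lambda s: summary[s]["avg"])
print(worst_by_sum, worst_by_avg)
```

With Sum, the state with the most flights looks worst simply because it has more flights; with Average, the state where a typical flight is delayed longest stands out, which is closer to what a traveler can expect.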