Visualisation For Data Science Predict Overview 3267
Visualisation For Data Science Predict Overview 3267
Predict
Predict Summary
In this predict, we will use PowerBI to connect to new data sources related to Eskom’s power generation, change the data source locations, and
create new visuals to derive the insight required to appropriately answer several MCQs.
01
Problem Introduction
02
Connecting to the Data
03
Sourcing Additional Data and Setting Relationships
04
Creating Visuals
You will be supplied with a ‘broken’ dashboard and the underlying data.
The reports you have created will be used to answer the multiple choice
questions at the end of this predict.
01
Problem Introduction
02
Connecting to the Data
04
Creating Visuals
01
Problem Introduction
02
Connecting to the Data
03
Sourcing Additional Data and Setting Relationships
04
Creating Visuals
Source Data
Capacities and Efficiencies Datasets:
Create Relationship
4. Repeat steps 1-3 for the Eskom power stations
efficiencies.xlsx dataset
5. Rename the Efficiencies dataset to ‘Eskom-
Efficiencies’ and the Capacities dataset to
‘Eskom-Capacities’
6. Create a 1-1 relationship between the ‘Names’
column of the Eskom-Capacities and Eskom-
Efficiencies datasets
Access Datasets
Dataset
1. Remove ‘Column10’
2. For ‘Fixed Operation and Maintenance Cost’ - fill
the missing values with 188
3. For ‘Variable Operation and Maintenance Cost’ -
fill the missing values with 45
4. For ‘Ramp Rate per hour’ - fill the missing
values with .3068
5. For ‘Energy Efficiency’ - fill the missing values
with .326731
01
Problem Introduction
02
Connecting to the Data
03
Sourcing Additional Data and Setting Relationships
04
Creating Visuals
The infrastructure dashboard gives us insight into attributes of each station. Using Power BI slicers and filters we can draw correlations between
station attributes, and explore and analyse this dataset in-depth.
Using what you’ve learnt up to this point, you are tasked to restore the supplied ‘broken’ infrastructure dashboard. You may use the below standing as guidance in your quest to rebuild
the dashboard
The Eskom capacity and efficiency dashboard is aimed at providing information on individual station performance.
Before data is sourced, cleaned
and relationships are created
Slicers
Add a ‘number of units’ slicer to the dashboard. Set the slicer type as between
Cards
Add the following cards to the report:
and relationships are created
After data is sourced, cleaned
Visuals
• For the scatter chart, add the feature ‘Max nominal capacity (MW)’ to the size
column
• For the clustered bar chart add the feature ‘status’ to the visualisations legend
field
Twitter Dashboard
The twitter dashboard summarises Eskom mentions (@) and tags (#) over a defined period
This page will be updated periodically with common predict-related questions which may arise during the Sprint. Consider consulting this
space before asking your course facilitator a question.