IBDT Project 2
IBDT Project 2
CLOUD BASED
PROTOTYPE
22.04.2024 1
FINAL
PROJECT
Pieces of the project
22.04.2024 2
FINAL
PROJECT
Prerequisites for the start
Class 6
Your object storage (bucket)
with data
Enter ID of GSOM Service account to work with Ask me if it is
Federation services not working
bpfq2tge5rupggql4gmq Your project selected …and…
Your GSOM account to Data for the project is Check the
enter processed permissions!
DataLens connection to your
data is tested
Class 7
22.04.2024 3
FINAL
PROJECT
Pipeline S3-YQL: data visualization
Steps:
1. Write YQL (SQL) queries for the EDA
Options market data (Exploratory Data Analysis)
2. Create connection with Yandex.DataLens for
further visualizations
YQL 3. Create a dataset(s) within Yandex.DataLens
for the charts and diagrams
TODAY’S 4. Create a dashboard for the final project’s
SEMINAR presentation
22.04.2024 4
FINAL
PROJECT
Pipeline S3-ClickHouse-Datalens: data visualization
Steps:
1. Verify the data in ClickHouse or PostgreSQL
JupyterHub logs 2. Create connection with Yandex.DataLens for
further visualizations
3. Verify the structure of the data (data model)
for the datasets
4. (optionally) Create new tables in the
database to speed up visualizations
TODAY’S 5. Create a dataset(s) within Yandex.DataLens
SEMINAR for the charts and diagrams
6. Create a dashboard for the final project’s
presentation
22.04.2024 5
FINAL PROJECT:
PRESENTATION
How it will be organized
22.04.2024 6
FINAL PROJECT:
PRESENTATION
My expectations / 1
22.04.2024 7
FINAL PROJECT:
PRESENTATION
My expectations / 2
1 EDA
Size of the data, number of
records
Structure 3 Final dashboard
Data types 2-3 pages / sheets
EDA (exploratory data Unique values for categories recommended
analysis) and insights Indicators are good
Insights and analytics: we do not practice
2 Bar plots
build a model, but we can find
dependencies Pie charts
High-load time periods Histograms
Trends Tables (if needed)
Structure changes ~8-10 diagrams /
indicators required
22.04.2024 8
FINAL PROJECT:
PRESENTATION
Criteria
22.04.2024 9
FINAL PROJECT:
LAST OF LABS
Plan for lab today
22.04.2024 10
FINAL
PROJECT
Quest for homework
22.04.2024 11