Case Study DataCamp
Case Study DataCamp
I N T R O D U C T I O N T O D ATA E N G I N E E R I N G
Vincent Vankrunkelsven
Data Engineer @ DataCamp
Ratings at DataCamp
Recalculate daily
Course Rating
course_id user_id
title
course_id
description
rating
programming_language
Course Rating
course_id user_id
title course_id
description rating
programming_language
Vincent Vankrunkelsven
Data Engineer @ DataCamp
The recommendations table
Matrix factorization
course_id
rating
course_id avg_rating
1 4.8
74 4.78
21 4.5
32 4.9
Recommend three highest rated SQL courses which are not 12 and 52.
Vincent Vankrunkelsven
Data Engineer @ DataCamp
What you've done so far
Update daily
recommendations.to_sql(
"recommendations",
db_engine,
if_exists="append",
)
dag = DAG(dag_id="recommendations",
scheduled_interval="0 0 * * *")
task_recommendations = PythonOperator(
task_id="recommendations_task",
python_callable=etl,
)
Vincent Vankrunkelsven
Data Engineer @ DataCamp
Introduction to data engineering
Databases