0% found this document useful (0 votes)
43 views

Data Enguneer

Uploaded by

chai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
0% found this document useful (0 votes)
43 views

Data Enguneer

Uploaded by

chai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 5
1. Computer Science Fundamentals (If you don’t have a CS background) Watch this if you don’t have a computer science background, as a Data Engineer having good knowledge of CS fundamentals is important to understand big systems and how they work Watching these videos will give you a basic understanding of CS fundamentals You can watch the first 7 lectures from this playlist a. C950 2022 b. Book - Grokking Algorithms: An illustrated guide Do any courses, your main goal here is to understand how to write basic Python Code and how to work with different datasets! a. Darshil - Python for Data Engineering (Recommended) b, DataCamp - Data Enaineering With Python ©, Coursera - Python for Everybody Specialization (Do this if you don't know anything about python) 4d. Udemy - Python Bootcamps: Learn Python Programming and Code Training @. freeCodeCamp - Learn Python - Full Course for Beginners Practice Projects: ‘© Scrape Data Using BeautifulSoup Library eg. Amazon, Covid, Wikipedia, or any website you like Build A Calculator Using Python 3. SQL (Structured Query Language) Learn about the basics of SQL and how to write queries, once you complete the course make sure you do hands-on practice on Hackerrank or any website you like! a, Udemy - The Complete SQL Bootcamp for the Manipulation and Analysis of Data (Recommended) b. Coursera - SQL for Data Science ¢. DataCamp - Intro To SQL DataCamp Practice SQL here © Hackerrank SQL 4, Basics Of Linux Why Linux? Because you will be working with many remote machines, doing SSH to access them, and performing operations so it's important to learn them. You don't have to remember all the commands but just understand what they do and how to write them a, Udemy - Linux for Beginners: Linux Basics b. Coursera - Linux Fundamentals ¢. freeCodeCamp - Top 50 Most Popular Linux Commands (Recommended) Do Hands-On Project © Beginner Data Engineering Portfolio Project (Recommended) 5. Big Data Fundamentals This section is theoretical and you need to understand how big data system works and their history of them a, Coursera - Big Data Specialization (Recommended) b. Udemy - Lear Big Data: The Hadoop Ecosystem Masterclass (Do this if you want to learn about legacy systems) Lear Fundamentals and then learn one tool, Snowflake, BigQuery, Redshift, etc... Just learn one and you are good! a, Fundamentals i. Coursera - Data Warehousing for Business Intelligence Specialization (recommended for deep dive) Udemy - Data Warehouse Fundamentals for Beginners (recommended for quick learning) b, Tools i. Snowflake - Snowflake — The Complete Masterclass ji, Snowflake Doc - httos:/www snowflake. com/certifications! a, Spark Fundamentals i. DataCamp - Big Data Fundamentals with PySpark (recommended) ii. Udemy - Spark and Python for Bia Data with PySpark b, Databricks i. Udemy - Azure Databricks & Spark Core ii, Udemy - Databricks Certified Data Engineer Associate ili, Coursera - Databricks for Data Engineering a, Realtime Streaming (Kafka) i. Udemy - Apache Kafka Course for Beginners: Leam Kafka Online (check this) ji. edX - Building ETL and Data Pipelines with Bash, Airflow, and Kafka Do Hands-On Project - Stock Market Real-Time Streaming Pipeline a. Udemy - The Complete Hands-On Introduction to Apache Airflow b. DataCamp - Airflow Do Hands-On Projéet\- Twitter Data Pipeline using Airflow 10. Cloud Computing ‘Advance section, do courses, and then do the certification to add value in your Resume, If you are new then start with AWS but if you know about other clouds then you can do that too! a. AWS (Amazon Web Services) i, Udemy - Ultimate AWS Certified Cloud Practitioner ii, Udemy - Ultimate AWS Certified Solutions Architect Associate (SAA\ ili, Coursera - AWS Solution Architect Associate b. GCP (Google Cloud Platform) i. Coursera - Cloud Data Engineer Professional Certificate ¢. Microsoft Azure i. Coursera - Microsoft Azure Data Engineering Associate ii, Udemy - AZ-900: Microsoft Azure Fundamentals ili, Udemy - Azure Data Engineer Certified:8 COURSE BUNDLE Do Hands-On Project 1, Build ETL Pipeline Using AWS Cloud 2. Covid Data Analysis Project 3. YouTube Data Analysis (End-To-End Data Engineering Project) 11, Learn Modern Data Stack a, Learn Basics - httos://analyticsindiamag.com/moder-data-stack-and-what-we-know-aboutit! b, Dbt- httos/iwww.cetdbt.com/dbttearny ©. Airbyte - httpsi/airbyte com! d, Fivetran - hitps:/www.fivetran.com/ a. Docker Guide - https://www.coursera org/projects/docker-for-absolute-beginners b, Udemy - Docker & Kubernetes: The Practical Guide Recommended Books 1. Designing Data-Intensive Applications 2. Fundamentals of Data Engineering 3, ‘The Data Warehouse Toolkit Read Real-World Case Studies 1. Netflix - https://netflixtechblog medium.com/ 2. AWS - https://aws.amazon.com/solutions/case-studies/ 3. GCP - https://cloud,qooale.com/customers 4, Azure - https://azure microsoft, com/en-us/resources/customer-stories/ Follow Me Here: 1, Twitter - https://twitter.com/parmardarshilO7, 2. Linkedin - https://www.linkedin, com/in/darshil-parmar! 3. YouTube - https:/www.youtube.com/c/DarshilParmar All the best <3

You might also like