Brochure Professional Certificate in Data Engineering
Brochure Professional Certificate in Data Engineering
FOUNDATIONS OFCERTIFICATE
CYBERSECURITY
AND
IN RISKENGINEERING
DATA MANAGEMENT
Get cutting-edge skills to advance your data engineering career.
Delivered
Deliveredinincollaboration
collaborationwith
with
Overview
As technology proliferates, data is evolving into a true strategic asset, and the demand for data engineers and
their specialized expertise is growing in tandem. In fact, data engineer was the fastest growing tech occupation in
2020, according to the Dice Tech Job Report. Why? Because before data scientists can glean useful information
from the mountains of data today’s organizations possess, the data must be architected, warehoused, and acces-
sible, and data engineers are responsible for building this infrastructure.
The MIT xPRO Professional Certificate in Data Engineering is an immersive 6–month program that’s designed to
provide job-ready, in-demand data engineering skills and a competitive edge in the marketplace. Through an
exploration of core concepts, tools, techniques and best practices, participants will learn data engineering essen-
tials, from building an effective data architecture and data warehouses to designing data models, streamlining
data processing, automating data pipelines, data wrangling, and big data engineering. They can also take advan-
tage of personalized feedback, live weekly office hours with course leaders, and the opportunity to develop a
GitHub portfolio for potential employers.
MIT xPRO’s online learning programs showcase industry-aligned content from world-renowned experts to make
learning accessible anytime, anywhere and solve this challenge for developing technical professionals.
“Data engineers build the ‘nervous system’ of the company. Without it, the company cannot react to
changes in the external business environment or within the organization. They build the software and
hardware systems that power the company’s vision and are masters not just of software, but of hard-
ware, networks, and analytic apps that are changing everyday data.”
Earn a certificate from MIT xPRO that recognizes your skills and success
Career switchers: Mid-career professionals aiming to switch to data engineering from IT,
analytics, finance, project management, supply chain, or another technical field.
Program prerequisites: there are no prerequisites for this program, but prior coding experience (in any language) and
a minimum of a bachelor's degree is recommended.
“Data engineering really is a core component of today’s data infrastructure. And because organizations
can’t function without data, it’s also a career with a great deal of opportunity and incredibly interesting
work as well.”
– Abel Sanchez, Research Scientist and Executive Director
of MIT’s Geospatial Data Center
Key Takeaways
This program is designed to prepare you with the skills you will need to start or continue
your career in data engineering. High-level learning outcomes for this program include:
Manage big data using data warehousing and workflow management platforms
Explore artificial intelligence and machine learning concepts, including reinforcement learning and deep
neural networks
Program Schedule
This program is organized into three main sections:
Section 1
As you begin your learning journey in data engineering, you will begin with learning (or refreshing your knowledge of)
the Python programming language as well as how to work with relational databases using SQL and how to work with
Python to create databases and server pipelines.
Module 0: Module 4:
Orientation Databases: Basic SQL Statements
The first week is an orientation module. You will have You will learn how to design a database using SQL and the
access to the learning platform from the program start MySQL platform conceptually and logically. This module also
date. You will also install the software and tools you will includes an introduction to physical database design.
use in the course on your own computer. There is no
teaching, all the content is pre-recorded.
Module 1: Module 5:
Introduction to Python Databases: Basic SQL Statements
You will learn the basic Python syntax including funda- In this module, you will learn more about physical database
mental data types and constructs. You will practice design using SQL queries, logical operators, and regular
using Python in multiple assignments. expressions.
Module 2: Module 6:
Python: NumPy Databases: Advanced SQL Statements
You will be introduced to the Numpy library and its You will have an opportunity to perform data cleaning and
functions for array manipulation and probability, visualize data in SQL. You will be introduced to the
distributions, and interpreting histograms. You will also client-server interface and how to connect to a driver using
learn how to use the Matplotlib library to create data MySQL and Python.
visualizations.
Section 3
In this section of the course, you will explore big data and data warehousing. You will discover the connections between
artificial intelligence (AI), machine learning, and data engineering.
Note: break weeks are included to cover project assignment work and prepare for the upcoming module.
Assignments and Portfolio Projects
Each module includes engaging assignments and culminates in at least one GitHub portfolio project that you’ll
complete based on what you have learned in that portion of the program.
Assignments
Coding Exercises
Coding exercises are integrated into various modules through simple activities using Jupyter Notebook. They
allow you to practice building composite skills to prepare you for the assignments and portfolio projects.
Portfolio Projects
Building a Predictive Machine Learning Model involving Feature Selection for Linear Regression
Building a Reinforcement Learning Model for Robot Navigation (from scratch in Python)
Running TensorFlow for a Deep Neural Network Model (Deep Dream in Colab)
Building a Producer/Subscriber Broker for Visualizing Streaming MQTT Sensor Data (integrating Things-
Board and FireBase)
Stream load over 100 million lines of data using DASK. Creating and write 20 files in parallel using DASK.
You will receive personalized feedback from your course leaders to include in your GitHub repositories, securing a
market-ready portfolio that’s ready to share with potential employers.
Career Preparation and Guidance
Stepping into a career in data engineering requires a variety of both hard and soft skils. This course offers you
guidance on navigating a career path into tech, including crafting your elevator pitch and communication tips. These
services are provided by Emeritus, our learning collaborator for this program. The program support team includes
course leaders to help you reach your learning goals. Eligible participants may receive introductions to our hiring
partners; however, job placement is not guaranteed.
Reflecting on your skills to learn how to troubleshoot and learn more quickly
“Companies today are being forced to respond faster and with more precision than ever. Data engi-
neers are responsible for innovating and building pipelines that can turn raw data into business
advantage. Companies like AirBnB, Uber, and Robinhood are masters of leveraging real-time data,
and many others are trying to understand how to do this. Data engineers are the problem solvers of
the cyber-info world.” – John R. Williams
Program Faculty
JOHN R. WILLIAMS He contributed to the 2013 report for the UK Office for Science
Foresight Project – The Future of Manufacturing.
Professor of Information Engineering in The MIT
Department of Civil and Environmental Engineering Alongside Bill Gates and Larry Ellison, he was named as one of
the 50 most powerful people in computer networks. He
consults for companies including Accenture, Schlumberger, SAP
Research, Microsoft Research, Kajima Corp, US Lincoln Labora-
tory, Sandia National Laboratories, US Intelligence Advanced
Research Projects Activity, Motorola, Phillip-Morris Inc., Ford
Motor Company, Exxon-Mobil, Shell, Total, and ARAMCO.
E
receive 75% to pass and obtain the certificate of
P L
completion.
AM
This is to certify that
S
has successfully completed
registering for the program. All certificate images are Sanjay Sarma John R. Williams Abel Sánchez
Vice President for Open Learning Professor of Information Engineering in Research Scientist and
About Emeritus
MIT xPRO is collaborating with online education provider Emeritus to deliver this online course through a dynamic,
interactive, digital learning platform. This course leverages MIT xPRO's thought leadership in engineering and
management practice developed over years of research, teaching, and practice.
Easily schedule a call with a program advisor
from Emeritus to learn more about this
MIT xPRO program. Connect with a
program advisor
Schedule a call
Email: [email protected]
Phone: +1-315-538-6867
You can apply for the program here
Apply