0% found this document useful (0 votes)
17 views

Data Scientist Syllabus Upd

Uploaded by

tjmwinter
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views

Data Scientist Syllabus Upd

Uploaded by

tjmwinter
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Data Scientist

Syllabus

Each course lasts two weeks or three weeks and represents 9 months
approximately 40 hours of study.

1 Basic Python

Your introduction to the world of data science! Key concepts and 3 weeks,

basic syntax in Python. Loops, conditions, and functions. The 
 40 hours


pandas library for data analysis. Your first analytical case study,
followed by your first project.


Chapter 1. Variables, Printing, Data Types, and Arithmetic Operations


Chapter 2. Strings


Chapter 3. Lists


Chapter 4. for Loops


Chapter 5. Nested Lists


Chapter 6. Conditions and Loops


Chapter 7. Creating Functions


Chapter 8. Dictionaries


Chapter 9. pandas for Data Analysis


Chapter 10. Data Preprocessing


Chapter 11. Analyzing Data and Presenting Results

• Chapter 12. A Quick Overview of the Jupyter Notebook


Data Preprocessing
+1 project for 

2 Compensating for less-than-perfect data. Handling missing your portfolio

and duplicate values. Changing data types. Systems thinking 3 weeks, 40 hours
for analysts.


Chapter 1. Introduction to Data Preprocessing


Chapter 2. Working with Missing and Duplicate Values


Chapter 3. Changing Data Types


Chapter 4. Categorizing Data

• Chapter 5. Systems and Critical Thinking for Analysts

Exploratory Data Analysis (EDA)


+1 project for 

3 Performing initial scans to detect patterns in data. Building your portfolio

basic graphs and generating your first hypotheses.

3 weeks, 40 hours


Chapter 1. Introduction to Exploratory Data Analysis (EDA)


Chapter 2. First Graphs and Conclusions


Chapter 3. Data Slices


Chapter 4. Working with Several Data Sources


Chapter 5. Relationships Between Datasets

• Chapter 6. Validating Results

Statistical Data Analysis


+1 project for 

4 Probability theory, the most common distributions, and
your portfolio

3 weeks, 40 hours
statistical methods in Python. Sampling and statistical
significance. Identifying and handling anomalies.


Chapter 1. Introduction to Statistical Data Analysis


Chapter 2. Descriptive Statistics


Chapter 3. Probability Theory

• Chapter 4. Testing Hypotheses

Integrated Project 1
+1 project for 

5 Identify patterns to help you determine whether a given video your portfolio

game will be commercially successful or not. 1 week, 20 hours

1-week break
6 Data Collection and Storage (SQL)
+1 project for 

How databases are structured and how to pull data from your portfolio

them using SQL queries. Finding data online.

2 weeks, 40 hours


Chapter 1. Introduction to Data Collection and Storage (SQL)


Chapter 2. Retrieving Data from Online Resources


Chapter 3. SQL as a Tool for Working with Data


Chapter 4. Advanced SQL Features for Analysis


Chapter 5. Relationships Between Tables


Chapter 6. Soft Skills

• Bonus Chapter: PySpark


+1 project for 

Introduction to Machine Learning

7 Mastering the basics of machine learning. How the scikit-


your portfolio

2 weeks, 40 hours
learn library works and how to apply it in your very first
machine learning project.


Chapter 1. Introduction to Machine Learning


Chapter 2. Training Your First Model


Chapter 3. Model Quality


Chapter 4. Model Improvement

• Chapter 5. Moving on to Regression


+1 project for 

Supervised Learning

8 your portfolio

Diving into the most in-demand area of machine learning.


2 weeks, 40 hours
How to tune machine learning models, improve metrics, and
work with imbalanced data.


Chapter 1. Introduction to Supervised Learning


Chapter 2. Feature Preparation


Chapter 3. Classification Metrics


Chapter 4. Imbalanced Classification


Chapter 5. Regression Metrics

• Chapter 6. Soft Skills

9 Machine Learning in Business


+1 project for 

Applying what you’ve learned to business tasks. Discover your portfolio

business metrics, A/B testing, the bootstrapping technique, 2 weeks, 40 hours


and data labeling.


Chapter 1. Course Introduction


Chapter 2. Business Metrics


Chapter 3. Implementing New Functionality


Chapter 4. Data Collection

• Chapter 5. Soft Skills

10 Integrated Project 2
+1 project for

Prepare a prototype of a machine learning model to help your portfolio

a mining company develop efficient solutions.

1 week, 20 hours

1-week break
Linear Algebra
+1 project for 

Taking a deeper look at some algorithms you’ve already your portfolio

11 studied and understanding how to apply them. Key concepts 2 weeks, 40 hours
in linear algebra: vectors, matrices, and linear regression.


Chapter 1. Course Introduction


Chapter 2. Vectors and Vector Operations


Chapter 3. Distance Between Vectors


Chapter 4. Matrices and Matrix Operations

• Chapter 5. Linear Regression from the Inside

Numerical Methods
+1 project for 

12 Analyzing a number of algorithms that use numerical your portfolio

methods and applying them to practical tasks. Gradient 2 weeks, 40 hours


descent, gradient boosting, and neural networks.


Chapter 1. Course Introduction


Chapter 2. Algorithm Analysis


Chapter 3. Gradient Descent


Chapter 4. Gradient Descent Training


Chapter 5. Gradient Boosting

• Chapter 6. Soft Skills


13 Тime Series
+1 project for 

Exploring the time series. Understanding trends, your portfolio

seasonality, and feature creation.

2 weeks, 40 hours


Chapter 1. Course Introduction


Chapter 2. Time Series Analysis

• Chapter 3. Time Series Forecasting

Machine Learning for Texts


+1 project for 

14 Applying machine learning to text data. Finding out how your portfolio

to convert text into numbers and how to use bag-of-words, 2 weeks, 40 hours
TF-IDF, as well as embeddings and BERT.


Chapter 1. Course Introduction


Chapter 2. Text Vectorization

• Chapter 3. Language Representations

Computer Vision
+1 project for 

15 How to handle simple computer vision tasks using premade your portfolio

neural networks and the Keras library. A quick look at deep 2 weeks, 40 hours
learning.


Chapter 1. Course Introduction


Chapter 2. Fully Connected Networks


Chapter 3. Convolutional Neural Networks

• Chapter 4. Soft Skills

Unsupervised Learning
2 weeks, 40 hours
16 Figuring out what to do when you have no target features.
Handling clustering tasks and looking for anomalies.


Chapter 1. Course Introduction


Chapter 2. Clustering

• Chapter 3. Search for Anomalies

+1 project for 

Final Project
your portfolio

17 Apply everything you’ve learned in a two-week bootcamp 2 weeks, 40 hours


that simulates the experience of working as a junior data
scientist.
Career help
In addition to the main educational course, our career help is divided
into three parts: the Career Prep Course, Career Acceleration Program,
and the Apiary projects.

Career Prep Course


40 hours 

This is a course devoted to preparing for life after Practicum. During + Resume, LinkedIn
this course, you will learn how to create a resume, a LinkedIn profile, and Github profiles
and a GitHub account, along with improving networking and
interviewing skills. This course is self-paced and ends with a final
task. We’ll also perform a review of your career artifacts.

1 Resume

Learn how to write an eye-catching resume, transform your non-tech 



experience into a strength.


Compile a ready-to-use resume

• Gain access to a resume improvement tool

2 Creating an Online Presence

Assemble your GitHub portfolio and ensure your LinkedIn looks 



professional and informative.


Produce a production-ready portfolio

• Launch your LinkedIn profile

3 Being a Networking Ninja

Learn how to become a networking professional, and how to write 



the perfect cover letter.


Unlock a networking roadmap

• Prepare a cover letter template


4 The Job Search

Learn where to find a job and prepare for the search!


Access job searching resources & application tracker tool

• Produce a target job list

5 An Interview Masterclass

Familiarize yourself with different interview types, common questions


you might face, and practice tech assignments.


Learn interview do's & don'ts


Master the STAR technique and sound more professional

• Get tech interview help

Career Acceleration Program

Prepare for real-world interviews and gain experience through Up to 6 months 



authentic practice. This program is designed to help you find a real after graduation
job and also provides some work with technical skills.


Attend mock interviews


Receive 1:1 career coaching


Write technical articles and demonstrate your knowledge


Produce demo videos of your work


Participate in extracurricular activities Resolve

• Join the Slack community

Apiary Projects

You'll gain confidence solving work tasks that use a real company's data + 1-∞ real projects 

to provide them with valuable insights. Learn to communicate with for your portfolio, 

clients, meet their expectations, exchange peer reviews with 5-6 weeks
colleagues, and present results to the company. The Apiary projects
become available for participants sometime between the 8-10th Sprint,
depending on the project. They are also available after graduation.


Assemble a portfolio project based on actual data


Get a recommendation on LinkedIn by a real company

• Gain experience with freelance project workflow

You might also like