Data Science Machine Learning 17054
Data Science Machine Learning 17054
Data Science
Machine Learning
Full-Stack Roadmap
Invest 8 Months and build proof of work, skills, knowledge, projects, and portfolio
and be Industry ready
https://www.linkedin.com/in/ankit-kumar-singh-983b0820b/
The Roadmap is divided into 16 Sections
Duration: 256 Hours of Learning (8 Months) and many more hours for practice
and project building.
Month 1 — May
Month 2 — June
Month 3 — July
5. Machine Learning
6. ML Operations
Month 4 — August
Month 6 — October
Month 7 — November
Month 8 — December
● Python
● Data Structures
● NumPy
● Pandas
● Matplotlib
● Seaborn
● Scikit-Learn
● Statsmodels
● Natural Language Toolkit (NLTK)
● PyTorch
● OpenCV
● Tableau
● Structure Query Language (SQL)
● PySpark
● Azure Fundamentals
● Azure Data Factory
● Databricks
● 5 Major Projects
● Git and GitHub
● AWS
● GCP
● Azure
1 | Python Programming and Logic Building
I will prefer Python Programming Language. Python is the best for starting your
programming journey. Here is the roadmap of Python for logic building.
● Installation
● Python Org, Python 3
● Variables
● Print function
● Input from user
● Data Types
● Type Conversion
● First Program
2 | Operators
● Arithmetic Operators
● Relational Operators
● Bitwise Operators
● Logical Operators
● Assignment Operators
● Compound Operators
● Membership Operators
● Identity Operators
3 | Conditional Statements
● If Else
● If
● Else
● El If (else if)
● If Else Ternary Expression
4 | While Loop
5 | Lists
● List Basics
● List Operations
● List Comprehensions / Slicing
● List Methods
6 | Strings
● String Basics
● String Literals
● String Operations
● String Comprehensions / Slicing
● String Methods
7 | For Loops
● Range function
● For loop
● Nested For Loops
● Pattern-Based Questions
● Break
● Continue
● Pass
● Loop else
8 | Functions
● Definition
● Call
● Function Arguments
● Default Arguments
● Docstrings
● Scope
● Special functions Lambda, Map, and Filter
● Recursion
● Functional Programming and Reference Functions
9 | Dictionary
● Dictionaries Basics
● Operations
● Comprehensions
● Dictionaries Methods
10 | Tuple
● Tuples Basics
● Tuples Comprehensions / Slicing
● Tuple Functions
● Tuple Methods
11 | Set
● Sets Basics
● Sets Operations
● Union
● Intersection
● Difference and Symmetric Difference
12 | Object-Oriented Programming
● Classes
● Objects
● Method Calls
● Inheritance and Its Types
● Overloading
● Overriding
● Data Hiding
● Operator Overloading
13 | File Handling
● File Basics
● Opening Files
● Reading Files
● Writing Files
● Editing Files
● Working with different extensions of file
14 | Exception Handling
● Common Exceptions
● Exception Handling
● Try
● Except
● Try except else
● Finally
● Raising exceptions
● Assertion
15 Regular Expression
● Basic RE functions
● Patterns
● Meta Characters
● Character Classes
17 | Data Structures
● Stack
● Queue
● Linked Lists
● Sorting
● Searching
● Linear Search
● Binary Search
18 | Higher-Order Functions
● Function as a parameter
● Function as a return value
● Closures
● Decorators
● Map, Filter, Reduce Functions
● Understanding BeautifulSoup
● Extracting Data from websites
● Extracting Tables
● Data in JSON format
20 | Virtual Environment
● Flask
● Project Structure
● Routes
● Templates
● Navigations
23 | Deployment
● Heroku Deployment
● Flask Integration
● What is PIP?
● Installation
● PIP Freeze
● Creating Your Own Package
● Upload it on PIP
25 | Python with MongoDB Database
26 | Building API
● Statistics
● NumPy basics
● Working with Matrix
● Linear Algebra operations
● Descriptive Statistics
28 | Data Analysis with Pandas
● Matplotlib Basics
● Working with plots
● Plot
● Pie Chart
● Histogram
30 | What to do Now?
● Getting Started
● Variables
● Data Types
● Data Structures
● Algorithms
● Analysis of Algorithm
● Time Complexity
● Space Complexity
● Types of Analysis
● Worst
● Best
● Average
● Asymptotic Notations
● Big-O
● Omega
● Theta
Data Structures - Phase 1
1 | Stack
2 | Queue
3 | Linked List
4 | Tree
5 | Graph
Algorithms - Phase 2
8 | Searching 9 | Recursion
10 | Hashing
11 | Strings
12 | Dynamic Programming
Numpy
● Vectors, Matrix
● Operations on Matrix
● Mean, Variance, and Standard Deviation
● Reshaping Arrays
● Transpose and Determinant of Matrix
● Diagonal Operations, Trace
● Add, Subtract, Multiply, Dot, and Cross Product.
Pandas
Matplotlib
● Graph Basics
● Format Strings in Plots
● Label Parameters, Legend
● Bar Chart, Pie Chart, Histogram, Scatter Plot
4 | Statistics
Descriptive Statistics
Inferential Statistics
● t-Test
● z-Test
● Hypothesis Testing
● Type I and Type II errors
● t-Test and its types
● One way ANOVA
● Two way ANOVA
● Chi-Square Test
● Implementation of continuous and categorical data
5 | Machine Learning
The best way to master machine learning algorithms is to work with the Scikit-
Learn framework. Scikit-Learn contains predefined algorithms and you can work
with them just by generating the object of the class. These are the algorithm you
must know including the types of Supervised and Unsupervised Machine
Learning:
● Linear Regression
● Logistic Regression
● Decision Tree
● Gradient Descent
● Random Forest
● Ridge and Lasso Regression
● Naive Bayes
● Support Vector Machine
● KMeans Clustering
● Measuring Accuracy
● Bias-Variance Trade-off
● Applying Regularization
● Elastic Net Regression
● Predictive Analytics
● Exploratory Data Analysis
6 |MLOps
You can master any one of the cloud services providers from AWS, GCP, and
Azure. You can switch easily once you understand one of them.
● Sentiment analysis
● POS Tagging, Parsing,
● Text preprocessing
● Stemming and Lemmatization
● Sentiment classification using Naive Bayes
● TF-IDF, N-gram,
● Machine Translation, BLEU Score
● Text Generation, Summarization, ROUGE Score
● Language Modeling, Perplexity
● Building a text classifier
● Identifying the gender
8 | Computer Vision
To work on image and video analytics we can master computer vision. To work on
computer vision we have to understand images.
● PyTorch Tensors
● Understanding Pretrained models like AlexNet, ImageNet, and ResNet.
● Neural Networks
● Building a perceptron
● Building a single-layer neural network
● Building a deep neural network
● Recurrent neural network for sequential data analysis
Convolutional Neural Networks
● What is BigData?
● How is BigData applied within Business?
PySpark
Data Modeling
● Duplicate Data
● Descriptive Analysis of Data
● Visualizations
● ML lib
● ML Packages
● Pipelines
Streaming
● SQL databases
● NoSQL databases
● SQL vs NoSQL databases
● Database Replication
● Indexes
● Normalization and Denormalization
● CAP theorem
● URL Shortener
● Whatsapp, Twitter, Netflix, Uber
13 | Five Major Projects and Git
We follow project-based learning and we will work on all the projects in parallel.
14 | Interview Preparation
● Understanding Git
● Commands and How to commit your first code?
● How to use GitHub?
● How to make your first open-source contribution?
● How to work with a team? — Part 1
● How to create your stunning GitHub profile?
● How to build your own viral repository?
● Building a personal landing page for your Portfolio for FREE
● How to grow followers on GitHub?
● How to work with a team? Part 2 — issues, Milestones, and projects
SAVE FOR LATER