Data Science Curriculum
Data Science Curriculum
DATA SCIENCE
CURRICULUM
4 5
SQL + Machine
Power BI Learning
3 6
Python Deep
Data Analysis Learning
2 7
Computer
Statistics Vision
1 8
Python
Programming NLP
Kukatpally Gachibowli
#205, 2nd Floor, Fortune Signature, 2nd Floor, Leeway, BP Raju Marg,
Near JNTU Metro Station, Kukatpally, Opp. Sarath City Capital Mall,
Hyderabad, Telangana 500085. Laxmi Cyber City, Whitefields, Kondapur,
Telangana 500081.
®
AUTHORIZED IN PARTNERSHIP
IBM PARTNER BEST DATA LEADING BEST WITH
MOST TRUSTED SCIENCE EDTECH EDTECH DATA SCIENCE
INDIAN COMPANY IN INDIA COMPANY IN 2023 INSTITUTION >>
AWARDED BY >> ®
AWARDED BY AWARDED BY AWARDED BY
A MeitY - NASSCOM Digital Skilling Initiative
Table of
Contents
01 Course Objective, Key Features In The Training Page no 3
Instagram
www.innomatics.in Facebook
02 Linkedin
+91 9951666670 Youtube
Website
®
COURSE OBJECTIVE
► Duration: 6 Months
► Class Duration: 2 hrs (Monday to Friday)
► Online help on Doubt Clearance, Monitoring Session, Career Guidance, Interview
preparation & Mock interviews.
► Use cases covered: Python and statistics: 4, Machine Learning - 10, NLP - 2, DL - 3.
► One Big Hackathon Challenge on Machine Learning
► Projects:
► Python: Data analysis project, Machine Learning: Regression,
Classification
► NLP: Sentiment Analysis / Chatbot, Deep Learning: Face Recognition.
► Addition: Assignments, Quizzes for each module from python, Statistics, Machine
Learning, NLP and Deep Learning + Computer vision topic wise assignments and quiz.
► IBM Credentials and Certification after completion of the course.
► Guaranteed In-house Internship
► Nearly working on 20 use cases during your course.
► Training materials are provided with Lab Exercises, Data sets, Codes Quizzes, Case
studies on real data.
► For every online session recording video & live running notes will provide.
► Real time Training with live Scenarios and Applications.
► Job Assistance after completion of the course.
FOLLOW US ON
Instagram
www.innomatics.in
Facebook
Linkedin 03
Youtube +91 9951666670
Website
®
In this introductory section, we'll explore the fundamental concepts of data science, including
its origins, key principles, and the role it plays in solving real-world problems. We'll deliver into
the importance of data-driven decision-making and how data science contributes to
innovation across various domains.
MODULE 1
PYTHON PROGRAMMING AND FLASKFRAMEWORK
Introduction
► What is Python?
► Why does Data Science require Python?
► Installation of Anaconda
► Understanding Jupyter Notebook (IDE)
► Basic commands in Jupyter Notebook
► Understanding Python Syntax
► Identifiers and Operators
FOLLOW US ON
Instagram
www.innomatics.in Facebook
04 Linkedin
+91 9951666670 Youtube
Website
®
File Handling
► Create, Read, Write files and Operations in File Handling
► Errors and Exception Handling
Instagram
www.innomatics.in
Facebook
Linkedin 05
Youtube +91 9951666670
Website
®
MODULE 2
DATA ANALYSIS IN PYTHON
DATA VISUALIZATION
Instagram
www.innomatics.in Facebook
06 Linkedin
+91 9951666670 Youtube
Website
®
Regular Expressions
► Structured Data and Unstructured Data
► Literals and Meta Characters
► How to Regular Expressions using Pandas?
► Inbuilt Methods
► Pattern Matching
► This project covers the main four steps of Data Science Life Cycle which involves
► Data Collection
► Data Mining
► Data Preprocessing
► Data Visualization
Ex: Text, CSV, TSV, Excel Files, Matrices, Images
MODULE 3
ADVANCED STATISTICS
Instagram
www.innomatics.in
Facebook
Linkedin 07
Youtube +91 9951666670
Website
®
Descriptive Statistics
► Data types
► Data Collection Techniques
► Sampling Techniques:
► Convenience Sampling, Simple Random Sampling
► Stratified Sampling ,Systematic Sampling and Cluster Sampling
Descriptive Statistics
► What is Univariate and Bi Variate Analysis?
► Measures of Central Tendencies
► Measures of Dispersion
► Skewness and Kurtosis
► Box Plots and Outliers detection
► Covariance and Correlation
Probability Distribution
► Probability and Limitations
► Discrete Probability Distributions
► Bernoulli, Binomial Distribution, Poisson Distribution
► Continuous Probability Distributions
► Normal Distribution, Standard Normal Distribution
Inferential Statistics
► Sampling variability and Central Limit Theorem
► Confidence Intervals
► Hypothesis Testing
► Z -test, t-test
► Chi – Square Test
► F -Test and ANOVA
MODULE 4
Data Base (SQL) + Reporting Tool (Power BI)
Instagram
www.innomatics.in Facebook
08 Linkedin
+91 9951666670 Youtube
Website
®
Introduction To Power Bi
► What is Business Intelligence?
► Power BI Introduction
► Quadrant report
► Comparison with other BI tools
► Power BI Desktop overview
► Power BI workflow
► Installation query addressal
Instagram
www.innomatics.in
Facebook
Linkedin 09
Youtube +91 9951666670
Website
®
Power Queries
► Power Query Introduction
► Data Transformation - its benefits
► Introducing ribbons
► Queries panel
► M Language briefing
► Power BI Datatypes
► Changing Datatypes of columns
Instagram
www.innomatics.in Facebook
10 Linkedin
+91 9951666670 Youtube
Website
®
Miscellaneous Topics
► Visual Interactions
► Drill Through
► Drilldown
► Conditional Formatting
► Creating buttons in Power BI reports
► Creating Python Script Visuals
MODULE 5
MACHINE LEARNING - SUPERVISED LEARNING
Introduction
► What Is Machine Learning?
► Supervised Versus Unsupervised Learning
► Regression Versus Classification Problems Assessing Model Accuracy
Instagram
www.innomatics.in
Facebook
Linkedin 11
Youtube +91 9951666670
Website
®
REGRESSION TECHNIQUES
Linear Regression
► Simple Linear Regression:
► Estimating the Coefficients
► Assessing the Coefficient Estimates
► R Squared and Adjusted R Squared
► M SE and RMSE
Polynomial Regression
► Why Polynomial Regression
► Creating polynomial linear regression
► evaluating the metrics
Regularization Techniques
► Lasso Regularization
► Ridge Regularization
► ElasticNet Regularization
Case Study on Linear, Multiple Linear Regression, Polynomial, Regression using Python.
CAPSTONE PROJECT:
A project on a use case will challenge the Data Understanding, EDA, Data Processing
and above Regression Techniques.
FOLLOW US ON
Instagram
www.innomatics.in Facebook
12 Linkedin
+91 9951666670 Youtube
Website
®
CLASSIFICATION TECHNIQUES
Logistic regression
► An Overview of Classification
► Difference Between Regression and classification Models.
► Why Not Linear Regression?
► Logistic Regression:
► The Logistic Model
► Estimating the Regression Coefficients and Making Pr edictions
► Logit and Sigmoid functions
► Setting the threshold and understanding decision boundary
► Logistic Regression for >2 Response Classes
► Evaluation Metrics for Classification Models:
► Confusion Matrix
► Accuracy and Error rate
► TPR and FPR
► Precision and Recall, F1 Score
► AUC – ROC
► Kappa Score
Naive Bayes
► Principle of Naive Bayes Classifier
► Bayes Theorem
► Terminology in Naive Bayes
► Posterior probability
► Prior probability of class
► Likelihood
► Types of Naive Bayes Classifier
► Multinomial Naive Bayes
► Bernoulli Naive Bayes and Gaussian Naive Bayes
FOLLOW US ON
Instagram
www.innomatics.in
Facebook
Linkedin 13
Youtube +91 9951666670
Website
®
Decision Trees
► Decision Trees (Rule Based Learning):
► Basic Terminology in Decision Tree
► Root Node and Terminal Node
► Regression Trees and Classification Trees
► Trees Versus Linear Models
► Advantages and Disadvantages of Trees
► Gini Index
► Overfitting and Pruning
► Stopping Criteria
► Accuracy Estimation using Decision Trees
Random Forest
► What is it and how does it work?
► Variable selection using Random Forest
Instagram
www.innomatics.in Facebook
14 Linkedin
+91 9951666670 Youtube
Website
®
K Nearest Neighbors
► K-Nearest Neighbor Algorithm
► Eager Vs Lazy learners
► How does the KNN algorithm work?
► How do you decide the number of neighbors in KNN?
► Curse of Dimensionality
► Pros and Cons of KNN
► How to improve KNN performance
UN-SUPERVISED LEARNING
Instagram
www.innomatics.in
Facebook
Linkedin 15
Youtube +91 9951666670
Website
®
K-Means Clustering
► Centroids and Medoids
► Deciding optimal value of 'k' using Elbow Method
► Linkage Methods
Hierarchical Clustering
► Divisive and Agglomerative Clustering
► Dendrograms and their interpretation
► Applications of Clustering
► Practical Issues in Clustering
Recommendation Systems
► What are recommendation engines?
► How does a recommendation engine work?
► Data collection
► Data storage
► Filtering the data
► Content based filtering
► Collaborative filtering
► Cold start problem
► Matrix factorization
► Building a recommendation engine using matrix factorization
► Case Study
FOLLOW US ON
Instagram
www.innomatics.in Facebook
16 Linkedin
+91 9951666670 Youtube
Website
®
MODULE 6
DEEP LEARNING
TensorFlow 2.0
► Introducing Google Colab
► Tensorflow basic syntax
► Tensorflow Graphs
► Tensorboard
Instagram
www.innomatics.in
Facebook
Linkedin 17
Youtube +91 9951666670
Website
®
MODULE 7
CNN & COMPUTER VISION
Instagram
www.innomatics.in Facebook
18 Linkedin
+91 9951666670 Youtube
Website
®
MODULE 8
NATURAL LANGUAGE PROCESSING
Instagram
www.innomatics.in
Facebook
Linkedin 19
Youtube +91 9951666670
Website
®
Unit 4 : Applications
► Sentiment Analysis
► Sentence generation
► Machine translation
► Advanced LSTM structures
► Keras- machine translation
► ChatBot
Python +
Statistics
LINKEDIN
INSTAGRAM
WEBSITE
FACEBOOK
YOUTUBE
FOLLOW US ON
Instagram
www.innomatics.in Facebook
20 Linkedin
+91 9951666670 Youtube
Website