0% found this document useful (0 votes)
21 views

Data Science & AI

Data Science & AI

Uploaded by

hr.scratchnest
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views

Data Science & AI

Data Science & AI

Uploaded by

hr.scratchnest
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 10

Tools and Technologies

Python
SQL

Numpy
Pandas

Scikit-learn
Seaborn

SciPy
Matplotlib

OpenAI(Generative
Keras
AI)

PowerB
I Tableau

AWS

https://ifittrvu.6i +91 77560 43707


Syllabus
Python

Variables:
Definition and use
Variable assignment and reassignment

Data types:
string, integer, float
String manipulation
Floating-point arithmetic

Loops and conditions:


for loop, while loop
For loop syntax and examples
While loop structure and
usage
Loop control statements (break, continue)

Data Structures:
Lists, tuples, dictionaries, and
sets List operations and methods
Tuple characteristics and
usage Dictionary key-value
pairs
Set operations and applications

Functions:
Function definition and
calling Parameters and
return values Lambda
functions

Object-oriented programming:
Object-oriented programming concepts
overview Class & object creation
Private functions and variables
Pandas:
Introduction to Pandas
DataFrames and
Series
Merge, filter and sort operations in Pandas
Handling missing data

NumPy:
Introduction to NumPy
NumPy arrays and
operations Linear algebra
with NumPy

Reading data from API and SQL:


Retrieving data from APIs,
Integration of SQL and Python

Data Visualization:
Seaborn, Matplotlib
Basic plots with Matplotlib
Seaborn for statistical data visualization

Math for Data Science


Probability
Scalar and Vector
Distance Metrics
Matrix
Operations
Linear and non-linear functions
Derivative
Statistics
Central limit theorem
Data Distribution
Mean, Mode, Median
Variance and Standard
Deviation Correlation and
Causation Missing data
imputation
Outlier Detection
Hypothesis Testing: p-value, chi square test, Welch t-test
A/B Testing
Market Basket Analysis

Machine Learning Models


Linear Regression:
Simple and multiple linear
regression Gradient Descent
Optimization
Ordinary Least Square Regression
Evaluation metrics for Regression models
Build linear regression model using Python in live class.

Logistic Regression:
Logistic function
Odds Ratio
Logistic regression model
Decision Boundary
Evaluation metrics for Classification model
Build logistic regression model using Python in live class.
Regularization:
Underfitting and Overfitting
Bias Variance Tradeoff
Ll regularization (Lasso regression)
L2 regularization (Ridge
regression)
Build Ll and L2 regularized model using Python in live class.

Clustering:
K-Means clustering
Hierarchical clustering
DBSCAN Clustering
Evaluation metrics for Clustering Model
Build and evaluate clustering models using Python in live class.

K-Nearest Neighbors (KNN):


Distance metrics in KNN
KNN algorithm
KNN for regression and classification
Build KNN model using Python in live class.

Support Vector Machines (SVM):


Margin Maximization
Hard Margin and Soft Margin
Kernel Trick
Support vector regression
Build and evaluate SVM model using Python in live class.

Dimension Reduction:
Principal Component Analysis (PCA)
Eigen Value Eigen Vector Decomposition
Apply PCA for dimension reduction using Python in live class.
Decision Trees and Random Forests:
Decision tree construction
Splitting criteria
Build a decision tree model using Python in live class.

Random Forest:
Bagging
Ensemble learning with random
forests Feature importance in random
forests
Build a random forest model using Python in live class.

Gradient Boosting:
Boosting
Adaboost
Gradient Boosting
XGBoost and its advantages
Build XGBoost model using Python in live class.

Time Series Forecasting:


Stationarity
Seasonality and
trend ARIMA
SARIMA
Train time series forecasting model in live class

Deep Learning

Neural Networks: Overview of Neural Networks

Activation Functions:
Understanding Activation
Functions Types of Activation
Functions
Importance of Activation Functions in Deep Learning

https://ifittrvu.6i +91 77560 43707


Deep Learning:
Introduction to Deep
Learning Deep Neural
Networks (DNNs) Forward
propagation
Backward Propagation
Vanishing Gradient problem
Exploding Gradient problem
Build a deep learning model using Python in live class.

Convolutional Neural Networks (CNN):


Introduction to Convolutional Neural
Networks. Pooling layer
Dropout layer
Create a model for object detection using Python in live class.

Long Short-Term Memory (LSTM):


Introduction to Recurrent Neural Networks
(RNNs). Architecture and working principles of
LSTM
Build LSTM model on time series data in live class.

Natural language processing ( NLP )


Text Cleaning:
Introduction to text preprocessing.
Handling special characters and
numbers. Case normalization.

Stemming and Lemmatization:


Understanding the basics of stemming and
lemmatization. Comparing stemming and lemmatization.

Part of Speech Tagging:


Overview of part-of-speech (POS)
tagging Use cases and applications of
POS tagging

Recognition (NER):
Overview of NER
Applications of NER
Build NER on text data using Python in live class
TFIDF (Term Frequency-Inverse Document
Frequency):
Explaining TFIDF and its importance in
NLP Applications of TFIDF in text analysis
TFIDF implementation
Build TFIDF model on text data using Python in live class

Word2Vec:
Introduction to Word Embeddings
Word2Vec architecture and
models Training Word2Vec
models
Applications of Word2Vec in NLP
Build word2vec model on large text data using Python
in live class

Explainable AI
How to create explainable models?
LIME
SHAP
Build Explainable AI model using Python in live class

Generative AI
BERT
Large language
Models ChatGPT
How ChatGPT is trained?
ChatGPT API for Python developers
Build Question and Answer model using Generative AI on custom
dataset using Python in live class

ML Ops
Git and Coding
standards Model
deployment
Model monitoring
Flask API and Batch
processing Docker
Build a ML project with code structured for batch and API deployment using
Python in live class
Amazon Web Services (AWS)
AWS cloud services
AWS for machine learning
ML model deployment on AWS

SQL for Data Science


Introduction to SQL for data
manipulation Aggregation functions
in SQL
Filters in SQL queries
Group By clause and its applications
Joins in SQL and handling nested
queries Date and time operations in
SQL

Power BI
Data visualization using Power BI
Data Modelling
Power Query
Advanced visualizations
Build data analytics project using Power BI

Tableau
Data Visualization using
Tableau Creating dashboard
Pages
Sorting, grouping and
Filtering Different types of
charts
Build data analytics project using Tableau

Capstone projects on real world datasets


ML capstone project l – Regression
ML capstone project 2 - Classification
ML capstone project 3 -Clustering
ML capstone project 4 - NLP and Generative AI

You might also like