Unit 1 - Week (1 - 4) : Planning and Thinking Skills For Architecting Data Science Solutions
Unit 2 - Week (5 - 8)
Foundation Courses
Data Analytics in Excel - foundation to dashboarding
Visualization using Tableau
Python / R Programming - coding structures, data handling, control structures, etc.
Data Pre-Processing
Data Exploration - Histograms, Bar Chart, Box Plot, Line Graph, Scatter Plot
Data Storytelling - The Science, ggplot, Bubble Charts with Multiple Dimensions, Gauge Charts,
Treemap, Heat Map and Motion Charts
Linear Regression
Approach: Model Estimation, MLE & Error Function, Optimization through Gradient Descent for
finding parameters
Constructing a Linear Regression, Diagnostics
Interpretation and Applications
Case Study 1 - Help a digital media company understand why its viewership is falling and
propose recommendations to increase it
Case Study 2 - Create a model to understand the factors that influence car prices in the US.
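As a rough illustration of the "Optimization through Gradient Descent for finding parameters" topic listed under Linear Regression, the sketch below fits a one-variable line by minimizing mean squared error (which coincides with the maximum-likelihood estimate when the noise is assumed Gaussian). The function name fit_line_gd, the learning rate, the step count, and the toy data are illustrative assumptions, not course material.

```python
def fit_line_gd(xs, ys, lr=0.05, steps=5000):
    """Fit y = w*x + b by gradient descent on mean squared error.

    lr and steps are illustrative defaults chosen for the toy data below.
    """
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Residuals under the current parameters.
        errs = [w * x + b - y for x, y in zip(xs, ys)]
        # Partial derivatives of the MSE with respect to w and b.
        grad_w = 2.0 / n * sum(e * x for e, x in zip(errs, xs))
        grad_b = 2.0 / n * sum(errs)
        # Step against the gradient.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Hypothetical noise-free data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = fit_line_gd(xs, ys)
print(round(w, 4), round(b, 4))
```

On real data the learning rate usually needs tuning, and features are often scaled first so that the descent converges in a reasonable number of steps.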
Decision Trees
Rule Based Knowledge: Logic of Rules, Evaluating Rules, Rule Induction and Association
Rules.
Construction of Decision Trees through Simplified Examples; Choosing the "Best" attribute at
each Non-Leaf node; Entropy; Information Gain, Gini Index, Chi Square, Regression Trees.
Generalizing Decision Trees; Information Content and Gain Ratio; Dealing with Numerical
Variables; other Measures of Randomness
Pruning a Decision Tree; Cost as a consideration; Unwrapping Trees as Rules
Oblique Decision Trees
Case Study - Predict whether a customer will default on a loan
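The "Entropy; Information Gain" topics listed under Decision Trees can be sketched in a few lines. The example below computes the entropy of a label set and the information gain of splitting on one attribute, then picks the "best" attribute for a node. The helper names and the four-row loan table are hypothetical, chosen only to make the arithmetic easy to follow.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(rows, labels, attr):
    """Reduction in entropy obtained by splitting the rows on attr."""
    n = len(labels)
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    # Weighted average entropy of the child nodes after the split.
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical toy data: two categorical attributes, binary outcome.
rows = [
    {"employed": "yes", "owns_home": "yes"},
    {"employed": "yes", "owns_home": "no"},
    {"employed": "no", "owns_home": "yes"},
    {"employed": "no", "owns_home": "no"},
]
labels = ["repaid", "repaid", "default", "default"]

# Choose the attribute with the highest information gain.
best = max(["employed", "owns_home"],
           key=lambda a: information_gain(rows, labels, a))
print(best)
```

Here "employed" separates the two classes perfectly, so its gain equals the full entropy of the labels (1 bit), while "owns_home" leaves each branch evenly mixed and gains nothing.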
Instance based learning
K-NN method, Wilson editing and triangulation
K-NN in collaborative filtering, digit recognition
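For the K-NN method listed above, a minimal sketch of classification by majority vote among the k nearest training points might look as follows. It assumes Euclidean distance and Python 3.8+ (for math.dist); the function name knn_predict and the two-cluster toy data are hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Predict the label of query by majority vote of its k nearest neighbours."""
    # Pair each training point's distance to the query with its label,
    # then sort so the nearest neighbours come first.
    dists = sorted(
        (math.dist(p, query), label)
        for p, label in zip(train_points, train_labels)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two well-separated clusters.
points = [(1, 1), (1, 2), (2, 1), (8, 8), (8, 9), (9, 8)]
labels = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(points, labels, (2, 2)))
print(knn_predict(points, labels, (7, 7)))
```

An odd k avoids tied votes in two-class problems; in practice k is usually chosen by validation rather than fixed in advance.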
Ensembles
Methods of Ensembling (Stacking, Mixture of Experts)
Bagging and Random forest (Logic, Practical Applications)
AdaBoost
Gradient Boosting Machines
Time Series
Regression on Time
Modeling Seasonality as Deviation
Statistician's Approach: Components of a Time Series and Estimation Methods
Smoothing: Moving Average, Weighted and Exponential Moving Averages
Holt-Winters Method
Box-Jenkins and ARIMA
Case Study - Forecast gold prices using the past 30 years of data.
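The smoothing topics listed under Time Series (moving average and exponential moving average) can be illustrated with a short sketch. The function names, the smoothing factor alpha, and the sample series below are illustrative assumptions, not data from the gold-price case study.

```python
def moving_average(series, window):
    """Simple moving average: mean of each run of `window` consecutive values."""
    return [
        sum(series[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(series))
    ]


def exponential_smoothing(series, alpha):
    """Simple exponential smoothing: s_t = alpha * x_t + (1 - alpha) * s_{t-1}.

    The first smoothed value is initialised to the first observation.
    """
    smoothed = [series[0]]
    for x in series[1:]:
        smoothed.append(alpha * x + (1 - alpha) * smoothed[-1])
    return smoothed


# Hypothetical sample series.
print(moving_average([1, 2, 3, 4, 5], window=3))
print(exponential_smoothing([10, 20, 30], alpha=0.5))
```

A larger alpha weights recent observations more heavily and tracks changes faster. The Holt-Winters method extends the same recursion with additional trend and seasonal components.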