About Course Syllabus Data Analytics Business Intelligence

Uploaded by

Vishal guni
Copyright
© All Rights Reserved

SHAHEED SUKHDEV COLLEGE OF BUSINESS STUDIES

(University of Delhi)
Dr. KN Katju Marg, Sec-16, Rohini, Delhi-11008
Certificate course on “Data Analytics & Business Intelligence” (Weekend)

About the Course


Data Analytics & Business Intelligence is the need of the hour. Today, huge amounts of data are
generated, often without a clear idea of how to put them to beneficial use. To utilize this data, machine
learning and statistical techniques are used to develop predictive models from existing data
to forecast future outcomes.

Rationale
Data analytics and business intelligence are of great importance in today’s world.
Data analysis is required to understand organizational problems and to explore data, while business
intelligence helps companies make better decisions by showing current and historical data within
their business context. To make the organization run more smoothly and efficiently, analysts use BI
to provide performance and competitor benchmarks.

Aims
The fundamental aim of developing Data Analysis and Business Intelligence skills is to help
students understand trends and derive actionable insights from data, thus enabling data-driven,
strategic and tactical business decisions.

Learning Outcomes
Students will be able to apply principles of statistics, Python programming, machine learning,
probability and decision making in the context of data analysis. Moreover, they should be able to
design tested and effective advanced analytics models for decision making and communicate
effectively in a variety of modes and contexts.

Objectives
This course is designed to build a solid foundation in business analytics by imparting
knowledge of machine learning and statistical methods for data analysis. The course shall also
provide sufficient knowledge of the Python programming language for machine learning algorithms
and of Python/R programming for statistical methods. A brief introduction to neural networks and
deep learning will also be covered.

Target Audience
Those who are interested in developing a strong foundation in business analytics and who have
graduated or are pursuing graduation, having studied mathematics up to the Class 12 level.

Future Prospects
Upon completion of the course, students will have enhanced their skills in data analysis,
Python programming for machine learning and Python/R programming for statistical methods.
They will also be better equipped to investigate questions to which they do not yet know the
answers. This course will help them adapt to the automated future of business intelligence.

Course Duration: 125 Hours [January, 2024 - June, 2024]

Fees: Rs. 40,200/-
(Course Fees: Rs. 40,000/- plus taxes, if any. Application Fees: Rs. 200/-)

Resource Persons
1. Eminent resource persons from reputed institutes will be invited.
2. Experts from various industries will also be invited to deliver lectures and
hands-on training.

Course Contents

Module 1: Foundation of Data Analytics & Python Programming (20 hours)


Foundation of Data Analytics: Introduction, Evolution, Concept and Scope, Data, Big Data,
Metrics and Data Classification, Data Reliability & Validity, Problem Solving with Analytics,
Different Phases of Analytics in the Business and Data Science Domains, Descriptive Analytics,
Predictive Analytics and Prescriptive Analytics, Different Applications of Analytics in Business,
Text Analytics and Web Analytics, Skills for Business Analytics, Concepts of Data Science, Basic
Skills Required for Understanding Data Science.
Python Programming: Introduction to Python editors & IDEs (Jupyter, Spyder, PyCharm, etc.),
custom environment settings, basic data types (numeric, string, float, tuple, list, dictionary, set)
and their operations, control flow (if-elif-else), loops (for, while), built-in functions for data
conversion, writing user-defined functions.
Concepts of packages/libraries: important packages like NumPy, SciPy, scikit-learn, pandas,
Matplotlib, seaborn, etc., installing and loading packages, reading and writing data from/to different
formats, simple plotting, functions, list comprehensions, database connectivity, working with date
formats.

Module 2: Probability & Statistics (25 hours)


Descriptive Analytics: Describing and summarizing data sets, measures of central tendency,
dispersion, skewness, kurtosis, correlation.
Probability: Measures of probability, conditional probability, independent events, Bayes’ theorem,
random variables, discrete (binomial, Poisson, geometric, hypergeometric, negative binomial) and
continuous (uniform, exponential, normal, gamma) distributions. Expectation and variance, Markov’s
inequality, Chebyshev’s inequality, central limit theorem.
Inferential Statistics: Sampling & Confidence Intervals, Inference & Significance. Estimation and
Hypothesis Testing, Goodness of Fit, Test of Independence, Permutation and Randomization Tests,
t-test/z-test (one sample, independent, paired).
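
As an illustration of one topic in this module, Bayes’ theorem can be sketched in a few lines of Python. This example is not part of the official course material. The function name and the numbers (a 1% base rate, 95% sensitivity and a 5% false-positive rate) are hypothetical values chosen only for demonstration.

```python
def bayes_posterior(prior, likelihood, false_positive_rate):
    """Return P(A|B) = P(B|A) * P(A) / P(B).

    P(B) is expanded with the law of total probability:
    P(B) = P(B|A) * P(A) + P(B|not A) * P(not A).
    """
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)
    return likelihood * prior / evidence


# Hypothetical inputs, for illustration only.
posterior = bayes_posterior(prior=0.01, likelihood=0.95, false_positive_rate=0.05)
print(f"P(condition | positive test) = {posterior:.3f}")
```

Even with a sensitive test, the posterior stays well below the sensitivity when the base rate is low, which is the usual motivation for teaching conditional probability before inference.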

Module 3: Data Munging with Python (15 hours)


Relevance in industry, Statistical learning vs machine learning, types and phases of analytics.
Data pre-processing and cleaning: data manipulation steps (sorting, filtering, duplicates, merging,
appending, subsetting, derived variables, data type conversions, renaming, formatting, etc.),
normalizing data, sampling, missing value treatment, outliers.
Exploratory data analysis: Data visualization using the Matplotlib and seaborn libraries, creating
graphs (bar/line/pie/boxplot/histogram, etc.), summarizing data, descriptive statistics, univariate
analysis (distribution of data), bivariate analysis (cross tabs, distributions and relationships,
graphical analysis).

Module 4: Machine learning – Part 1 (17 hours)


Introduction, Applications of Machine Learning, Key elements of Machine Learning, Supervised
vs. Unsupervised Learning.
Supervised Machine Learning: Linear Regression, Multiple Linear Regression, Polynomial
Regression.
Classification: Using Logistic Regression, Logistic Regression vs. Linear Regression, Logistic
Regression with one variable and with multiple variables, Application to multi-class classification.
The problem of Overfitting, Application of Regularization in Linear and Logistic Regression.
Regularization and Bias/Variance. Classification using K-NN, Naive Bayes classifier, Decision
Trees (CHAID Analytics), Random Forest, Support Vector Machines.
Natural Language Processing (NLP): Definition and scope of NLP, Applications of NLP in data
analytics, Text classification, sentiment analysis.
Model Evaluation: Cross-validation types (train & test, bootstrapping, k-fold validation), parameter
tuning, confusion matrices, basic evaluation metrics, precision-recall, ROC curves.
Case study
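
To make the model evaluation topics above concrete, here is a minimal sketch of how a binary confusion matrix, precision and recall can be computed in plain Python. It is an illustration only, not taken from the course material. The helper names and the label lists are hypothetical, and in practice a library such as scikit-learn would normally be used.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true positives, false positives, false negatives, true negatives."""
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive and actual == positive:
            tp += 1
        elif predicted == positive:
            fp += 1
        elif actual == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def precision_recall(tp, fp, fn):
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical labels, for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
precision, recall = precision_recall(tp, fp, fn)
print(f"TP={tp} FP={fp} FN={fn} TN={tn}")
print(f"precision={precision:.2f} recall={recall:.2f}")
```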

Module 5: Machine learning – Part 2 (18 hours)


Neural Networks: Introduction, Model Representation, Gradient Descent vs. Perceptron
Training, Stochastic Gradient Descent, Multiclass Representation, Multilayer Perceptrons,
Backpropagation Algorithm for Learning, Introduction to Deep Learning.
Association Rule Mining: Mining frequent item sets, Apriori algorithm, market basket analysis.
Case study

Unsupervised Machine Learning: Introduction, Clustering, K-Means algorithm, Affinity
Propagation, Agglomerative Hierarchical, DBSCAN, Dimensionality Reduction using Principal
Component Analysis.
Case study: Application of PCA
Time Series Forecasting: Trends and seasonality in time series data, identifying trends, seasonal
patterns, first order differencing, periodicity and autocorrelation, rolling window estimations,
stationarity vs. non-stationarity, ARIMA modeling, time series forecasting using XGBoost.
Case study
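
Two of the time series topics listed above, first order differencing and rolling window estimation, can be illustrated with a short standard-library sketch. This example is not part of the syllabus. The function names and the sample series are hypothetical, and pandas would typically be used for real data.

```python
def first_difference(series):
    """Return y[t] - y[t-1] for t = 1..n-1, which removes a linear trend."""
    return [current - previous for previous, current in zip(series, series[1:])]


def rolling_mean(series, window):
    """Return the mean of each consecutive window of the given length."""
    if window <= 0:
        raise ValueError("window must be a positive integer")
    return [
        sum(series[end - window + 1 : end + 1]) / window
        for end in range(window - 1, len(series))
    ]


# Hypothetical series with an accelerating upward trend.
series = [10, 12, 15, 19, 24, 30]

diffs = first_difference(series)
smoothed = rolling_mean(series, window=3)
print("first differences:", diffs)
print("3-point rolling mean:", [round(value, 2) for value in smoothed])
```

Differencing a second time, by applying `first_difference` to `diffs`, gives a constant series for this data, which is the kind of check used when assessing stationarity before ARIMA modeling.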

Module 6: Optimization in Analytics (10 hours)


Introduction to Operations Research (OR), Linear Programming Problems (LPP), Geometry of
linear programming, Sensitivity and Post-optimal analysis, Duality and its economic interpretation.
Non-linear Programming – KKT conditions, Quadratic Programming, Portfolio optimization.

Module 7: Introduction to SQL (5 hours)


Learning SQL query structure with examples, data management and query systems (OLTP and
OLAP) and their data models, data warehousing, ETL and data integration.

Module 8: Excel, Tableau and Business Intelligence (15 hours)


Excel for Analytics: Data Cleaning and Processing, VLOOKUP, PivotTables and Dashboards, Charts,
Date functions, Conditional Formatting and Data Validation, VBA, Dynamic Arrays and LAMBDA.

Tableau and Business Intelligence: Dashboard creation using Tableau, Concepts of Business
intelligence (BI), the relevance of BI in application to analytics, industry and different domains.

Course Co-coordinator:

Dr. Rishi Rajan Sahay (Mob: 9818011766, Email: [email protected])


Course email id: [email protected]
