DSBA Curriculum Guide
DATA SCIENCE AND BUSINESS ANALYTICS
CURRICULUM GUIDE
PROGRAM HIGHLIGHTS
6 Months Online
LEARNING OUTCOMES
CURRICULUM
Module 1: Python Foundations
Master data storytelling with Python. Learn to read, manipulate, and visualize data, driving insights
for impactful business solutions through exploratory data analysis. Transform raw information into
compelling narratives.
Concepts Used:
Variables and Datatypes
Data Structures
Conditional and Looping Statements
Functions
Learning Outcomes: Learn about the fundamentals of Python programming (variables, data
structures, conditional and looping statements, functions).
Case Study: CRED Pay- CRED Pay is a consulting firm that partners with banks and checks
whether their customers are eligible for a credit card. It is a startup in the early stages of
building its business. It has partnered with a few banks and is currently collecting data for
credit card applications. You have been hired as a Data Scientist to handle and organize the
data so that it will be easily accessible and to help the company predict whether an
application for a credit card can be accepted or not.
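The fundamentals listed above can be sketched in a few lines. This is an illustrative example only, not part of the course materials; the applicant records and the eligibility rule are invented for demonstration.

```python
# Variables and datatypes
applicant_name = "A. Kumar"      # str
annual_income = 54000.0          # float
has_existing_card = False        # bool

# Data structures: a list of dicts acting as a tiny "dataset"
applications = [
    {"name": "A. Kumar", "income": 54000.0, "age": 29},
    {"name": "B. Singh", "income": 23000.0, "age": 41},
    {"name": "C. Rao",   "income": 87000.0, "age": 35},
]

# Function combining a conditional to encode a toy eligibility rule
def is_eligible(applicant, income_cutoff=30000.0):
    """Return True if the applicant clears a simple income threshold."""
    return applicant["income"] >= income_cutoff

# Looping statement: filter the eligible applicants
eligible = [a["name"] for a in applications if is_eligible(a)]
print(eligible)  # ['A. Kumar', 'C. Rao']
```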
04
Topic 2- Python for Data Science
Python for Data Science: NumPy is a Python package for mathematical and scientific
computing and involves working with arrays and matrices. Pandas is a fast, powerful,
flexible, and simple-to-use open-source library in Python to manipulate and analyze data.
This module will cover these important libraries and provide a deep understanding of how
to use them to explore data.
Concepts Used:
NumPy Arrays and Functions
Accessing and Modifying NumPy Arrays
Saving and Loading NumPy Arrays
Pandas Series (Creating, Accessing, and Modifying Series)
Pandas DataFrames (Creating, Accessing, Modifying, and Combining DataFrames)
Pandas Functions
Saving and Loading Datasets using Pandas
Learning Outcomes: Learn about NumPy and Pandas, two of the most commonly used
libraries in Data Science for reading and manipulating data.
Case Study: MovieLens- MovieLens is a company in the internet and entertainment domain
providing an online database of information related to films, television series, and online
streaming content, including cast, production crew, trivia, ratings, and fan and critical reviews.
Every year, in collaboration with a guest curator, MovieLens publishes an annual edition based
on a theme, providing a comprehensive view of a topic. The company is planning to bring out
the ‘Movie Talkies: Classic’ edition this year. The idea is to explore movies that are a
decade old and deliver a detailed analysis.
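A minimal sketch of the NumPy and Pandas operations named above, using a made-up film table in the spirit of the MovieLens case (titles and numbers are invented):

```python
import numpy as np
import pandas as pd

# NumPy: create, access, and modify an array
arr = np.array([[1, 2, 3], [4, 5, 6]])
col_means = arr.mean(axis=0)          # column-wise means
arr[0, 0] = 10                        # in-place modification

# Pandas Series: creating and accessing
ratings = pd.Series([8.1, 7.4, 9.0], index=["film_a", "film_b", "film_c"])

# Pandas DataFrame: creating, accessing, and filtering
df = pd.DataFrame({"title": ["film_a", "film_b", "film_c"],
                   "year": [2001, 1999, 2005],
                   "rating": [8.1, 7.4, 9.0]})
old_films = df[df["year"] < 2002]     # boolean filtering
top = df.sort_values("rating", ascending=False).iloc[0]["title"]
```

Saving and loading work the same way via `np.save`/`np.load` and `df.to_csv`/`pd.read_csv`.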
Topic 3- Data Visualization
Learning Outcomes: Learn about different visual tools that help summarize data better
and how to create them using Seaborn, a popular Python library.
Case Study: Chef’s Kitchen- Chef's Kitchen is one of the most popular restaurants in the
city of San Diego and acts as a one-stop destination for food lovers. The polite and
efficient service provided by the restaurant staff often gets them tips from the customer.
As a Data Analyst for the restaurant, you have been asked to analyze the data provided to
identify the patterns and trends in the revenue and tips received from customers across
different demographics and come up with informative visualizations to convey the insights
obtained from the analysis.
Topic 4- Exploratory Data Analysis (Deep Dive)
Exploratory Data Analysis (Deep Dive): Exploratory Data Analysis, or EDA, is the process of
examining and visualizing data to uncover patterns, extract meaningful insights, and
facilitate storytelling. This module provides a deep insight into how to conduct EDA
using Python and utilize the insights extracted to drive business decisions.
Concepts Used:
Data Overview
Univariate Analysis
Bivariate/Multivariate Analysis
Missing Value Treatment
Outlier Detection and Treatment
Learning Outcomes: Learn how to perform Exploratory Data Analysis (EDA) to extract
insights from data.
Case Study: Zoom Ads- Zoom Ads is an advertising agency that wants to perform an
analysis on the data of the Google Play Store. They need to understand the trend of
applications available on the Play Store so that they can decide to focus on promoting
advertisements on particular applications which are trending in the market and can lead
to maximum profit. As a Data Scientist, you are required to gather and analyze detailed
information on apps in the Google Play Store in order to provide insights on app features
and the current state of the Android app market.
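The EDA steps listed above can be sketched with pandas on a toy app table in the spirit of the Play Store case. The data is invented for illustration, with one missing rating and one install-count outlier planted deliberately:

```python
import numpy as np
import pandas as pd

# Toy app data: one missing value and one outlier, for demonstration
df = pd.DataFrame({
    "installs": [100, 120, 90, 110, 5000],   # 5000 is an outlier
    "rating":   [4.1, 3.9, np.nan, 4.3, 4.0],
})

# Data overview
summary = df.describe()

# Missing value treatment: impute with the median
df["rating"] = df["rating"].fillna(df["rating"].median())

# Outlier detection with the IQR rule
q1, q3 = df["installs"].quantile([0.25, 0.75])
iqr = q3 - q1
upper = q3 + 1.5 * iqr
outliers = df[df["installs"] > upper]

# Outlier treatment: cap values at the upper whisker
df["installs"] = df["installs"].clip(upper=upper)
```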
Module 2: Business Statistics
Utilize Python for statistical analysis. Validate business estimates through confidence intervals,
ensuring reliability. Test assumptions with hypothesis testing, guiding informed resource
allocation and strategic decision-making based on data distribution analysis.
Topic 1- Inferential Statistics Foundations
Concepts Used:
Experiments, Events, and Definition of Probability
Introduction to Inferential Statistics
Introduction to Probability Distributions (Random Variable, Discrete and Continuous
Random Variables, Probability Distributions)
Binomial Distribution
Normal Distribution
Z-Score
Learning Outcomes: Learn about the fundamentals of probability distributions and the
foundations of Inferential Statistics
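These distribution concepts can be sketched with `scipy.stats` (the numbers below are arbitrary, chosen only to illustrate the calls):

```python
from scipy import stats

# Binomial: probability that at most 2 of 10 trials "fail" when each fails with p = 0.1
p_at_most_2 = stats.binom.cdf(2, n=10, p=0.1)

# Normal: probability that a standard normal value falls below z = 1
p_below = stats.norm.cdf(1)            # about 0.8413

# Z-score: how many standard deviations an observation is from the mean
x, mu, sigma = 14.0, 10.0, 2.0
z = (x - mu) / sigma                   # (14 - 10) / 2 = 2.0
```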
Case Study: Medicon Drug Testing- Pharmaceutical company Medicon has manufactured the
sixth batch (40,000 units) of COVID-19 vaccine doses. This vaccine was clinically tested last
quarter, and around 200,000 doses have already been administered to people in five batches.
Now, this sixth batch needs to be tested for its time of effect (measured as the time taken for
a dose to completely cure COVID) as well as for quality assurance (which tells you whether a
dose will be able to do a satisfactory job or not).
Topic 2- Estimation and Hypothesis Testing
Estimation and Hypothesis Testing: Estimation involves determining likely values for
population parameters from sample data, while hypothesis testing provides a framework for
drawing conclusions from sample data to the broader population. This module covers the
important concepts of central limit theorem and estimation theory that are vital for statistical
analysis, and the framework for conducting hypothesis tests.
Concepts Used:
Sampling
Central Limit Theorem
Estimation
Introduction to Hypothesis Testing (Null and Alternative Hypothesis, Type-I and Type-II
errors, Alpha, Critical Region, P-Value)
Hypothesis Formulation and Performing a Hypothesis Test
One-Tailed and Two-Tailed Tests
Confidence Intervals and Hypothesis Testing
Learning Outcomes: Learn about the Central Limit Theorem, estimation, and the key
concepts of Hypothesis Testing.
Case Study: Talent Hunt Examination- A research institute conducts a Talent Hunt Examination
every year to hire people who can work on various research projects in the field of Mathematics
and Computer Science. A2Z institute provides a preparatory program to help the aspirants
prepare for the Talent Hunt Exam. The institute has a good record of helping many students clear
the exam. Before the application for the next batch starts, the institute wants to attract more
aspirants to their program. For this, the institute wants to assure the aspiring students of the
quality of results obtained by students enrolled in their program in recent years.
The institute wants to provide an estimate of the average score obtained by aspirants who enroll
in their program. Keeping in mind the variation in scores every year, the institute wants to provide
a more reliable estimate of the average score using a range of scores instead of a single estimate.
A recent social media post from A2Z institute received feedback from a reputed critic, mentioning
that the students from A2Z institute score less than last year's cut-off on average. The institute
wants to test if the claim by the critic is valid.
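The two analyses the institute needs, an interval estimate of the mean score and a one-tailed test of the critic's claim, can be sketched with `scipy.stats`. The scores here are simulated, not real exam data, and the cutoff of 68 is an assumed value:

```python
import numpy as np
from scipy import stats

# Simulated exam scores standing in for the institute's data (illustrative)
rng = np.random.default_rng(0)
scores = rng.normal(loc=72, scale=8, size=40)

# 95% confidence interval for the mean score (t-distribution, sigma unknown)
mean = scores.mean()
sem = stats.sem(scores)                       # standard error of the mean
ci_low, ci_high = stats.t.interval(0.95, len(scores) - 1, loc=mean, scale=sem)

# One-sample test of H0: mu = 68 against H1: mu > 68 (one-tailed)
t_stat, p_two_tailed = stats.ttest_1samp(scores, popmean=68)
p_one_tailed = p_two_tailed / 2 if t_stat > 0 else 1 - p_two_tailed / 2
reject_h0 = p_one_tailed < 0.05               # reject at the 5% level?
```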
Topic 3- Common Statistical Tests
Concepts Used:
Common Statistical Tests
Test for One Mean
Test for Equality of Means (Known Standard Deviation)
Test for Equality of Means (Equal and Unknown Std Dev)
Test for Equality of Means (Unequal and Unknown Std Dev)
Test of Independence
One-Way ANOVA
Learning Outcomes: Learn about various commonly used statistical tests and their
implementation in Python with business examples.
Case Study: Diet- The Health Company, which provides various diet plans for weight loss,
conducted a market test experiment to test three different kinds of diets (A, B, C). Each of
the volunteers was given one of the three diet plans and asked to follow the diet for 6
weeks. In order to understand the effectiveness of each of the different diets for weight loss,
the executives of the company reached out to you, a data scientist at the company. The
weights before starting the diet and the weights 6 weeks after following the diet were
recorded for 78 volunteers, each of whom was provided with one of the three diet plans. You have
been asked to perform a statistical analysis to find evidence of whether the mean weight
losses with respect to the three diet plans are significantly different. Consider a 5%
significance level for the analysis.
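A diet comparison like the one above can be sketched with a one-way ANOVA in `scipy.stats`. The weight-loss figures below are invented for illustration, not the case-study data:

```python
from scipy import stats

# Hypothetical weight-loss figures (kg) for three diet plans after 6 weeks
diet_a = [3.8, 6.0, 0.7, 2.9, 2.8, 2.0]
diet_b = [3.5, 4.1, 2.5, 3.0, 3.9, 2.7]
diet_c = [5.3, 4.9, 6.2, 5.6, 4.5, 5.8]

# One-way ANOVA: H0 says the three mean weight losses are all equal
f_stat, p_value = stats.f_oneway(diet_a, diet_b, diet_c)
significant = p_value < 0.05   # 5% significance level, as in the case study
```

If the test is significant, a post-hoc procedure (e.g. Tukey's HSD) would identify which pairs of diets differ.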
Module 3: Supervised Learning - Foundations
Delve into linear models for uncovering relationships between variables and continuous outcomes.
Validate models for statistical soundness, drawing inferences to extract crucial business insights
into decision-making factors.
Topic 1- Intro to Supervised Learning - Linear Regression
Intro to Supervised Learning - Linear Regression: Machine Learning (ML) is a subset of Artificial
Intelligence (AI) that focuses on developing algorithms capable of learning patterns in data
and making predictions without being explicitly programmed to do so. Linear Regression is one
of the most popular supervised ML algorithms that identifies the degree of linear relationship in
data. This module introduces participants to ML and explores how linear regression can be used
for predictive analysis.
Concepts Used:
Introduction to Learning from Data
Simple and Multiple Linear Regression
Evaluating a Regression Model
Pros and Cons of Linear Regression
Learning Outcomes: Understand the concept of learning from data, how the linear regression
algorithm works, and how to build and assess the performance of a regression model in Python.
Case Study: Anime Rating- Streamist is a streaming company that streams web series and
movies to a worldwide audience. Every content on their portal is rated by the viewers, and the
portal also provides other information for the content like the number of people who have
watched it, the number of people who want to watch it, the number of episodes, duration of an
episode, etc. Streamist is currently focusing on the anime available in their portal and wants to
identify the most important factors involved in rating an anime. As a data scientist at Streamist,
you are tasked with analyzing the portal's anime data and identifying the important factors by
building a predictive model to predict the rating of an anime.
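Building and assessing a regression model like the anime-rating predictor can be sketched with scikit-learn. The data here is synthetic: ratings are generated from two invented features (episode count and duration) plus noise, so the fitted coefficients are recoverable by construction:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score

# Synthetic data: rating driven by two features plus noise (illustrative only)
rng = np.random.default_rng(42)
n = 200
episodes = rng.uniform(10, 60, n)
duration = rng.uniform(15, 45, n)
rating = 5.0 + 0.03 * episodes + 0.02 * duration + rng.normal(0, 0.2, n)

# Multiple linear regression on the two features
X = np.column_stack([episodes, duration])
model = LinearRegression().fit(X, rating)

# Evaluate: R^2 is the fraction of variance in ratings explained by the model
preds = model.predict(X)
r2 = r2_score(rating, preds)
```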
Topic 2- Linear Regression Assumptions and Statistical Inference
Linear Regression Assumptions and Statistical Inference: The linear regression algorithm has a
set of assumptions that need to be satisfied for the model to be statistically validated and to be
able to draw inferences from it. This module walks participants through these assumptions, how
to check them, what to do in case they are violated, and the statistical inferences that can be
drawn based on the model's output.
Concepts Used:
Statistician vs ML Practitioner
Linear Regression Assumptions
Statistical Inferences from a Linear Regression Model
Learning Outcomes: Understand the underlying assumptions of a linear regression model, how
to check and ensure they are satisfied, and how to make statistical inferences from the model.
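A few of the standard assumption checks can be sketched on simulated data. This is one possible set of diagnostics, not the module's exact procedure: Shapiro-Wilk for residual normality, the zero-mean property of OLS residuals, and a rough spread comparison as a stand-in for a formal homoscedasticity test:

```python
import numpy as np
from scipy import stats
from sklearn.linear_model import LinearRegression

# Simulated data that satisfies the assumptions by construction
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 150)
y = 2.0 + 1.5 * x + rng.normal(0, 1.0, 150)

model = LinearRegression().fit(x.reshape(-1, 1), y)
residuals = y - model.predict(x.reshape(-1, 1))

# Normality of errors: Shapiro-Wilk test (H0: residuals are normal)
_, p_normal = stats.shapiro(residuals)

# Zero-mean errors: OLS residuals average to ~0 when an intercept is fitted
mean_ok = abs(residuals.mean()) < 1e-8

# Homoscedasticity (rough check): residual spread in lower vs upper half of x
low, high = residuals[x < 5].std(), residuals[x >= 5].std()
spread_ratio = max(low, high) / min(low, high)   # near 1 if variance is constant
```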
Module 4: Supervised Learning - Classification
Unlock the power of classification models to discern relationships between variables and
categorical outcomes. Extract business insights by identifying pivotal factors shaping
decision-making processes.
Topic 1- Logistic Regression
Logistic Regression: Logistic regression is a statistical modeling technique primarily used for
modeling the probability of binary outcomes. It finds applications in various fields such as
medicine, finance, and manufacturing. This module covers the theory behind the logistic
regression model, how to assess its performance, and how to draw statistical inferences from it.
Concepts Used:
Introduction to Logistic Regression
Interpretation from a Logistic Regression Model
Changing the Threshold of a Logistic Regression Model
Evaluation of a Classification Model
Pros and Cons
Learning Outcomes: Understand the foundations of the Logistic Regression Model, how to make
interpretations from it, how to evaluate the performance of classification models, and how
changing the threshold of a Logistic Regression Model can help in improving predictions.
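Fitting a logistic regression and moving its decision threshold can be sketched with scikit-learn on synthetic data (the dataset and the 0.3 threshold are illustrative choices, not values from the course):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, recall_score

# Synthetic binary-classification data
X, y = make_classification(n_samples=400, n_features=5, random_state=0)
model = LogisticRegression().fit(X, y)

# Predicted probabilities for the positive class
proba = model.predict_proba(X)[:, 1]

# Default decision threshold of 0.5
pred_default = (proba >= 0.5).astype(int)

# Lowering the threshold flags more positives: recall rises, precision falls
pred_low = (proba >= 0.3).astype(int)

acc = accuracy_score(y, pred_default)
recall_default = recall_score(y, pred_default)
recall_low = recall_score(y, pred_low)
```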
Case Study: Income group classification (WHO data)- DeltaSquare is an NGO that works with
the Government on matters of social policy to bring about a change in the lives of underprivileged
sections of society. They are tasked with coming up with a policy framework by looking at the
data the government got from WHO. The objective is to analyze the data provided to identify the
different factors that influence the income of an individual, to build a good predictive model for
income, assess its performance, and help in shaping a proposal for the government.
Topic 2- Decision Tree
Decision Tree: Decision Trees are supervised ML algorithms that utilize a hierarchical
structure for decision making and can be used for both classification and regression
problems. This module dives into how a decision tree can be used to model complex,
non-linear data and how to improve the performance of Decision Trees using pruning
techniques.
Concepts Used:
Introduction to Decision Tree
How a Decision Tree is Built
Methods of Pruning a Decision Tree
Different impurity measures
Regression Trees
Pros and Cons
Learning Outcomes: Understand the Decision Tree algorithm, how it’s built, the different
pruning techniques that can be used to improve performance, and learn about the different
impurity measures used to make decisions.
Case Study: Machine Predictive Maintenance- Analyze the data of an auto component
manufacturing company and develop a predictive model to detect potential machine
failures, determine the most influencing factors on machine health, and provide
recommendations for cost optimization to the management.
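A decision tree and the effect of pre-pruning can be sketched with scikit-learn. A bundled dataset stands in for the machine-failure records here, purely for illustration:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.tree import DecisionTreeClassifier

# Stand-in binary-classification dataset for the maintenance records
X, y = load_breast_cancer(return_X_y=True)

# An unpruned tree grows until leaves are pure and tends to overfit
full_tree = DecisionTreeClassifier(random_state=0).fit(X, y)

# Pre-pruning: cap the depth; splits are chosen by Gini impurity here
# (entropy is the other common impurity measure)
pruned = DecisionTreeClassifier(max_depth=3, criterion="gini",
                                random_state=0).fit(X, y)

depth_full, depth_pruned = full_tree.get_depth(), pruned.get_depth()
```

Comparing the two on held-out data (not shown) is what reveals the overfitting; on training data the deeper tree always scores at least as high.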
Module 5: Ensemble Techniques
Combine the decisions from multiple models using ensemble techniques to arrive at more robust
models that can make better predictions.
Topic 1- Bagging and Random Forest
Bagging and Random Forest: Random forest is a popular ensemble learning technique that
comprises several decision trees, each using a subset of the data to learn patterns. The
outputs of the trees are then aggregated to produce the final prediction. This module will
explore how to train a random forest model to solve complex business problems.
Concepts Used:
Introduction to Ensemble Techniques
Introduction to Bagging
Sampling with Replacement
Introduction to Random Forest
Learning Outcomes: Understand how ensemble techniques work, learn about sampling with
replacement and the concept of bagging, and build Random Forest models to make better
predictions.
Case Study: HR Attrition- McCurr Consultancy is an MNC that has thousands of employees
spread across the globe. The company believes in hiring the best talent available and retaining
them for as long as possible. A huge amount of resources is spent on retaining existing
employees through various initiatives. The Head of People Operations wants to bring down the
cost of retaining employees. For this, he proposes limiting the incentives to only those
employees who are at risk of attrition. The objective is to identify patterns in the characteristics
of employees who leave the organization and use that information to predict whether an
employee is at risk of attrition using an ML model. This information will be used to target them
with incentives.
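Training a random forest for a risk-prediction task like this can be sketched with scikit-learn. Synthetic data stands in for the employee records; feature names and all numbers are invented:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the attrition data
X, y = make_classification(n_samples=600, n_features=10, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=1)

# Bagging + random feature subsets: each of the 200 trees is trained on a
# bootstrap sample of rows and considers a random subset of columns per split
rf = RandomForestClassifier(n_estimators=200, random_state=1).fit(X_tr, y_tr)

test_acc = rf.score(X_te, y_te)
importances = rf.feature_importances_   # which features drive the risk score
```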
Topic 2- Boosting
Boosting: Boosting models are robust ensemble models comprising several sub-models, each
of which is developed sequentially to improve upon the errors made by the previous one.
This module will cover essential boosting algorithms like AdaBoost and XGBoost that are
widely used in the industry for accurate and robust predictions.
Concepts Used:
Introduction to Boosting
Boosting Algorithms like Adaboost, Gradient Boost, and XGBoost
Stacking
Learning Outcomes: Understand the concept of boosting, the difference between bagging and
boosting, learn various boosting algorithms, and understand the concept of stacking.
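The sequential idea behind boosting can be sketched with scikit-learn's built-in implementations. Note this uses AdaBoost and gradient boosting from scikit-learn; XGBoost is a separate library with its own API, and the dataset here is synthetic:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=600, n_features=10, random_state=2)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=2)

# AdaBoost: each new weak learner upweights the samples the previous one missed
ada = AdaBoostClassifier(n_estimators=100, random_state=2).fit(X_tr, y_tr)

# Gradient boosting: each new tree fits the residual errors of the ensemble so far
gb = GradientBoostingClassifier(n_estimators=100, random_state=2).fit(X_tr, y_tr)

ada_acc = ada.score(X_te, y_te)
gb_acc = gb.score(X_te, y_te)
```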
Case Study: Bike Sharing- Bike-sharing systems are a new generation of traditional bike rentals
in which the entire process, from membership to rental and return, is automated. Through these
systems, a user can easily rent a bike at one location and return it at another. 'Travel Along' is a
new bike-sharing company that wants to expand its customer count and provide better services
at a reasonable cost. It has conducted several surveys and collated data about weather,
weekends, holidays, etc. from the past 2 years. The objective is to analyze the patterns in the
data and figure out the key areas that can help the organization grow and manage customer
demand. Further, you need to use this information to predict the count of bikes shared so that
the company can plan ahead for surge hours.
Module 6: Model Tuning
Employ feature engineering techniques and hyperparameter tuning to improve model performance
and optimize associated business costs.
Topic 1- Feature Engineering and Cross-Validation
Feature Engineering and Cross-Validation: Feature engineering involves creating new input
features or modifying existing ones to improve a machine learning model's performance, and
cross-validation is used to get a better assessment of model performance. This module
covers these two concepts along with regularization to tune the performance of ML models and
correctly assess their performance.
Concepts Used:
Feature Engineering
Cross-Validation
Oversampling and Undersampling
Regularization
Learning Outcomes: Learn how to handle imbalanced data, how to use the cross-validation
technique to get a better picture of model performance, and understand the concept of
regularization.
Case Study: Job change prediction- An ed-tech company wants to hire data scientists from
among people who have successfully passed some of its courses and then signed up for
training. The company wants to know which of these people are genuinely looking for a job
change and would prefer working with them after completing the training, because this helps
reduce the cost and time of categorizing candidates. Information related to demographics,
education, and experience is available from candidate signup and enrollment. The objective is
to identify the factors affecting whether a person is looking for a job change, build a predictive
model to predict whether a person is looking for a job change, and check whether imbalance
in the data affects model predictions.
Topic 2- ML Pipeline and Hyperparameter Tuning
Concepts Used:
Machine Learning Pipeline
Model Tuning and Performance
Hyperparameter Tuning
Grid Search
Random Search
Learning Outcomes: Learn how to optimize model performance using hyperparameter tuning
and how to automate standard workflows in a machine learning process using pipelines.
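A pipeline combined with grid search can be sketched with scikit-learn. The dataset is synthetic and the parameter grid is an arbitrary illustrative choice:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=8, random_state=4)

# Pipeline: scaling and modeling run as one unit, so preprocessing is
# re-fit inside each CV fold and never leaks validation data
pipe = Pipeline([("scale", StandardScaler()),
                 ("tree", DecisionTreeClassifier(random_state=4))])

# Grid search: exhaustively try hyperparameter combinations with 5-fold CV
grid = GridSearchCV(pipe,
                    param_grid={"tree__max_depth": [2, 4, 6],
                                "tree__min_samples_leaf": [1, 5]},
                    cv=5)
grid.fit(X, y)
best_params = grid.best_params_
best_score = grid.best_score_
```

Random search (`RandomizedSearchCV`) follows the same pattern but samples a fixed number of combinations instead of trying all of them.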
Case Study: Supermarket marketing campaign- ‘All You Need' Supermarket is planning for the
year-end sale. It wants to launch a new offer, a gold membership for only $499 (normally $999)
that gives a 20% discount on all purchases, for existing customers only. To promote it, the
supermarket needs to run a campaign through phone calls, and the best way to reduce the
cost of the campaign is to build a predictive model that classifies customers who might
purchase the offer, using data gathered during last year's campaign.
The objective is to build a model for classifying whether customers will respond positively or
not, identify the different factors that affect the kind of response, and improve the
performance of an initially built model using hyperparameter tuning.
Module 7: Unsupervised Learning
Unlock the power of clustering algorithms to group data based on similarity, unveiling hidden
patterns and intrinsic structures. Explore dimensionality reduction techniques to grasp the
significance of streamlined data analysis.
Topic 1- K-Means Clustering
K-Means Clustering: K-means clustering is a popular unsupervised ML algorithm that is used for
identifying patterns in unlabeled data and grouping it. This module dives into the working of the
algorithm and the important points to keep in mind when implementing it in practical scenarios.
Concepts Used:
Introduction to Clustering
Types of Clustering
K-Means Clustering
Importance of Scaling
Silhouette Score
Visual Analysis of Clustering
Learning Outcomes: Learn about the different types of clustering algorithms, how K-means
clustering works, how to determine the optimal number of clusters by comparing different
metrics, and the importance of scaling data.
Case Study: Engineering Colleges Case Study- Education is fast becoming a very competitive
sector with hundreds of institutions to choose from. It is a life-transforming experience for any
student and it has to be a thoughtful decision. There are ranking agencies that do a survey of all
the colleges to provide more insights to students. Agency ‘RankForYou’ wants to leverage this
year's survey to roll out an editorial article in leading newspapers, on the state of engineering
education in the country. The objective is to cluster the colleges into groups based on the data
provided and come up with evidence-based insights for that article.
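Choosing the number of clusters with the silhouette score, as described above, can be sketched with scikit-learn. Well-separated synthetic blobs stand in for the colleges data, and the candidate range of k is an arbitrary choice:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Three well-separated blobs as a stand-in for the survey data
X, _ = make_blobs(n_samples=300, centers=[[0, 0], [8, 8], [-8, 8]],
                  cluster_std=1.0, random_state=5)

# Scaling matters: K-means uses Euclidean distance, so features measured on
# large scales would otherwise dominate the clustering
X_scaled = StandardScaler().fit_transform(X)

# Compare silhouette scores across k (closer to 1 means tighter, better-separated clusters)
sil = {}
for k in (2, 3, 4, 5):
    labels = KMeans(n_clusters=k, n_init=10, random_state=5).fit_predict(X_scaled)
    sil[k] = silhouette_score(X_scaled, labels)

best_k = max(sil, key=sil.get)   # expected to recover the 3 planted blobs
```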
Topic 2- Hierarchical Clustering and PCA
Hierarchical Clustering and PCA: Hierarchical clustering organizes data into a tree-like structure
of nested clusters, while dimensionality reduction techniques are used to transform data into a
lower-dimensional space while retaining the most important information in it. This module
covers the business applications of hierarchical clustering and how to reduce the dimension of
data using PCA to aid in visualization and feature selection of multivariate datasets.
Concepts Used:
Hierarchical Clustering
Cophenetic Correlation
Introduction to Dimensionality Reduction
Principal Component Analysis
Learning Outcomes: Learn how to apply the hierarchical clustering technique to group similar
data points together and discover underlying patterns, understand the need for reducing
dimensions of the data, and understand the working of the PCA and how to transform data into
fewer dimensions using PCA.
Case Study: Tourism Services- Tourism is now recognized as a directly measurable activity,
enabling more accurate analysis and more effective tourism policies. Whereas previously the
sector relied mostly on approximations from related areas of measurement (e.g. Balance of
Payments statistics), tourism nowadays is a productive activity that can be analyzed using
factors like economic indicators, social indicators, environmental and infrastructure indicators,
etc. The task is to analyze several of these factors and group countries based on them to help
understand the key locations where the company can invest to promote tourism services.
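Hierarchical clustering with the cophenetic correlation, plus PCA for dimensionality reduction, can be sketched as follows. The "countries" and their six indicators are simulated from two latent factors, so roughly two principal components should capture most of the variance by construction:

```python
import numpy as np
from scipy.cluster.hierarchy import cophenet, linkage
from scipy.spatial.distance import pdist
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(6)
# 40 "countries" described by 6 correlated indicators (simulated)
base = rng.normal(size=(40, 2))                       # 2 latent factors
X = np.hstack([base, base @ rng.normal(size=(2, 4))]) + 0.1 * rng.normal(size=(40, 6))

X_scaled = StandardScaler().fit_transform(X)

# Hierarchical clustering; the cophenetic correlation measures how faithfully
# the dendrogram preserves the original pairwise distances
Z = linkage(X_scaled, method="average")
coph_corr, _ = cophenet(Z, pdist(X_scaled))

# PCA: project the 6 indicators onto 2 components and check variance retained
pca = PCA(n_components=2).fit(X_scaled)
explained = pca.explained_variance_ratio_.sum()
```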
ENHANCE KNOWLEDGE WITH
SELF-PACED MODULES
The self-paced modules cater to skills that are complementary to those learnt in the guided
modules. Since not all learners need or want to learn them, they have been kept as self-paced
modules. All these modules have similar high-quality recorded video lectures by UT Austin
faculty, global academicians, and industry experts, but do not have mentorship sessions. You
can learn them at your own pace and schedule, based on your interests and the current and
future demands of your role.
Pre-Work
Gain a fundamental understanding of the basics of Python programming and build a strong
foundation of coding to build Data Science applications.
Generative AI
Get an overview of Generative AI, what ChatGPT is and how it works, delve into the business
applications of ChatGPT, and get an overview of other generative AI models/tools via
demonstrations.
BUILD INDUSTRY-RELEVANT SKILLS WITH
HANDS-ON PROJECTS
7 hands-on projects that will help you with:
Practical Learning
Skill Development
Portfolio Enhancement
READY TO ADVANCE YOUR CAREER?
APPLY NOW
CONTACT US
+1 512 793 9938
https://onlineexeced.mccombs.utexas.edu/online-data-science-business-analytics-course