Data Analytics Using R (DA-R)
INTRODUCTION
Outline
• Introduction
• Different Types of Data Analytics
• Prediction Effect
• Types of Problems in Data Analytics
• Why R?
• Course Plan
Scenario 1
STAFF SCHEDULING AT VUMC
• Vanderbilt University Medical Center (VUMC) is one
of the leading hospitals.
• VUMC maintains 55 operating rooms across
different sites.
• VUMC schedules elective (non-emergency)
surgeries primarily on weekdays.
VUMC Operations
• The charge nurse reports the schedule for
the next day to the admin director.
• If the number of booked cases is low, the
admin director decides to close some
operating rooms.
• The charge nurse also asks some
operating room nurses to take a paid
holiday.
• If the number of booked cases is high,
the admin director asks the charge nurse
to call in extra operating room nurses.
Challenges at VUMC
• VUMC assumes that surgeries are spread
evenly across all weekdays in a month.
• Recently, VUMC has observed a large
variation in daily surgical case volume
(the number of surgeries to be performed).
• This is creating a major problem for
surgical staff scheduling.
• Case mix: elective surgeries 94%, add-on cases 6%.
Potential Causes of Variation
• Surgeries are generally scheduled
earlier in the week and earlier in the
day.
• Sometimes no surgeries are scheduled
in a week for various reasons.
• 6% of cases are add-on cases.
Why is Staff Scheduling so Important?
Overstaffing
• It may not be possible to cancel staff at short notice (labour relations). Even where
it is possible, last-minute changes may hurt employee satisfaction, as most employees
want predictable schedules.
Understaffing
• It may not be possible to find someone available to work at short notice. A shortage
of nurses may delay surgeries.
Objective
• To resolve issues related to staff scheduling.
Data
• Actual Number of Surgeries
• Number of Surgeries booked in advance
• Day of the Week
Scenario 2
FINDING RIGHT CUSTOMERS AT AUTO FINANCE LTD.
• Auto Finance Ltd. is a major player in the two-wheeler
business in India.
• Many of the people buying two-wheelers belong to the
lower-middle class in India and do not have access to enough
capital.
• Auto Finance Ltd. provides loans, typically on a fixed interest
rate for 3-5 years, to enable cash-strapped customers to buy
the vehicle.
• The loan facility has enabled Auto Finance Ltd. to attract a
new customer segment.
Challenges at Auto Finance Ltd.
• Recently, Auto Finance Ltd. has faced
a major issue.
• Around 70% of the customers have
delayed their repayments; only 30% pay on time.
• In deciding whether to grant
credit, the credit provider considers
the trade-off between the interest
income and the possibility of the borrower
defaulting.
Objective
• To reduce the loss due to the high default rate.
Data
• Auto Finance Ltd. records the default status for
each customer.
• It also maintains a large database of
customer-specific information such as age, gender,
income, employment details, etc.
Scenario 3
TALENT ACQUISITION BY SCALENEWORKS
• Scaleneworks, a Bangalore-based start-up,
supports many IT companies in India with talent
acquisition.
• It advises its customers on the status of modern talent
acquisition practices.
• It recommends and implements individually tailored,
viable solutions.
Business Problem
• The top management has observed
that several candidates have not joined
the organization even after accepting
the offer.
• Owing to this, the cost of hiring has increased
by between 10% and 15%.
• In an IT firm, suppose 12,000 offers
are rolled out every year.
• At a 30% renege rate, approximately
3,600 candidates accept the offer and
then do not join the company.
• The company would have spent 15 man-hours
per candidate in the recruitment lifecycle.
Source:
https://www.sciencedirect.com/science/article/pii/S0148296318302480
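The arithmetic in this example can be checked with a short sketch. The three input figures come from the slide. The variable names are illustrative, and multiplying the reneged candidates by 15 man-hours to get the "lost effort" is an assumption about how the per-candidate figure is meant to be used.

```python
# Figures from the slide; variable names are illustrative only.
offers_per_year = 12_000
renege_rate = 0.30          # share of candidates who accept but never join
hours_per_candidate = 15    # man-hours spent per candidate in recruitment

# Candidates who accept the offer and then do not join.
reneged_candidates = round(offers_per_year * renege_rate)

# Assumption: effort spent on a reneged candidate counts as lost in full.
lost_hours = reneged_candidates * hours_per_candidate

print(f"Candidates who renege: {reneged_candidates}")
print(f"Recruitment effort lost: {lost_hours} man-hours")
```

Under that assumption, the effort lost each year is the renege count times the per-candidate effort, which is why even a small reduction in the renege rate matters.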
Descriptive Analytics
• Descriptive analytics consists of a set of techniques
that describe what has happened in the past.
• Examples: Data Queries, Reports, Descriptive
Statistics, Data Visualization, etc.
Diagnostic Analytics
• Diagnostic analytics (a natural extension of
descriptive analytics) examines data or content to
answer the question "Why did it happen?"
• It requires exploratory analysis of the existing
data, and sometimes of additional data, using tools and
techniques such as visualization, data discovery, and data
mining to discover the root causes of a
problem.
Predictive Analytics
• Predictive analytics comprises a set of
techniques that use models constructed from
past data to predict the future or to study the impact
of one variable on another.
• Examples: Linear Regression, Logistic Regression,
etc.
Prescriptive Analytics
• Prescriptive analytics recommends the best course of
action to take, i.e., the output of a prescriptive
analytics model is the best solution.
• A common example is portfolio models in finance,
which determine the mix of investments that yields
the highest expected return while limiting the
exposure to risk.
Predictive Analytics
WHY IS PREDICTION SO IMPORTANT?
Suppose you have a list of 1,000,000 prospects and have observed that one out of 100
of them will buy your product (i.e., 10,000 responses).
Suppose predictive analytics (PA) earmarks a quarter of the entire list and says: "These
folks are three times more likely to respond than average!"
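The effect described here can be worked through numerically. In the minimal sketch below, the list size is not taken as given: it is inferred from 10,000 responses at a one-in-100 rate. The variable names are illustrative, and "three times more likely" is read as three times the average response rate.

```python
# Figures from the slide.
responses = 10_000      # buyers expected if the whole list is contacted
base_rate = 1 / 100     # one out of 100 prospects buys

# Inferred, not stated: the list size implied by the two figures above.
list_size = round(responses / base_rate)

targeted_share = 0.25   # PA earmarks a quarter of the list
lift = 3                # assumption: 3x the average response rate

targeted_contacts = round(list_size * targeted_share)
targeted_responses = round(targeted_contacts * base_rate * lift)
share_of_buyers = targeted_responses / responses

print(f"Contacts in the targeted quarter: {targeted_contacts}")
print(f"Responses from that quarter: {targeted_responses}")
print(f"Share of all buyers reached: {share_of_buyers:.0%}")
```

Contacting only the earmarked quarter reaches three quarters of the buyers at a quarter of the contact cost.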
Regression Problem
[Diagram: predictors X1, X2, X3 → Regression Model → predicted Y. Both Y and the predicted Y are quantitative.]
Regression Problem: Examples
• Staff Scheduling at VUMC
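As a rough illustration of a regression problem, the sketch below fits a one-predictor least-squares line that predicts the actual number of surgeries from the number booked in advance. The data values are hypothetical, invented only to make the example runnable. The function name `fit_simple_ols` is also an assumption, and the day-of-the-week variable is left out for brevity.

```python
from statistics import mean

def fit_simple_ols(x, y):
    """Least-squares fit of y = intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope

# Hypothetical data: cases booked in advance vs. cases actually performed.
booked = [80, 95, 100, 110, 120, 90]
actual = [92, 104, 112, 121, 133, 101]

intercept, slope = fit_simple_ols(booked, actual)

# Predict the case volume for a day with 105 cases booked (hypothetical).
predicted_cases = intercept + slope * 105
print(f"intercept={intercept:.2f}, slope={slope:.2f}")
print(f"predicted cases for 105 booked: {predicted_cases:.1f}")
```

The response here is a count of surgeries, a quantitative variable, which is what makes this a regression problem rather than a classification problem.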
Classification Problem
[Diagram: predictors X1, X2, X3 → Classification Model → predicted class labels. Both Y and the predicted class labels are qualitative.]
Classification Problem: Examples
• Finding Right Customers at Auto Finance Ltd.
• Talent Acquisition by Scaleneworks
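As a rough illustration of a classification problem, the sketch below fits a one-predictor logistic regression by plain gradient descent and turns the predicted probabilities into class labels. The data are hypothetical (an income-like predictor and a delayed-payment flag), and the function names, learning rate, step count and 0.5 threshold are all assumptions chosen for the example.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(x, y, learning_rate=0.1, steps=5000):
    """Fit P(y = 1 | x) = sigmoid(bias + weight * x) by gradient descent."""
    bias, weight = 0.0, 0.0
    n = len(x)
    for _ in range(steps):
        # Difference between the predicted probability and the observed label.
        errors = [sigmoid(bias + weight * xi) - yi for xi, yi in zip(x, y)]
        bias -= learning_rate * sum(errors) / n
        weight -= learning_rate * sum(e * xi for e, xi in zip(errors, x)) / n
    return bias, weight

def predict_label(bias, weight, xi, threshold=0.5):
    """Return class 1 when the predicted probability reaches the threshold."""
    return 1 if sigmoid(bias + weight * xi) >= threshold else 0

# Hypothetical data: income in arbitrary units, and 1 = payment delayed.
income = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5]
delayed = [1, 1, 1, 0, 1, 0, 0, 0]

bias, weight = fit_logistic(income, delayed)
labels = [predict_label(bias, weight, xi) for xi in income]
accuracy = sum(p == a for p, a in zip(labels, delayed)) / len(delayed)
print(f"weight={weight:.2f}, predicted labels={labels}, accuracy={accuracy:.2f}")
```

The output is a class label (delayed or not delayed), a qualitative variable, which is what distinguishes this from the regression setting.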
Supervised Learning: Techniques
• Linear Regression
• Logistic Regression
• Decision Trees
• Bagging
• Random Forest
• Boosting