No-Code ML With DataRobot
No-Code ML With DataRobot
MACHINE
LEARNING
WITH
DATAROBOT
INTRODUCTION
TO MACHINE
LEARNING IN
FINANCE/BANKIN
G
MACHINE LEARNING APPLICATIONS IN FINANCE
“Artificial intelligence & Machine Learning are to trading what fire was to the cavemen”!
1. 1. ROBO ADVISORS
• Robo advisors are AI/ML-based financial advisors that consume client financial status, target return, risk appetite and goals to
generate financial advice (Examples: Betterment and Wealthfront).
• “JP Morgan invests $11.5 billion a year in new data driven technologies. The
company’s machine learning-based Contract Intelligence (COiN) platform reviews
12,000 annual commercial loan agreements in few hours, as opposed to the
360,000 man-hours it would take to do so manually.”
• “Bank of America introduced Erica chatbot that served 6 million users as of March
2019. Erica provide clients relevant, timely guidance and help make managing
their finances easier.”
• Electronic trades account for almost 45% of revenues in cash equities trading”
U.K. research firm Coalition Report.
ource: https://medium.com/eleks-labs/4-powerful-use-cases-for-data-science-in-finance-35d50075ff80
ource: https://emerj.com/ai-sector-overviews/ai-in-banking-analysis/
READING TIME & QUIZ: ML APPLICATIONS IN
FINANCE
• Please read the article below and answer the following quiz.
15 MINS
o Link to Article: https://algorithmxlab.com/blog/applications-
machine-learning-finance/
10 MINS
PROJECT
OVERVIEW:
BANK LOAN
RISK
PREDICTION
PROJECT OVERVIEW & BUSINESS
•CASE
The objective of this case study is to develop a machine learning model
(classifier model) to predict whether a loan is good or bad loan given the
customer (borrower) features such as employment length, loan term, interest
rates, and debt to income ratio.
INPUTS
Employment Length
Home Ownership
OUTPUT
Annual Income
Loan Amount
MACHINE LEARNING
Loan Term MODEL Good Loan
Purpose of loan Bad Loan
Interest rate (CLASSIFIER)
Interest payments
Debt to Income
Ratio
MINI CHALLENGE
• Read this article by McKinsey & Company and answer the following
questions:
https://www.mckinsey.com/business-functions/mckinsey-analytics/our-insigh
ts/the-executives-ai-playbook?page=industries/
1. How much annual value does McKinsey & Company predict AI could
potentially bring in the banking industry?
2. How much of this value is attributed to traditional vs. advanced AI?
3. What about “risk” category?
4. What are the use cases for the fraud and debt analytics?
5. What type of classifiers can be used?
MINI CHALLENGE SOLUTION
• Read this article by McKinsey & Company and answer the following questions:
https://www.mckinsey.com/business-functions/mckinsey-analytics/our-insights/th
e-executives-ai-playbook?page=industries/
1. How much annual value does McKinsey & Company predict AI could potentially
bring in the banking industry? 1.0T
2. How much of this value is attributed to traditional vs. advanced AI? 660.9B vs.
361.5B
3. What about “risk” category? 372.9B
4. What are the use cases for the fraud and debt analytics?
• Detect money laundering schemes
• Detect fraud
• Detect and prevent future defaults
• Credit underwriting
• Improve debt collection strategies
5. What type of classifiers can be used?
• Support vector machines, logistic and linear.
DATAROBOT
DEMO – PART
#1: SIGNUP
AND TRAINING
DATA UPLOAD
DATAROBOT
• DataRobot is the leading end-to-end enterprise ai/ml platform that automates
the process of building, training and deploying ai/ml models at scale.
TRAINING
DATASET
60%
VALIDATION
DATASET
20%
TESTING DATASET
20%
DATAROBOT
DEMO – PART
#3: MODEL
TRAINING &
FEATURE
IMPORTANCE
DATAROBOT
NOW CLICK ON THE START BUTTON
TO BEGIN YOUR TRAINING.
DATAROBOT
NOW, YOU CAN SEE THE PROGRESS OF
YOUR TRAINING IN THE SIDEBAR.
DATAROBOT
WHILE THE TRAINING IS IN PROGRESS, YOU CAN SEE THE
ASSOCIATION BETWEEN FEATURES BY CLICKING ON FEATURE
ASSOCIATION BOX.
DATAROBOT
BY CLICKING ON FEATURE ASSOCIATION PAIRS, YOU CAN SEE
A MUCH CLEARER ASSOCIATION BETWEEN TWO FEATURES.
DATAROBOT
UNDER INSIGHTS, YOU CAN SEE FEATURE IMPORTANCE
DATAROBOT AI
DEMO – PART #4:
CLASSIFICATION
MODELS BASIC
DEFINITIONS
(PRECISION,
RECALL, ROC, &
AUC)
CONFUSION MATRIX
TRUE CLASS
+ -
TYPE I ERROR
+ TRUE + FALSE +
PREDICTIONS
FALSE - TRUE -
-
TYPE II ERROR
CLASSIFICATION MODEL KPIs
o True positives (TP): cases when classifier predicted TRUE (they have the disease), and
correct class was TRUE (patient has disease).
o True negatives (TN): cases when model predicted FALSE (no disease), and correct class
was FALSE (patient do not have disease).
o False positives (FP) (Type I error): classifier predicted TRUE, but correct class was FALSE
(patient did not have disease).
o False negatives (FN) (Type II error): classifier predicted FALSE (patient do not have
disease), but they actually do have the disease
TRUE CLASS
PREDICTIONS
+ -
+ TRUE + FALSE +
FALSE - TRUE -
-
PRECISION Vs. RECALL EXAMPLE
RANDOM
PREDICTOR
IF YOU SET THE THRESHOLD TOO HIGH, YOU WILL BECOME SUPER SELECTIVE,
MEANING YOUR RECALL (TRUE POSITIVE RATE)~0 BUT YOUR FALSE POSITIVE RATE
WILL REDUCE DRAMATICALLY AS WELL (YOU WON’T MAKE MANY MISTAKES)
GREAT RESOURCE BY JASON BROWNLEE:
DATAROBOT HTTPS://MACHINELEARNINGMASTERY.COM/THRESHOLD-
MOVING-FOR-IMBALANCED-CLASSIFICATION/
IF YOU SET THE THRESHOLD LITTLE LOWER, YOU WILL NOT BECOME SELECTIVE
ANYMORE AND MOST DATA POINTS WILL BE CLASSIFIED AS 1, MEANING YOUR RECALL
(TRUE POSITIVE RATE) WILL BE EQUAL ONE (BECAUSE YOU HAVE DETECTED ALL BAD
LOANS) BUT YOUR FALSE POSITIVE RATE WILL INCREASE DRAMATICALLY AS WELL (YOU
WILL MESS UP TOO MUCH AND MAKE MANY MISTAKES).
DATAROBOT TO EVALUATE FEATURE FIT OF THE
MODEL, CLICK ON THE COMPUTE
FEATURE FIT BOX
DATAROBOT
FOR AI/ML MODEL EXPLAINABILITY PURPOSES, WE CAN EXPLORE
FEATURE IMPACTS AND FEATURE EFFECTS AS SHOWN BELOW.
DATAROBOT
FOR AI/ML MODEL EXPLAINABILITY PURPOSES, WE CAN EXPLORE
FEATURE IMPACTS AND FEATURE EFFECTS AS SHOWN BELOW.
DATAROBOT
HERE, YOU CAN SEE THE FEATURES AND
THEIR EFFECTS ON THE MODEL TO COME
UP WITH A DECISION.
DATAROBOT DEMO
– PART #6:
HYPERPARAMETER
S TUNING
DATAROBOT
YOU CAN TUNE THE HYPERPARAMETERS BY CLICKING ON ADVANCED
TUNING OPTION. YOU CAN SPECIFY THE VALUES OF PREDICTION
PARAMETER.
DATAROBOT
TUNE THE MODEL PARAMTERS
DATAROBOT
ONCE YOU MAKE CHANGES TO THE PARAMETER, CLICK
ON UPDATE PARAMETER.
CLICK ON BEGIN TUNING TO START HYPER PARAMETER TUNING. THE NEW
DATAROBOT MODEL WILL BE ADDED TO THE LEADERBOARD
DATAROBOT
THE NEW MODEL SHOWN ON THE BOTTOM OF THE LEADERBOARD.
DATAROBOT
annual_
emp_length_int home_ownership income_category loan_amount term application_type purpose interest_payments interest_rate grade dti region
inc
0.6 RENT Low 31000 2600 65 months INDIVIDUAL car High 16 C 1.5 leinster
DATAROBOT