RMSC3001 2023-24 PS2

Uploaded by

michaelng1112

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views2 pages

RMSC3001 2023-24 PS2

Uploaded by

michaelng1112

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

THE CHINESE UNIVERSITY OF HONG KONG

Department of Statistics

RMSC3001: Principles of Credit Risk Management

Problem Sheet 2 This problem sheet is for NG, Sze Leong ???, student ID 1155159866, sent to
[email protected].

The deadline for this Problem Sheet is 2359 on Saturday 17th February. Please submit your solutions via the
link provided on the course Blackboard page - if you must submit your solutions in hard copy, please contact me
at [email protected] in advance. No late submissions will be accepted. A late submission will
receive a mark of zero. Students may discuss set problems with others, but their final submissions must be their
own work. Do show your working - it helps us to give you marks.
Please answer the following problems.

1. “Credit Risk Analytics: Measurement Techniques, Applications, and Examples in SAS” (2016) by Baesens,
Roesch and Schuele uses the HMEQ dataset to illustrate credit scoring models. Access the online version of
the book via the CUHK library website: p39 describes the dataset’s response and characteristics; output of a
SAS modelling procedure applied to the data is shown in Exhibit 5.6 on p134-135.
Read the description of this dataset to understand what the characteristics and possible attributes are. Food
for thought (i.e. no need to write down the answer to this question): how do you expect these characteristics
to affect the probability the applicant is Good?
(a) The original dataset has 12 characteristics and one response BAD = 1 if the applicant defaulted or is
seriously delinquent, = 0 otherwise. How many characteristics does the model presented in Exhibit 5.6
use? Write down those which have been discarded.
(b) The model in Exhibit 5.6 is a logistic regression. Note that according to the documentation for LOGISTIC
procedure in SAS, SAS doesn’t sees the 0s and 1s in the BAD column as numbers but rather as labels
and defaults to model the logit of the lowest label - in this case, 0. Write out the fitted model in full,
with all the estimated parameters.
(c) Use the model to estimate the probability of borrowers with the following attributes not defaulting:
i. LOAN = 3578, MORTDUE = 102370, VALUE = 120953, REASON = HomeImp, JOB = Office,
YOJ = 2, DEROG = 0, DELINQ = 0, CLAGE = 260.3315, NINQ = 0, CLNO = 13, DEBTINC =
31.5885
ii. LOAN = 65500, MORTDUE = 205156, VALUE = 290239, REASON = DebtCon, JOB = ProfExe,
YOJ = 2, DEROG =0, DELINQ = 0, CLAGE = 98.8082, NINQ = 1, CLNO = 21, DEBTINC =
130.661.
(d) Exhibit 5.11 on p139-141 provides a scorecard based on the logistic regression model in Exhibit 5.6.
Calculate the scores for both the applicants in the previous question.
2. You test an imaginary scorecard on a dataset in which customers’ data and response (good or bad) are
recorded. Say only six scores (150, 200, 250, 300, 350 and 400) are possible. You observe the following
Of all the customers with a score of 150, 0 are good, 12 are bad.
Of all the customers with a score of 200, 16 are good, 11 are bad.
Of all the customers with a score of 250, 92 are good, 15 are bad.
Of all the customers with a score of 300, 194 are good, 18 are bad.
Of all the customers with a score of 350, 360 are good, 12 are bad.
Of all the customers with a score of 400, 208 are good, 0 are bad

(a) Calculate the co-ordinates of the five points on the ROC curve between (0, 0) and (1, 1) as the score
increases.
(b) Use Excel or other software to plot the ROC curve.

1
(c) Without using R, calculate the AUROC and Gini coefficient.

3. For this question, please use R and Excel and include your R code and xlsx file in your submission.
Download the “Default.csv” file, originally from Introduction to Statistical Learning, 1st Ed. by James,
Witten, Hastie and Tibshirani, from the Problem Sheet 2 content area on Blackboard. Split the data into
two parts: from Row 2 to Row 7001 inclusive, which we call the training set; from Row 7002 to Row 10001
inclusive, which we call the testing set. The dataset has four columns
default: binary response for whether the credit card holder defaulted that month or not;
student: binary characteristic, whether the credit card holder was a student or not;
balance: numeric characteristic, the holder’s credit card balance at the end of that month;
income: numeric characteristic, the credit card holder’s annual income.

(a) Use R (or other software) to fit a logistic regression model to the training set, using all three character-
istics. To be clear, you should fit log(Ω(Good|Characteristics)). You don’t need to coarse classify the
characteristics; you don’t need to include interactions between characteristics in your model. Write down
the parameter estimates, their standard errors and their Z-statistics. Write down the fitted equation.
(b) Now treat the logit of the credit card holder as a score. Find its AUROC on the testing set.
(c) Investigate possible three possible cut-off scores by calculating confusion matrices for P (G|x) = 0.5, 0.75, 0.9
by applying your scoring system to the testing set.
(d) The specificity of a prediction is the probability that given H0 is true, the prediction is correct. The
sensitivity of a prediction is the probability that that given H0 is not true, the prediction is correct. How
are specificity and sensitivity related to Type I and Type II errors?
(e) Taking H0 to be “the applicant is Good”, comment on the sensitivities and specificities resulting from
using the three cut-off scores in part (b).
(f) Calculate the three swap set matrices for the three possible pairings of cut-off scores. Comment.

THE END

Analysis of German Credit Data
100% (1)
Analysis of German Credit Data
24 pages
Topics in Finite and Discrete Mathematics - Sheldon M. Ross
100% (1)
Topics in Finite and Discrete Mathematics - Sheldon M. Ross
279 pages
Introduction To Modern Industrial Engineering
100% (2)
Introduction To Modern Industrial Engineering
221 pages
Abu Dhabi Ports Company (PJSC) : Shamal Development - New 33/11Kv Substation
No ratings yet
Abu Dhabi Ports Company (PJSC) : Shamal Development - New 33/11Kv Substation
52 pages
Capstone Project
100% (1)
Capstone Project
7 pages
Part 1 Building Your Own Binary Classification Model
43% (14)
Part 1 Building Your Own Binary Classification Model
6 pages
2022 ECN 3311 Test 2
No ratings yet
2022 ECN 3311 Test 2
4 pages
Logistic Regression:: PGP Dse Bangalore July 2018
No ratings yet
Logistic Regression:: PGP Dse Bangalore July 2018
62 pages
DC-6 Om
100% (4)
DC-6 Om
522 pages
Credit Scoring
No ratings yet
Credit Scoring
26 pages
ch4 PDF
No ratings yet
ch4 PDF
32 pages
Asb 3303
No ratings yet
Asb 3303
9 pages
446 PDF
No ratings yet
446 PDF
19 pages
Conjugate Beam Method SLU
No ratings yet
Conjugate Beam Method SLU
41 pages
EAD Model Development Using SAS
No ratings yet
EAD Model Development Using SAS
12 pages
Credit-Scoring-CASE
No ratings yet
Credit-Scoring-CASE
29 pages
MKT Research HW 5
No ratings yet
MKT Research HW 5
5 pages
Logistic Regression - Exercises
No ratings yet
Logistic Regression - Exercises
8 pages
Team 14 - Project Documentation - Taiwan Credit Defaults v1.0
No ratings yet
Team 14 - Project Documentation - Taiwan Credit Defaults v1.0
3 pages
Customer Scoring - Case Study
No ratings yet
Customer Scoring - Case Study
15 pages
Reading Material - Module-5 - Introduction To Special Topics
No ratings yet
Reading Material - Module-5 - Introduction To Special Topics
27 pages
Capstone Project Report v1 - Abhishek Bihani
No ratings yet
Capstone Project Report v1 - Abhishek Bihani
16 pages
Plumbing Tools and Their Uses
No ratings yet
Plumbing Tools and Their Uses
6 pages
Quiz 1
No ratings yet
Quiz 1
2 pages
Intro LOGIT
No ratings yet
Intro LOGIT
46 pages
31st MCMC
No ratings yet
31st MCMC
11 pages
Note 4
No ratings yet
Note 4
18 pages
Progress Report 2
No ratings yet
Progress Report 2
10 pages
Scan Mar 30, 2023 PDF
No ratings yet
Scan Mar 30, 2023 PDF
23 pages
Binary Logistic
No ratings yet
Binary Logistic
29 pages
Regression Log
No ratings yet
Regression Log
4 pages
Regression Analysis
No ratings yet
Regression Analysis
16 pages
Midterm AI
No ratings yet
Midterm AI
18 pages
Assignment 03 DUE DATE: 2 August 2021 UNIQUE NUMBER: 899315: Instructions
No ratings yet
Assignment 03 DUE DATE: 2 August 2021 UNIQUE NUMBER: 899315: Instructions
3 pages
Jana Sir - Final
No ratings yet
Jana Sir - Final
19 pages
Part 1: Building Your Own Binary Classification Model: Data - Final Project
No ratings yet
Part 1: Building Your Own Binary Classification Model: Data - Final Project
9 pages
Practice Final
No ratings yet
Practice Final
15 pages
Building Credit Scorecard
No ratings yet
Building Credit Scorecard
58 pages
Lending Club Data Analysis and Default
No ratings yet
Lending Club Data Analysis and Default
10 pages
CAP5768 Homework3
No ratings yet
CAP5768 Homework3
10 pages
Assignment 3 F1 - F4
No ratings yet
Assignment 3 F1 - F4
19 pages
Assignment 1 DA - E Oct 2023 V1-1
No ratings yet
Assignment 1 DA - E Oct 2023 V1-1
3 pages
Omicron
No ratings yet
Omicron
23 pages
Exam 2
No ratings yet
Exam 2
21 pages
RMSC3001 PS1 2023-24
No ratings yet
RMSC3001 PS1 2023-24
2 pages
BA II - End Sem Exam - 2024
No ratings yet
BA II - End Sem Exam - 2024
5 pages
Credit Defaulter Classifier 1659348484
No ratings yet
Credit Defaulter Classifier 1659348484
7 pages
Credit Scoring Modelling For Retail Banking Sector
No ratings yet
Credit Scoring Modelling For Retail Banking Sector
9 pages
Development and Validation of Credit-Scoring Models
No ratings yet
Development and Validation of Credit-Scoring Models
70 pages
Background 2.1. Logistic Definition
No ratings yet
Background 2.1. Logistic Definition
6 pages
November 2010)
No ratings yet
November 2010)
6 pages
MAT2148-Problem Sheet 2
No ratings yet
MAT2148-Problem Sheet 2
5 pages
Logistic Regression
No ratings yet
Logistic Regression
41 pages
MBA786M Project
No ratings yet
MBA786M Project
2 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
Past Exam
No ratings yet
Past Exam
5 pages
Employee Welfare
No ratings yet
Employee Welfare
44 pages
A01 Assessing Credit Risk Using Logistic Regression
No ratings yet
A01 Assessing Credit Risk Using Logistic Regression
1 page
Capstone Presentation Final
No ratings yet
Capstone Presentation Final
14 pages
Ppa Final Project
No ratings yet
Ppa Final Project
17 pages
Assignment3 05.01.24
No ratings yet
Assignment3 05.01.24
4 pages
Linear+Regression+ +transcription
No ratings yet
Linear+Regression+ +transcription
22 pages
MN67672 Eng
No ratings yet
MN67672 Eng
22 pages
Reject Inference Methodologies in Credit Risk Modeling
No ratings yet
Reject Inference Methodologies in Credit Risk Modeling
10 pages
Hanover Report 1978
100% (1)
Hanover Report 1978
10 pages
What Is Athletic Sports and Management?
No ratings yet
What Is Athletic Sports and Management?
3 pages
Term 2 Year 1 Hass
No ratings yet
Term 2 Year 1 Hass
13 pages
VK Liste 2017
No ratings yet
VK Liste 2017
29 pages
Risk Ranger
No ratings yet
Risk Ranger
31 pages
Pickle Brand Auditing and Strengthening
No ratings yet
Pickle Brand Auditing and Strengthening
34 pages
Important: Service Data Sheet
No ratings yet
Important: Service Data Sheet
4 pages
TH 2
No ratings yet
TH 2
4 pages
Nature 14432
No ratings yet
Nature 14432
17 pages
Experimental Investigation of Circular Concrete Filled Steel Tube Geometry On Seismic Performance
No ratings yet
Experimental Investigation of Circular Concrete Filled Steel Tube Geometry On Seismic Performance
54 pages
Hyaluronic Acid
No ratings yet
Hyaluronic Acid
7 pages
Amcas Coursework Video
100% (2)
Amcas Coursework Video
7 pages
John B. Goodenough
No ratings yet
John B. Goodenough
11 pages
Technical Data Sheet & Processing Guide: ENMAT™ Thermoplastics Resin Y1000P
No ratings yet
Technical Data Sheet & Processing Guide: ENMAT™ Thermoplastics Resin Y1000P
6 pages
Hopf Bifurcation Normal Form
100% (2)
Hopf Bifurcation Normal Form
3 pages
American Ethnologist - February 1987 - BROWN - Religion Class and Context Continuities and Discontinuities in Brazilian
No ratings yet
American Ethnologist - February 1987 - BROWN - Religion Class and Context Continuities and Discontinuities in Brazilian
21 pages
Bio Paper 5 PDF
No ratings yet
Bio Paper 5 PDF
8 pages
Dilution Systems For Aerosols Series DIL, DDS and HDS: Special Advantages
No ratings yet
Dilution Systems For Aerosols Series DIL, DDS and HDS: Special Advantages
4 pages
Azure Iot (Complete Steps 1-9 in Order) : Login With Your Live Id To Receive Credit
No ratings yet
Azure Iot (Complete Steps 1-9 in Order) : Login With Your Live Id To Receive Credit
2 pages
2nde Unit 6 Speaking
No ratings yet
2nde Unit 6 Speaking
3 pages
ED Mid
No ratings yet
ED Mid
1 page
PMP Question Bank
From Everand
PMP Question Bank
Mohammad Usmani
4/5 (34)
Data Interpretation Guide For All Competitive and Admission Exams
From Everand
Data Interpretation Guide For All Competitive and Admission Exams
Mohmmad Khaja Shareef
2.5/5 (6)
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
IGNOU PGDCA MCS 207 Database Management Systems Previous Years Unsolved Papers
From Everand
IGNOU PGDCA MCS 207 Database Management Systems Previous Years Unsolved Papers
Manish Soni
No ratings yet
IGNOU MCA Previous Years Unsolved Papers All in One
From Everand
IGNOU MCA Previous Years Unsolved Papers All in One
Manish Soni
No ratings yet

RMSC3001 2023-24 PS2

Uploaded by

RMSC3001 2023-24 PS2

Uploaded by

THE CHINESE UNIVERSITY OF HONG KONG

RMSC3001: Principles of Credit Risk Management

You might also like