100% found this document useful (1 vote)

298 views35 pages

Credit Risk Modeling in Python Chapter4

This document provides an overview of credit risk modeling techniques in Python. It discusses comparing classification reports from models, ROC and AUC analysis to evaluate model performance, ensuring model calibration by having predicted probabilities accurately represent confidence levels, and calculating and plotting calibration curves for interpretation. It also covers setting thresholds to determine loan acceptance and calculating acceptance rates, bad rates, and expected portfolio losses to evaluate different risk strategies. The goal is to select a strategy that minimizes expected loss while maintaining an acceptable approval rate.

Uploaded by

Fgpeqw

Available Formats

Download as PDF, TXT or read online on Scribd

Download as pdf or txt

100% found this document useful (1 vote)

298 views35 pages

Credit Risk Modeling in Python Chapter4

Uploaded by

Fgpeqw

Available Formats

Download as PDF, TXT or read online on Scribd

Download as pdf or txt

You are on page 1/ 35

Model evaluation

and implementation
CREDIT RIS K MODELIN G IN P YTH ON

Michael Crabtree
Data Scientist, Ford Motor Company
Comparing classi cation reports
Create the reports with classification_report() and compare

CREDIT RISK MODELING IN PYTHON

ROC and AUC analysis
Models with better performance will have more lift

More lift means the AUC score is higher

CREDIT RISK MODELING IN PYTHON

Model calibration
We want our probabilities of default to accurately represent the model's con dence level
The probability of default has a degree of uncertainty in it's predictions

A sample of loans and their predicted probabilities of default should be close to the percentage of
defaults in that sample

Sample of loans Average predicted PD Sample percentage of actual defaults Calibrated?

10 0.12 0.12 Yes

10 0.25 0.65 No

1
http://datascienceassn.org/sites/default/ les/Predicting%20good%20probabilities%20with%20supervised%20learning.

CREDIT RISK MODELING IN PYTHON

Calculating calibration
Shows percentage of true defaults for each predicted probability

Essentially a line plot of the results of calibration_curve()

from sklearn.calibration import calibration_curve

calibration_curve(y_test, probabilities_of_default, n_bins = 5)

# Fraction of positives
(array([0.09602649, 0.19521012, 0.62035996, 0.67361111]),
# Average probability
array([0.09543535, 0.29196742, 0.46898465, 0.65512207]))

CREDIT RISK MODELING IN PYTHON

Plotting calibration curves
plt.plot(mean_predicted_value, fraction_of_positives, label="%s" % "Example Model")

CREDIT RISK MODELING IN PYTHON

Checking calibration curves
As an example, two events selected (above and below perfect line)

CREDIT RISK MODELING IN PYTHON

Calibration curve interpretation

CREDIT RISK MODELING IN PYTHON

Calibration curve interpretation

CREDIT RISK MODELING IN PYTHON

Let's practice!
CREDIT RIS K MODELIN G IN P YTH ON
Credit acceptance
rates
CREDIT RIS K MODELIN G IN P YTH ON

Michael Crabtree
Data Scientist, Ford Motor Company
Thresholds and loan status
Previously we set a threshold for a range of prob_default values
This was used to change the predicted loan_status of the loan

preds_df['loan_status'] = preds_df['prob_default'].apply(lambda x: 1 if x > 0.4 else 0)

Loan prob_default threshold loan_status

1 0.25 0.4 0

2 0.42 0.4 1

3 0.75 0.4 1

CREDIT RISK MODELING IN PYTHON

Thresholds and acceptance rate
Use model predictions to set better thresholds
Can also be used to approve or deny new loans

For all new loans, we want to deny probable defaults

Use the test data as an example of new loans

Acceptance rate: what percentage of new loans are accepted to keep the number of defaults in a
portfolio low
Accepted loans which are defaults have an impact similar to false negatives

CREDIT RISK MODELING IN PYTHON

Understanding acceptance rate
Example: Accept 85% of loans with the lowest prob_default

CREDIT RISK MODELING IN PYTHON

Calculating the threshold
Calculate the threshold value for an 85% acceptance rate

import numpy as np
# Compute the threshold for 85% acceptance rate
threshold = np.quantile(prob_default, 0.85)

0.804

Loan prob_default Threshold Predicted loan_status Accept or Reject

1 0.65 0.804 0 Accept

2 0.85 0.804 1 Reject

CREDIT RISK MODELING IN PYTHON

Implementing the calculated threshold
Reassign loan_status values using the new threshold

# Compute the quantile on the probabilities of default

preds_df['loan_status'] = preds_df['prob_default'].apply(lambda x: 1 if x > 0.804 else 0)

CREDIT RISK MODELING IN PYTHON

Bad Rate
Even with a calculated threshold, some of the accepted loans will be defaults

These are loans with prob_default values around where our model is not well calibrated

CREDIT RISK MODELING IN PYTHON

Bad rate calculation

#Calculate the bad rate

np.sum(accepted_loans['true_loan_status']) / accepted_loans['true_loan_status'].count()

If non-default is 0 , and default is 1 then the sum() is the count of defaults

The .count() of a single column is the same as the row count for the data frame

CREDIT RISK MODELING IN PYTHON

Let's practice!
CREDIT RIS K MODELIN G IN P YTH ON
Credit strategy and
minimum expected
loss
CREDIT RIS K MODELIN G IN P YTH ON

Michael Crabtree
Data Scientist, Ford Motor Company
Selecting acceptance rates
First acceptance rate was set to 85%, but other rates might be selected as well

Two options to test different rates:

Calculate the threshold, bad rate, and losses manually

Automatically create a table of these values and select an acceptance rate

The table of all the possible values is called a strategy table

CREDIT RISK MODELING IN PYTHON

Setting up the strategy table
Set up arrays or lists to store each value

# Set all the acceptance rates to test

accept_rates = [1.0, 0.95, 0.9, 0.85, 0.8, 0.75, 0.7, 0.65, 0.6, 0.55,
0.5, 0.45, 0.4, 0.35, 0.3, 0.25, 0.2, 0.15, 0.1, 0.05]
# Create lists to store thresholds and bad rates
thresholds = []
bad_rates = []

CREDIT RISK MODELING IN PYTHON

Calculating the table values
Calculate the threshold and bad rate for all acceptance rates

for rate in accept_rates:

# Calculate threshold
threshold = np.quantile(preds_df['prob_default'], rate).round(3)
# Store threshold value in a list
thresholds.append(np.quantile(preds_gbt['prob_default'], rate).round(3))
# Apply the threshold to reassign loan_status
test_pred_df['pred_loan_status'] = \
test_pred_df['prob_default'].apply(lambda x: 1 if x > thresh else 0)
# Create accepted loans set of predicted non-defaults
accepted_loans = test_pred_df[test_pred_df['pred_loan_status'] == 0]
# Calculate and store bad rate
bad_rates.append(np.sum((accepted_loans['true_loan_status'])
/ accepted_loans['true_loan_status'].count()).round(3))

CREDIT RISK MODELING IN PYTHON

Strategy table interpretation
strat_df = pd.DataFrame(zip(accept_rates, thresholds, bad_rates),
columns = ['Acceptance Rate','Threshold','Bad Rate'])

CREDIT RISK MODELING IN PYTHON

Adding accepted loans
The number of loans accepted for each acceptance rate
Can use len() or .count()

CREDIT RISK MODELING IN PYTHON

Adding average loan amount
Average loan_amnt from the test set data

CREDIT RISK MODELING IN PYTHON

Estimating portfolio value
Average value of accepted loan non-defaults minus average value of accepted defaults

Assumes each default is a loss of the loan_amnt

CREDIT RISK MODELING IN PYTHON

Total expected loss
How much we expect to lose on the defaults in our portfolio

# Probability of default (PD)

test_pred_df['prob_default']
# Exposure at default = loan amount (EAD)
test_pred_df['loan_amnt']
# Loss given default = 1.0 for total loss (LGD)
test_pred_df['loss_given_default']

CREDIT RISK MODELING IN PYTHON

Let's practice!
CREDIT RIS K MODELIN G IN P YTH ON
Course wrap up
CREDIT RIS K MODELIN G IN P YTH ON

Michael Crabtree
Data Scientist, Ford Motor Company
Your journey...so far
Prepare credit data for machine learning models
Important to understand the data

Improving the data allows for high performing simple models

Develop, score, and understand logistic regressions and gradient boosted trees

Analyze the performance of models by changing the data

Understand the nancial impact of results

Implement the model with an understanding of strategy

CREDIT RISK MODELING IN PYTHON

Risk modeling techniques
The models and framework in this course:
Discrete-time hazard model (point in time): the probability of default is a point-in-time event

Stuctural model framework: the model explains the default even based on other factors

Other techniques
Through-the-cycle model (continuous time): macro-economic conditions and other effects are
used, but the risk is seen as an independent event

Reduced-form model framework: a statistical approach estimating probability of default as an

independent Poisson-based event

CREDIT RISK MODELING IN PYTHON

Choosing models
Many machine learning models available, but logistic regression and tree models were used
These models are simple and explainable

Their performance on probabilities is acceptable

Many nancial sectors prefer model interpretability

Complex or "black-box" models are a risk because the business cannot explain their decisions
fully

Deep neural networks are often too complex

CREDIT RISK MODELING IN PYTHON

Tips from me to you
Focus on the data
Gather as much data as possible

Use many different techniques to prepare and enhance the data

Learn about the business

Increase value through data

Model complexity can be a two-edged sword

Really complex models may perform well, but are seen as a "black-box"

In many cases, business users will not accept a model they cannot understand

Complex models can be very large and dif cult to put into production

CREDIT RISK MODELING IN PYTHON

Thank you!
CREDIT RIS K MODELIN G IN P YTH ON

Credit Risk Modeling in Python Chapter3
No ratings yet
Credit Risk Modeling in Python Chapter3
35 pages
Logit Model For PD
No ratings yet
Logit Model For PD
9 pages
EXL Acquisition Scorecard Reject Inference Methodologies
No ratings yet
EXL Acquisition Scorecard Reject Inference Methodologies
13 pages
Credit Risk Assessment
100% (5)
Credit Risk Assessment
115 pages
Credit Risk Analytics: Measurement Techniques, Applications, and Examples in SAS
From Everand
Credit Risk Analytics: Measurement Techniques, Applications, and Examples in SAS
Bart Baesens
No ratings yet
F - 8480 - Operation Manual New 2015 - 1
No ratings yet
F - 8480 - Operation Manual New 2015 - 1
57 pages
Ifrs 9 Ecl Template General Approach
No ratings yet
Ifrs 9 Ecl Template General Approach
2 pages
Basics of Credit Risk Modelling
100% (1)
Basics of Credit Risk Modelling
13 pages
Market Risk Questions PDF
No ratings yet
Market Risk Questions PDF
16 pages
JD - Credit Model Validations
No ratings yet
JD - Credit Model Validations
2 pages
Model Risk Management with SAS
From Everand
Model Risk Management with SAS
SAS Institute Inc.
No ratings yet
Designing Machine Learning Workflows in Python Chapter2
No ratings yet
Designing Machine Learning Workflows in Python Chapter2
39 pages
Analyzing IoT Data in Python Chapter3
No ratings yet
Analyzing IoT Data in Python Chapter3
30 pages
Djj10013-Engineering Drawing: Title: Autocad Report
No ratings yet
Djj10013-Engineering Drawing: Title: Autocad Report
23 pages
Credit Risk Modeling in Python Chapter1
100% (1)
Credit Risk Modeling in Python Chapter1
27 pages
Credit Risk Modeling in R
100% (2)
Credit Risk Modeling in R
66 pages
Credit Risk Predictive Modelling - by EY
0% (1)
Credit Risk Predictive Modelling - by EY
37 pages
Credit Risk Models
No ratings yet
Credit Risk Models
32 pages
Credit Risk Modeling in Python Chapter2
100% (1)
Credit Risk Modeling in Python Chapter2
36 pages
Redesigning Credit Risk Modeling to Achieve Profit and Volatility Targets
From Everand
Redesigning Credit Risk Modeling to Achieve Profit and Volatility Targets
Joseph L. Breeden
No ratings yet
Validators Guide To Model Risk Management by RiskSpan
100% (4)
Validators Guide To Model Risk Management by RiskSpan
29 pages
Credit Risk Modelling Final
No ratings yet
Credit Risk Modelling Final
135 pages
Credit Risk Estimation Techniques
0% (1)
Credit Risk Estimation Techniques
31 pages
Models For PD LGD Ead
100% (2)
Models For PD LGD Ead
38 pages
Credit Risk Modeling Steps
No ratings yet
Credit Risk Modeling Steps
81 pages
SMEs Credit Risk Modelling For PDF
No ratings yet
SMEs Credit Risk Modelling For PDF
270 pages
Credit Risk Modeling
No ratings yet
Credit Risk Modeling
213 pages
Credit Risk Modeling
No ratings yet
Credit Risk Modeling
4 pages
Credit Risk S1
100% (1)
Credit Risk S1
33 pages
Credit Risk Sas
No ratings yet
Credit Risk Sas
152 pages
Probability of Default
100% (1)
Probability of Default
5 pages
Risk Models
100% (1)
Risk Models
20 pages
Modeling of EAD and LGD: Empirical Approaches and Technical Implementation
100% (1)
Modeling of EAD and LGD: Empirical Approaches and Technical Implementation
21 pages
Credit Risk Modelling
100% (3)
Credit Risk Modelling
40 pages
Loan Pricing
No ratings yet
Loan Pricing
39 pages
Point-In-Time (PIT) LGD and EAD Models For IFRS9/CECL and Stress Testing
No ratings yet
Point-In-Time (PIT) LGD and EAD Models For IFRS9/CECL and Stress Testing
16 pages
Estimation of Probability of Defaults (PD) For Low Default Portfolios An Actuarial Approach
100% (2)
Estimation of Probability of Defaults (PD) For Low Default Portfolios An Actuarial Approach
47 pages
Credit Risk Irb Approach2
No ratings yet
Credit Risk Irb Approach2
232 pages
Online Credit Risk Analytics and Modeling
0% (2)
Online Credit Risk Analytics and Modeling
7 pages
Model Risk Tiering
100% (2)
Model Risk Tiering
32 pages
Managing Credit Risk
No ratings yet
Managing Credit Risk
44 pages
Credit Risk Irb Model
100% (1)
Credit Risk Irb Model
66 pages
Financial Risk Analysis: Great Learning PGPBABI 2017
No ratings yet
Financial Risk Analysis: Great Learning PGPBABI 2017
25 pages
Credit Risk Modeling New - New1
No ratings yet
Credit Risk Modeling New - New1
41 pages
AnalytixWise - Risk Analytics Unit 3 Credit Risk Analytics
100% (1)
AnalytixWise - Risk Analytics Unit 3 Credit Risk Analytics
29 pages
106 - Machine Learning and Credit Risk Modelling
100% (1)
106 - Machine Learning and Credit Risk Modelling
8 pages
Model Management Guidance PDF
No ratings yet
Model Management Guidance PDF
70 pages
Basel III
No ratings yet
Basel III
31 pages
Deloitte Approach On IRRBB
No ratings yet
Deloitte Approach On IRRBB
4 pages
Loan Prediction
No ratings yet
Loan Prediction
37 pages
How To Credit Score With Predictive Analytics: Whitepaper
No ratings yet
How To Credit Score With Predictive Analytics: Whitepaper
7 pages
ICAAP Overview Core Concepts Toc
No ratings yet
ICAAP Overview Core Concepts Toc
3 pages
The Basel Ii "Use Test" - a Retail Credit Approach: Developing and Implementing Effective Retail Credit Risk Strategies Using Basel Ii
From Everand
The Basel Ii "Use Test" - a Retail Credit Approach: Developing and Implementing Effective Retail Credit Risk Strategies Using Basel Ii
Stephen D. Morris
No ratings yet
Practical Data Analytics for BFSI
From Everand
Practical Data Analytics for BFSI
Mr. Bharat Sikka
No ratings yet
Credit Risk Modeling Using Python
No ratings yet
Credit Risk Modeling Using Python
133 pages
FRA Cheat Sheet Week1
No ratings yet
FRA Cheat Sheet Week1
2 pages
EasyChair Preprint 8693
No ratings yet
EasyChair Preprint 8693
22 pages
ch4 PDF
No ratings yet
ch4 PDF
32 pages
Machine Learning Paper BD
No ratings yet
Machine Learning Paper BD
16 pages
Credit Risk - Predictive Modelling
No ratings yet
Credit Risk - Predictive Modelling
47 pages
DMMLM - Risk Score Prediction Model
No ratings yet
DMMLM - Risk Score Prediction Model
28 pages
12113667 an Kit
No ratings yet
12113667 an Kit
12 pages
Credit Risk Prediction Presentation
No ratings yet
Credit Risk Prediction Presentation
11 pages
SSRN Id3769854
No ratings yet
SSRN Id3769854
8 pages
Spoken Language Processing in Python Chapter4
No ratings yet
Spoken Language Processing in Python Chapter4
46 pages
Spoken Language Processing in Python Chapter2
No ratings yet
Spoken Language Processing in Python Chapter2
23 pages
Chapter3 PDF
No ratings yet
Chapter3 PDF
36 pages
Spoken Language Processing in Python Chapter3
No ratings yet
Spoken Language Processing in Python Chapter3
26 pages
Preparing Your Gures To Share With Others: Ariel Rokem
No ratings yet
Preparing Your Gures To Share With Others: Ariel Rokem
35 pages
Introduction To Data Visualization With Seaborn Chapter2
No ratings yet
Introduction To Data Visualization With Seaborn Chapter2
38 pages
Introduction To Data Visualization With Matplotlib Chapter2
No ratings yet
Introduction To Data Visualization With Matplotlib Chapter2
27 pages
Spoken Language Processing in Python Chapter1
No ratings yet
Spoken Language Processing in Python Chapter1
17 pages
Changing Plot Style and Color: Erin Case
No ratings yet
Changing Plot Style and Color: Erin Case
54 pages
Designing Machine Learning Workflows in Python Chapter3
No ratings yet
Designing Machine Learning Workflows in Python Chapter3
42 pages
Customer Segmentation in Python Chapter4
No ratings yet
Customer Segmentation in Python Chapter4
37 pages
Designing Machine Learning Workflows in Python Chapter4
No ratings yet
Designing Machine Learning Workflows in Python Chapter4
38 pages
Cleaning Data With PySpark Chapter3
No ratings yet
Cleaning Data With PySpark Chapter3
25 pages
Designing Machine Learning Workflows in Python Chapter1
No ratings yet
Designing Machine Learning Workflows in Python Chapter1
32 pages
Introduction To Data Visualization With Matplotlib: Ariel Rokem
No ratings yet
Introduction To Data Visualization With Matplotlib: Ariel Rokem
30 pages
Introduction To Data Visualization With Seaborn Chapter1
No ratings yet
Introduction To Data Visualization With Seaborn Chapter1
26 pages
Introduction To Data Visualization With Seaborn Chapter3
100% (1)
Introduction To Data Visualization With Seaborn Chapter3
32 pages
Credit Risk Modeling in Python Chapter4
100% (1)
Credit Risk Modeling in Python Chapter4
35 pages
Customer Segmentation in Python Chapter3
No ratings yet
Customer Segmentation in Python Chapter3
25 pages
Building Chatbots in Python Chapter4
No ratings yet
Building Chatbots in Python Chapter4
20 pages
Cleaning Data With PySpark Chapter4
No ratings yet
Cleaning Data With PySpark Chapter4
23 pages
Cleaning Data With PySpark Chapter2
100% (1)
Cleaning Data With PySpark Chapter2
25 pages
Cleaning Data With PySpark Chapter1
0% (1)
Cleaning Data With PySpark Chapter1
20 pages
Building Chatbots in Python Chapter2 PDF
No ratings yet
Building Chatbots in Python Chapter2 PDF
41 pages
Analyzing IoT Data in Python Chapter4
No ratings yet
Analyzing IoT Data in Python Chapter4
34 pages
Analyzing IoT Data in Python Chapter2
No ratings yet
Analyzing IoT Data in Python Chapter2
35 pages
Analyzing IoT Data in Python Chapter1
100% (1)
Analyzing IoT Data in Python Chapter1
27 pages
Petrol Engine Overhaul
No ratings yet
Petrol Engine Overhaul
47 pages
Ali Et Al. - 2023 - The Enlightening Role of Explainable Artificial Intelligence in Medical & Healthcare Domains A Syst
No ratings yet
Ali Et Al. - 2023 - The Enlightening Role of Explainable Artificial Intelligence in Medical & Healthcare Domains A Syst
19 pages
Google Keyword Planner Notes
No ratings yet
Google Keyword Planner Notes
32 pages
Educator Workforce Data Report
No ratings yet
Educator Workforce Data Report
92 pages
ErationCard - SPHH - RationCardNo - 40296236 - 43850684 - 26 - 06 - 2023 11 - 32 - 14
No ratings yet
ErationCard - SPHH - RationCardNo - 40296236 - 43850684 - 26 - 06 - 2023 11 - 32 - 14
1 page
Balanced Pressure Systems
No ratings yet
Balanced Pressure Systems
5 pages
LCD Tutorial PDF
No ratings yet
LCD Tutorial PDF
46 pages
OOSE Project Documentation Preparation Template
No ratings yet
OOSE Project Documentation Preparation Template
2 pages
Seeq UseCase Continuous Process Verification
No ratings yet
Seeq UseCase Continuous Process Verification
2 pages
Nagma Thakor CV (PM)
No ratings yet
Nagma Thakor CV (PM)
4 pages
Dokumen - Tips - The 214 Traditional Kanji Radicals and Their Meanings
No ratings yet
Dokumen - Tips - The 214 Traditional Kanji Radicals and Their Meanings
15 pages
SCORM12 AICC Troubleshooting Guide
No ratings yet
SCORM12 AICC Troubleshooting Guide
17 pages
Generator Pamphlet PDF
No ratings yet
Generator Pamphlet PDF
4 pages
HPCN Chapter 1 Packet Switched Network HPCN Forouza
100% (1)
HPCN Chapter 1 Packet Switched Network HPCN Forouza
109 pages
Andor Istar 334 Specifications
No ratings yet
Andor Istar 334 Specifications
7 pages
Enlighted Advanced Lighting Controls Brochure
No ratings yet
Enlighted Advanced Lighting Controls Brochure
8 pages
40 - Specification - 04400 - Stone Masonry
100% (1)
40 - Specification - 04400 - Stone Masonry
4 pages
Samsara Patent
No ratings yet
Samsara Patent
22 pages
Project Management For Managers: Lec - 19 Risk Management-I
No ratings yet
Project Management For Managers: Lec - 19 Risk Management-I
11 pages
Social Media Marketing PHD Thesis PDF
100% (3)
Social Media Marketing PHD Thesis PDF
8 pages
Group 4 The Influence of YouTube Food Vlog Reviews For Viewers Purchase Intention
100% (1)
Group 4 The Influence of YouTube Food Vlog Reviews For Viewers Purchase Intention
69 pages
Risk Mitigation, Monitoring, and Management (RMMM) Plan: Module-6
No ratings yet
Risk Mitigation, Monitoring, and Management (RMMM) Plan: Module-6
7 pages
BS en 12350-4 (2009)
No ratings yet
BS en 12350-4 (2009)
12 pages
First Contact With Tensor Flow PDF
100% (2)
First Contact With Tensor Flow PDF
136 pages
Accreditation of CPD Program
No ratings yet
Accreditation of CPD Program
14 pages
Project Charter
No ratings yet
Project Charter
4 pages
Laboratory Manual: Analogue and Digital Communication Lab
No ratings yet
Laboratory Manual: Analogue and Digital Communication Lab
18 pages
Blue:: Mak Advanced Technician Exam
No ratings yet
Blue:: Mak Advanced Technician Exam
1 page