top of page

🎯 Course Objective

The objective of this course is to provide banking and risk management professionals with a comprehensive understanding of internal models for credit risk, their validation frameworks, and the management of model risk in line with ECB guide to internal models July 2025. The course also explores the challenges and opportunities of applying artificial intelligence (AI) and machine learning (ML) within internal models, with a focus on governance, explainability, and regulatory compliance.

📘 Course Description

Internal Models for Credit Risk: Validation and Model Risk under AI & ML is a specialized program designed for professionals involved in credit risk modeling, model validation, model risk management, and regulatory compliance.

The course covers the design, validation, and oversight of internal models for Probability of Default (PD), Loss Given Default (LGD), and Exposure at Default (EAD) under the IRB approach, with emphasis on supervisory principles from the ECB Guide to Internal Models (2025). It also covers the same parameters for IFRS 9 impairment.

 

Participants will learn how to:

  • Implement robust model governance and risk management frameworks.

  • Conduct independent validation using statistical tests, benchmarking, back-testing, and challenger models.

  • Apply margin of conservatism (MoC) in response to model deficiencies.

  • Integrate AI and ML techniques into internal models while addressing supervisory concerns about complexity, explainability, bias, and drift.

  • Utilize explainable AI (XAI) methods such as SHAP and LIME to validate complex ML models.

  • Understand the role of validation, audit, and governance in mitigating model risk throughout the model lifecycle.

  • Explain the pioneering model risk directive SR 11-7 in the US, the internal model review directive, TRIM, in the European Union, EU, and other important model risk and validation directives such as the estimation directive of PD and LGD and treatment of EBA default exposures.

  • Explain the use of artificial intelligence for model validation.

  • Techniques are shown to achieve the automation of the Construction and Calibration of Models through Artificial Intelligence.

  • Indicate the best practices for validation of internal credit risk models using Machine Learning.

  • Show model risk quantification techniques in credit scoring models, PD, LGD and EAD parameters and regulatory and economic capital.

  • Explore credit scoring validation techniques, and others such as discriminant power, stability tests and backtesting.

  • Offer a very significant number of econometric and machine learning methodologies to develop credit scoring, PD, LGD and EAD models under the IRB and IFRS9 approaches.

  • Explain methodologies to develop models of economic capital and stress testing.

  • Present validation techniques for economic and regulatory capital models.

  • Show a significant number of validation techniques for econometric models and time series used in stress testing.

  • Modeling the stress testing of the PD, LGD and transition matrices of consumer and corporate portfolios.

  • Show innovative stress testing validation techniques.

  • Explain and detect model risk in stress testing.

 

Through a combination of theory, regulatory insights, and hands-on coding exercises in Python and R, participants will gain the skills to design, validate, and manage both traditional and ML-driven internal models for credit risk.

By the end of the course, learners will be equipped to bridge regulatory requirements and innovation, ensuring that credit risk models remain compliant, explainable, and fit for purpose in a rapidly evolving banking environment.

 

WHO SHOULD ATTEND?

This program is aimed at managers, analysts and credit risk consultants. Particularly, to professionals of model risk, model validation and model auditing. For a better understanding of the topics, it is recommended that the participant have knowledge of statistics. The course contains exercises in SAS, R and Excel.

fondo-azul-degradado-lujo-abstracto-azul-oscuro-liso-banner-estudio-vineta-negra.jpg
Time.png

Schedules:

  • Europe: Mon-Fri, CEST 16-19 h

 

  • America: Mon-Fri, CDT 18-21 h

  • Asia: Mon-Fri, IST 18-21 h

 

 

 

 

Price.png

Price: 7 900 €

Online Live Course

Price.png

Price: 4 900 €

Self-Paced MasterClass

Level.png

Level: Advanced

Duration.png

Duration: 40 h

Material.png

Material: 

Presentations PDF

Exercises in R, Python, SAS and Excel

gesto-de-hombre-de-negocios-que-protege-el-grafico-y-el-grafico-de-inversion-virtual-cada-

Internal Models for Credit Risk: Validation and Model Risk 

under AI & ML

fondo-azul-degradado-lujo-abstracto-azul-oscuro-liso-banner-estudio-vineta-negra.jpg

Syllabus

Internal Models for Credit Risk: Validation and Model Risk under AI & ML

Anchor 12

CREDIT RISK INTERNAL MODELS

 

Module 1: Credit Risk Internal Models &

Model Risk in Machine Learning

  • ECB guide to internal models July 2025

  • Introduction and Supervisory Foundations

  • Foreword & purpose of the ECB Guide.

  • Legal framework: CRR, CRR2/CRR3, CRD, RTS/ITS.

  • TRIM legacy and harmonisation of practices.

  • Role of JSTs and supervisory investigations.
    Exercise: Map EU regulations to internal model approval steps.

  •  Overarching Principles of Internal Models

  • Group-wide implementation & consistency.

  • Documentation requirements & model register.

  • Data governance expectations (DORA, BCBS 239).

  • Model risk management framework (definition, policies, lifecycle).

  • Role of management body & senior management.
    Exercise: Build a model risk framework checklist.

  • Internal Validation Principles

  • Independence & proportionality of validation.

  • Frequency of initial and annual validations.

  • Separation of duties between development and validation.

  • Resource & skill requirements.
    Case Study: Validation independence in a medium-sized bank.

  •  Internal Audit and Oversight

  • Scope of internal audit in internal models.

  • Independence, resources, and reporting lines.

  • Interaction with validation.

  • Audit documentation and follow-up.

  • Climate-Related and Environmental Risks

  • ECB guidance on integrating climate/ESG in internal models.

  • Materiality assessments.

  • Examples of climate risk transmission channels.
    Exercise: Extend an LGD model with climate-related variables.

  •  Machine Learning in Internal Models

  • Supervisory stance on ML models (complexity, opacity, drift).

  • Classification as “highly complex” or “dynamic” models.

  • Governance and skill requirements across lines of defence.

  • Data quality, bias, explainability, and ethics.
    Lab: Train an ML PD model (Python) & apply SHAP for explainability.

  • Credit Risk Models – Governance and Validation

  • Scope & roll-out of IRB models.

  • Internal governance structures.

  • Model use tests and limitations.

  • Change management for IRB models.

  • Data maintenance and documentation.
    Exercise: Create an IRB model governance flowchart.

  •  Definition of Default

  • Consistency in default definition.

  • Days past due criterion.

  • Unlikeliness to pay indicators.

  • Return to non-defaulted status.

  • Adjustments after definition changes.
    Case Study: Recalibrating PDs after default definition revision.

  •  Estimation of Credit Risk Parameters

  • Use of data for estimation.

  • Probability of Default (PD): calibration, discriminatory power.

  • Loss Given Default (LGD): downturn, discounting, recovery data.

  • EAD/Conversion Factors: estimation, credit lines, derivatives.

  • Margin of Conservatism (MoC): principles and application.

  • Review and recalibration of estimates.

  • Maturity calculation for non-retail exposures.
    Lab: Calibrate PD and LGD using historical data in Python/R.

  • Model Risk Governance & ICAAP Integration

  • Classification of model complexity/materiality.

  • MoC and its impact on capital.

  • Link between validation, internal audit, and ICAAP/ILAAP.

  • Supervisory review process (SREP).

  • Third-party involvement and outsourcing risks.
     

MODEL RISK 

 

​Module 2: Model Risk Process

  • Model validation process

  • Definition of the objective and methodology of the model

  • Review of model memories

  • validation plan

  • Validation conclusions

  • Model risk sub-risks:

    • Model risk in the data

    • Model risk in the methodology

    • Model risk in implementation

    • Risk in model results

  • Model governance

  • Controls at each sub-risk level

  • Analysis and validation of model documentation

  • Purpose of the model

  • Data

  • Design and methodology

  • Specification and estimate

  • Evidence

  • Implementation

  • monitoring

  • operational controls

  • reporting

  • Control panel

  • Solutions and technology necessary for model risk management

  • Case study 3: Model risk management for credit risk models, validation and documentation review

EU and US Model Risk Regulation

 

​Module 3: Model Risk Management Directive SR 11-7 in USA

  • Introduction

  • Scope and purpose

  • Model Risk Management

  • Model development, implementation and use

  • Model validation

  • Concept strength assessment and development testing

  • Permanent monitoring, verification of processes and Benchmarking

  • Analysis of the results, including Backtesting

  • Governance, Policies and Controls

  • Politics and procedures

  • Roles and responsibilities

  • Internal audit

  • model inventory

Module 4: Guide for the Targeted Review of Internal Models (TRIM) EU

  • Scope and objectives of the guide

  • General principles of internal models Roll-out and PPU

  • governance

  • Internal audit

  • internal validation

  • Model Usage

  • Management of model changes

  • Data quality

  • Third parties Participation

  • Credit risk

    • Scope of the credit risk guide

    • Data requirements

  • Probability of default (PD)

    • Structure of PD models

    • main drivers

    • Pool distribution

    • Rating Philosophy

    • Calculation of the default rate

    • Calculation of long-term mean PD

  • Loss Given Default (LGD)

  • Credit conversion factor (CCF)

  • Model-related conservatism margin

  • Review of estimates

Module 5: Validation of Models in practice

  • Lessons learned from the financial crisis on validation​

  • Validation Framework

  • Validation Definition

  • Validation principles

  • Roles and responsibilities

  • Scope and frequency

  • Validation Process

  • Internal Governance

  • Validation of IRB Models

  • Qualitative Validation

    • model design

    • data quality

    • Use Test

  • Quantitative Validation

    • backtesting

    • Discriminating Power

    • Stability Tests

  • Technological infrastructure

  • Required documentation

  • Internal validation department and team

  • Audit department and team

  • artificial intelligence

    • Autonomous validation, reconstruction and recalibration

    • Model Validation using Machine Learning

    • Artificial intelligence for model risk

    • Artificial intelligence to validate models

​​

Module 6: Model Risk Management

  • Governance in model risk

    • Role of the board and senior management

    • Role of the risk department

    • Responsible and creators of the model

    • Model risk committees

    • Internal audit

    • Policy definition

  • Organization

    • Three lines of defense

    • Internal and external communication

  • Model Life Cycle Phases

    • ID

    • Model Inventory

    • Classification

    • Levels or "Tiering" according to materiality, sophistication and impact

    • Planning and development

    • internal validation

    • Documentation

    • pattern approval

    • Implementation and use of the model

    • Model monitoring

    • International best practice risk management model

  • Model Risk Mitigation

Data quality test

Validation and audit

Benchmarks

What-if sensitivity tests

Stress testing

backtesting

  • Model risk appetite

    • Model risk appetite statement

    • risk tolerance

  • Qualitative measurement of model risk

    • Creation of the Model Risk Scorecard

    • Definition of scale and ranges

    • International Scorecard Best Practices

  • Case study: Scorecard for model risk

 

 

Quantification of Model Risk

​​

Module 7: Quantitative Measurement of Model Risk

  • The importance of quantifying model risk

  • Best international classifications

  • Optimal and non-optimal models

  • Categories of model risk:

  • Data deficiency

  • Model parameter uncertainty

  • Inappropriate use of the model

  • Model specification

  • Change in the dynamics of the financial and economic environment

  • Incorrect model implementation, misinterpretation of model output, and other errors

  • Model risk aggregation

Module 8: Models for Quantifying Model Risk

  • Why does this type of model risk occur?

  • Examples

  • Magnitude of model error related to human error

  • Qualitative measurement of model risk

  • Scorecard model

  • Creation of the model risk scorecard

  • Definition of scale and ranges

  • International best practices for scorecards

  • LDA approach to model risk measurement

  • Loss Distribution Approach (LDA)

  • Available data

  • Scenario Based Approach (SBA)

  • Scenario generation

  • Scenario assessment

  • Data quality and validation

  • Parameter determination

  • Frequency Distributions

  • Severity Distributions

  • Distribution of losses due to model risk

  • Monte Carlo Simulation

  • Panjer Approach

  • FFT

  • Bottom-up model uncertainty approach

  • Sources of information

  • Exercise 1: Estimation of model risk scorecard, establishment of scale and ranges.

  • Exercise 2: Estimation of model risk using the following frequency and severity distributions:

  • Frequency

    • Poisson and Negative binomial

  • Severity

    • LognormalGammaWeibullInverse Gaussian

    • GDP EVTG-H 4 parameter, Mixture of lognormals

    • Lognormal-EVTPoisson-Gamma Bayesian Approach, Lognormal Partition and GDP

  • Scenarios with Expert Criteria

  • Exercise 3: Selection of the best distribution using Cramer Von Misses, AD, and KS goodness-of-fit tests

  • Exercise 4: Comparison of model risk using Recursive Panjer, Fast Fourier Transformation, and Monte Carlo Simulation

Model Risk in Credit Scoring

 

​Module 9: Model risk in Credit scoring

​​

  • Dimension and use of materiality

  • Classification of scoring models by importance within the financial institution

  • Impact of the model on the entity

  • model dependency

  • Model limitations

  • Model governance

  • Documentation and review

  • Implementation

  • operational controls

  • Decision tree to assess rating and scoring models

  • Casuistry in expert Credit Rating models

  • Casuistry in statistical credit scoring models

  • Model risk in big data

  • Risk model by machine learning and black box

Module 10: Exploratory data analysis EDA and Binning 

  • Exploratory data analysis

  • Data typology

  • transactional data

  • Unstructured data embedded in text documents

  • Social Media Data

  • data sources

  • Data review

  • Target definition

  • Time horizon of the target variable

  • Sampling

    • Random Sampling

    • Stratified Sampling

    • Rebalanced Sampling

  • Exploratory Analysis:

    • histograms

    • Q Q Plot

    • Moment analysis

    • boxplot

  • Treatment of Missing values

    • imputation

    • Delete

    • Keep

  • Advanced Outlier detection and treatment techniques

    • Z-Score

    • Mahalanobis Distance

  • Data Standardization

  • Binning

  • Variable categorization

    • Equal Interval Binning

    • Equal Frequency Binning

    • Chi-Square Test

  • binary coding

  • WOE Coding

    • WOE Definition

    • Univariate Analysis with Target variable

    • Variable Selection

    • Treatment of Continuous Variables

    • Treatment of Categorical Variables

    • Gini and KS

    • Information Value

    • Optimization of continuous variables

    • Optimization of categorical variables

  • ​Exercise 5: Exploratory Analysis in R

  • Exercise 6: Detection and treatment of Outliers in R

  • Exercise 7: Missing imputation techniques in R

  • Exercise 8: Stratified and Random Sampling

  • Exercise 9: Analysis of the Weight of Evidence in Excel

  • Exercise 10: Univariate analysis in percentiles in R

  • Exercise 11: Continuous variable optimal univariate analysis in R

  • Exercise 12: KS, Gini and IV validation of each variable 

  • Exercise 14: Optimizing categorical variables in R

  • Exercise 15: Univariate Analysis with decision trees in R

  • Exercise 16: Segmentation using K means Clustering in R

MACHINE LEARNING MODELS

Module 11: Machine Learning

  • Unsupervised models​

  • K Means

  • Principal Component Analysis (PCA)

  • Advanced PCA Visualization​​

  • Supervised Models

  • Ensemble Learning

  • Bagging trees

  • Random Forest

  • Boosting

  • Adaboost

  • Gradient Boosting Trees

Deep Learning

 

Module 12: Deep Learning 

  • Feed Forward Neural Networks

  • Single Layer Perceptron

  • Multiple Layer Perceptron

  • Neural network architectures

  • Activation function​

  • Back propagation

    • Directional derivatives

    • gradients

    • Jacobians

    • Chain rule

    • Optimization and local and global minima

  • Deep Learning Convolutional Neural Networks CNN​​

    • CNN for pictures

    • Design and architectures

    • convolution operation

    • filters

    • strider

    • padding

    • Subsampling

    • pooling

    • fully connected

    • Credit Scoring using CNN

    • Recent CNN studies applied to credit risk and scoring

  • Deep Learning Recurrent Neural Networks RNN​​

    • Long Term Short Term Memory (LSTM)

    • Hopfield

    • Bidirectional associative memory

    • descending gradient

    • Global optimization methods

    • RNN and LSTM for credit scoring

    • One-way and two-way models​​

  • Generative Adversarial Networks (GANs)

    • Generative Adversarial Networks (GANs)

    • Fundamental components of the GANs

    • GAN architectures

    • Bidirectional GAN

    • Training generative models

    • Credit Scoring using GANs

  •  

Module 15: Generative AI​

  • Introducing generative AI

  • What is Generative AI?

  • Generative AI Models

    • Generative Pre- trained Transformer (GPT)

  • Text generation, Image generation, Music generation, Video generation

  • Generating text

  • Generating Code

  • Ability to solve logic problems​

  • Enterprise Use Cases for Generative AI

  • Overview of Large Language Models (LLMs)

    • Transformer Architecture

    • Types of LLMs

    • Open-Source vs. Commercial LLMs

    • Key Concepts of LLMs

  • Prompts

  • Tokens

  • Embeddings

  • Model configuration

  • Prompt Engineering

  • Model adaptation

  • Specifying multiple Dataframes to ChatGPT

  • Exercise 17: Embeddings for words, sentences, question answers

  • Exercise 18: Embeddings on Large Dataset

  • Exercise 19: Advanced Prompting Techniques

  • Exercise 20: Large Language Models in Credit Rating

Model risk in Credit Scoring

Module 16: Multivariate models and Machine Learning

 

  • Multivariate Models

    • Logistic regression

    • Cox Regression

    • Model Risk

  • ​Machine Learning

    • decision trees

    • neural networks

    • SVM

    • Ensemble Learning

    • bagging

    • Boosting

    • Random Forest

  • Deep Learning

    • CNN

    • LSTM

    • GAN

    • LLM

  • Model Risk in Machine Learning​

    • overfitting

    • Transparency

    • failed sampling

    • important variables

  • Exercise 21: Logistic Regression, stepwise method in R

  • Exercise 22: Piecewise Regression in Excel and R

  • Exercise 23: Decision trees in R

  • Exercise 24: Support Vector Machine in R

  • Exercise 25: Neural Networks in R

  • Exercise 26: Ensemble models in R

  • Exercise 27: Random Forest in R

  • Exercise 28: Bagging in R

  • Exercise 29: Credit scoring using deep learning CNN Python

  • Exercise 30: Credit Scoring using Deep Learning LSTM Python

  • Exercise 31: Credit Scoring using GANs Python

  • Exercise 32: Comparison of models of discriminant power between models: Neural Networks, Logistic Regression, Panel Data Logistic Regression and Cox Regression

  • Exercise 33: Model Risk using Confidence Intervals of Logistic Regression Coefficients

Module 14: Tuning Hyperparameters in Deep Learning

  • Hyperparameterization

  • Grid search

  • Random search

  • Bayesian Optimization

  • Train test split ratio

  • Learning rate in optimization algorithms (e.g. gradient descent)

  • Selection of optimization algorithm (e.g., gradient descent, stochastic gradient descent, or Adam optimizer)

  • Activation function selection in a (nn) layer neural network (e.g. Sigmoid, ReLU, Tanh)

  • Selection of loss, cost and custom function

  • Number of hidden layers in an NN

  • Number of activation units in each layer

  • The drop-out rate in nn (dropout probability)

  • Number of iterations (epochs) in training a nn

  • Number of clusters in a clustering task

  • Kernel or filter size in convolutional layers

  • Pooling size

  • Batch size

  • Interpretation of the Shap model

  • Exercise 34: Tuning hyperparameters in Xboosting, Random forest and SVM models for credit scoring 

  • Exercise 35: Tuning hyperparameters in Deep Learning model for credit scoring

​Module 17: Model Risk in the Scorecard

  • Scoring assignment

  • Scorecard Classification

    • Scorecard WOE

    • Binary Scorecard

    • Continuous Scorecard

  • Scorecard Rescaling

    • Factor and Offset Analysis

    • Scorecard WOE

    • Binary Scorecard

  • Reject Inference Techniques

    • cut-off

    • parceling

    • Fuzzy Augmentation

  • Advanced Cut Point Techniques

    • Cut-off optimization using ROC curves

  • Model risk by cut point decision

  • Model risk due to lack of data

  • Model Risk for not updating or recalibrating

  • Exercise 36: Construction of Scorecard in Excel

  • Exercise 37: Optimum cut-off point estimation in Excel and model risk by cut-off point selection

  • Exercise 38: Confusion matrix to verify Type 1 and Type 2 Error in Excel with and without variables

  • Exercise 39: Model risk in credit scoring due to not recalibrating on time

 

Model Validation

Validation of Credit Scoring models

​Module 18: Stability tests

  • Model stability index

  • Factor stability index

  • Xi-square test

  • K-S test

  • Exercise 40: Stability tests of models and factors

Module 19: Validation of traditional and

 Machine Learning models

  • Out of Sample and Out of time validation

  • Checking p-values in regressions

  • R squared, MSE, MAD

  • Waste diagnosis

  • Goodness of Fit Test

    • deviation

    • Bayesian Information Criterion (BIC)

    • Akaike Information Criterion

  • Multivariate Multicollinearity

  • Cross validation

  • Error bootstrapping

  • Binary case confusion matrix

  • Multinomial case confusion matrix

  • Main discriminant power tests:

    • KS, ROC Curve, Gini Index, Cumulative Accuracy Profile, Kullback-Leibler Distance, Pietra Index, 1-Ph, Conditional Entropy, Information Value, Kendall Tau, Brier Score, Divergence

  • confidence intervals

  • Jackknifing with discriminant power test

  • Bootstrapping with discriminant power test

  • Kappa statistic

  • K-Fold Cross Validation

  • Traffic Light Analysis

  • Exercise 41: Cross validation in R

  • Exercise 42: Gini Estimation, Information Value, Brier Score, Lift Curve, CAP, ROC, Divergence in Excel

  • Exercise 43: Bootstrapping of R parameters

  • Exercise 44: Gini/ROC Bootstrapping in R

  • Exercise 45: Kappa estimation

  • Exercise 46: K-Fold Cross Validation in R

  • Exercise 47: Out of time traffic light validation (horizon 6 years) of Logistics and Machine Learning models

Explainable Artificial Intelligence

Module 20: Explainable Artificial Intelligence XAI

  • Interpretability problem

  • model risk

  • Regulation of the General Data Protection Regulation GDPR

  • EBA discussion paper on machine learning for IRB models

    • 1. The challenge of interpreting the results,

    • 2. The challenge of ensuring that management functions adequately understand the models, and

    • 3. The challenge of justifying the results to supervisors

  • ​Black Box Models vs. Transparent and Interpretable Algorithms

  • interpretability tools

  • Shap, Shapley Additive explanations

    • Global Explanations

    • Dependency Plot

    • Decision Plot

    • Local Explanations Waterfall Plot

  • Lime, agnostic explanations of the local interpretable model

  • Explainer Dashboard

  • Other advanced tools

  • Exercise 48: XAI interpretability of credit scoring

Advanced Validation

 

Module 21: Advanced Validation of AI Models

 

  • Integration of state-of-the-art methods in interpretable machine learning and model diagnosis.

  • Data Pipeline

  • Feature Selection

  • Black-box Models

  • Post-hoc Explainability

  • Global Explainability

  • Local Explainability

  • Model Interpretability

  • Diagnosis: Accuracy, WeakSpot, Overfit, Reliability, Robustness, Resilience, Fairness

  • Model comparison

    • Comparative for Regression and Classification

    • Fairness Comparison

  • Exercise 49: Validation and diagnosis of advanced credit scoring models

MODEL VALIDATION

PD, LGD and EAD validation IRB and IFRS 9

 

Module 22: Directive on the estimation of PD and LGD IRB and defaulted exposures issued by EBA (updated ECB Guide to Internal Models July 2025)

  • European Directive on estimation of PD and LGD, and exposures in default

  • Why is it advisable to consider it in Latin America?

  • Parameter variability reduction

  • Homogenization of the calculation of PD and LGD

  • Implementation dates in European banks

  • Data quality

  • Representativeness of the data for the development of the model and for the calibration of the risk parameters

  • Human judgment for parameter estimation

  • Treatment of deficiencies and margin of conservatism (Moc)

  • PD estimation

  • Model development

  • Data requirement

  • Risk drivers and rating criteria

  • Treatment of external ratings

  • Rating philosophy

  • Pool Treatment

  • PD Calibration

  • Data requirement

  • One Year Default Rate Calculation

  • Calculation and use of the average observed default rate

  • Long-term default rate

  • Calibration of the long-term default rate

  • LGD estimation

  • Methodologies for estimating PD

  • Data requirement

  • Recoveries from collaterals

  • Model development

  • Risk drivers

  • Collateral Eligibility

  • Inclusion of collaterals

  • LGD Calibration

  • Definition of economic loss and realized loss

  • Treatment of commissions, interest and other withdrawals after default

  • Discount rate

  • Direct and indirect costs

  • long-term LGD

  • Long Term LGD Calibration

  • Estimation of risk parameters for exposures in default

  • Estimation and calibration of the Expected Loss Best Estimate

  • Estimation and calibration of LGD in-default

  • Application of risk parameters

  • Review of estimates

  • Accompanying Documents

  • impact assessment

  • Identification of the problem

  • Policy objectives

  • Baseline scenario

  • Options considered

  • Cost-benefit analysis

Module 23: PD templates for IRB and IFRS 9 approach

  • Introduction to Probability of Default

  • Default definition

  • Default Triggers

  • Effective and robust process to detect default

  • Technical defaults and technical default filters

  • Indispensable data model

  • Single Factor Analysis

  • Multifactor Analysis

  • Model Selection

  • Historical PS

  • Econometric and Machine Learning Models of PD

  • Risk factors that affect default

    • macroeconomic

    • idiosyncratic

  • PD Logistic Regression

  • PD COX Regression

  • PD Log-log Complementary

  • PD Logistic Regression Data Panel

  • Machine Learning to estimate PD

  • ​PD Calibration

  • Introduction to Calibration

  • Anchor Point Estimate

  • Mapping from Score to PD

  • Temporal structure of the PD

    • Marginal PD

    • Forward PD

    • Cumulative PD

  • Techniques for Mapping PD's to temporary structure

  • Vintages or vintages of PD

  • Bayesian PD

  • Expert Judgment

  • Prior and posterior distribution

  • Markov Chain Monte Carlo

  • probit model

  • ​Adjustment to the PD Economic Cycle

  • Introduction of Adjustment to the Economic Cycle

  • Directives on the economic cycle in the PD models

  • PD Trough The Cycle (PD TTC) Models

  • Considerations of the Adjustment to the “Scalar Variable” approach cycle

  • PD on Low Default Portfolios

  • PD estimation without correlations

  • PD estimation with correlations

  • LDP Calibration Using CAP Curves

  • Bayesian PD estimation for LDP

  • Default correlation

  • Correlation of defaults and multiperiod

  • Neutral Bayesian and Conservative Bayesian

  • ​Transition and PD Matrices

  • Properties of transition matrices

  • Multi-year transition matrix

    • discrete time

    • continuous time

    • Generating Matrix

    • Exponential of a matrix

  • duration method

  • Cohort method

  • error management

  • IFRS 9 PD Modeling

  • IFRS 9 requirements

  • Probability Weighted

  • Forward Looking

  • Lifetime PD modeling

  • PD Forecasting Modeling

  • PD Point in Time Forecasting

  • Markov chains

  • Exercise 50: Calibration of PD with COX regression in R

  • Exercise 51: PD calibration with log-log complementary in R

  • Exercise 52: PD calibration with logistic model in R

  • Exercise 53: PD calibration logistic Bayesian regression

  • Exercise 54: Calibration of PD regression panel logistic data

  • Exercise 55: Calibration of PD Lasso Regression

  • Exercise 56: Calibration of PD Bayesian Probit regression in R

  • Exercise 57: Transition matrices in Excel and R

  • Exercise 58: Multinomial Regression to estimate PD Lifetime

  • Exercise 59: Multi stage Markov chains in R

Module 24: Backtesting PD IRB and IFRS9

 

  • Validation of the PD in IRB PIT and TTC

  • Validation of PD Lifetime and PD12m in IFRS 9

  • Backtesting PS

  • PD Calibration Validation

  • Hosmer Lameshow test

  • normal test

  • Binomial Test

  • Spiegelhalter test

  • Redelmeier Test

  • Traffic Light Approach

  • Traffic Light Analysis and PD Dashboard

  • PS Stability Test

  • Forecasting PD vs Real PD in time

  • Validation with Monte Carlo simulation

  • Exercise 60: Backtesting PD IRB and PD IFRS 9

  • Exercise 61: Forecasting Estimated PD and Actual PD in Excel

  • Exercise 62: Validation using Monte Carlo Simulation

 

Module 25: LGD Models for IRB and IFRS 9 Approach

 

  • ​BASEL III and EBA LGD

  • LGD estimation and calibration in practice

  • Default LGD estimation

  • LGD Econometric and Machine Learning Models

  • Advantages and disadvantages of LGD Predictive Models

  • Forward Looking models incorporating Macroeconomic variables

  • Parametric and non-parametric models and transformation regressions

  • Linear regression and Beta transformation

  • Linear Regression and Logit Transformation

  • Linear regression and Box Cox transformation

  • Logistic and Linear Regression

  • Logistic and nonlinear regression

  • Censored Regression

  • Generalized Additive Model

  • neural networks

  • SVM

  • Beta regression

  • Inflated beta regression

  • Fractional Response Regression

  • LGD for IFRS 9

  • Comparison of regulatory LGD vs. IFRS 9

  • LGD adjustments

  • Selection of Interest Rates

  • Allocation of Costs

  • floors

  • Treatment of collateral over time

  • Marginal LGD

  • PIT LGD

  • Loss Lifetime Concept

  • exposure treatment

  • Exercise 63: Logistic and linear regression LGD in R

  • Exercise 64: Neural Networks and SVM LGD

  • Exercise 65: Generalized Additived Model LGD in R

  • Exercise 66: Beta Regression Model LGD in R

  • Exercise 67: Long-term LGD calibration

  • Exercise 68: Inflated Beta Regression in SAS

  • Exercise 69: Comparison of the performance of the models using Calibration and precision tests

Module 26: LGD Backtesting

 

  • LGD backtesting in IRB

  • LGD Backtesting in IFRS 9

  • Accuracy ratio

  • absolute accuracy indicator

  • Confidence Intervals

  • transition analysis

  • RR Analysis using Triangles

  • Advanced LGD Backtesting with a vintage approach

  • Backtesting for econometric models

  • ROC, Gini and K-S curve

  • Exercise 70: Comparison of the performance of the models using Calibration and precision tests.

 

Module 27: EAD Models

 

  • Guidelines for estimating CCF

  • Guidelines for Estimating CCF Downturn

  • Temporal horizon

  • Transformations to model the CCF

  • Approaches to Estimating CCF

  • Fixed Horizon approach

  • Cohort Approach

  • Variable focus time horizon

  • CCF Econometric and Machine Learning Models

    • Linear regression

    • Logistic regression

    • Generalized Additive Model

    • neural networks

    • SVM

    • Beta regression

    • Inflated beta regression

    • Fractional Response Regression

  • EAD for IFRS 9

    • Comparison of regulatory EAD vs. IFRS 9

    • Adjustments in the EAD

    • Interest Accrual

    • CCF PIT Estimate

    • Modeling of available lifetime

  • Exercise 56: Estimation and adjustments for EAD IFRS 9 in excel and R

  • Exercise 57: Neural Networks and SVM CCF

  • Exercise 58: Generalized Additived Model CCF in R

  • Exercise 59: Beta Regression Model CCF in R

Module 28: EAD Backtesting

  • EAD validation

  • CCF validation

  • Backtesting EAD and CCF IRB

  • Backtesting of the EAD and CCF IFRS 9

  • r squared

  • Pearson coefficient

  • Spearman correlation

  • Validation using ROC, K-S and Gini

  • Exercise 71: Comparison of the performance of EAD models

IFRS 9 Expected Credit Loss (ECL) Validation

​Module 29: ECL Validation

  • Initial validation

  • Periodic validation

  • Monitoring

  • Main milestones of qualitative validation

    • Data quality

    • Default Definition

    • Relevance of the qualification process

    • Override Analysis

    • environmental dynamics

    • user test

  • Main milestones of quantitative validation

    • Samples used for validation purposes

    • Discriminating Power

    • population stability

    • Characteristic Stability

    • concentration analysis

    • Staging analysis

    • Parameter Calibration

    • ECL backtesting

  • Principle 5 – Validation of the ECL model in Basel III

    • Governance

    • Model inputs

    • model design

    • ​Model output/performance

  • Validation metrics

    • Bayesian/Akaike/Schwartz/Deviance information criteria

    • Receiver operating characteristic (ROC) curve or AUROC statistic

    • Lorenz curve, Gini coefficient, Kolmogorov-Smirnov test

    • T-tests, F-tests, Wald test, log likelihood test

    • RMSE, MAPE, MAD

    • R-squared, Adjusted R-squared

    • Out-of-sample testing

    • benchmarking

    • Population stability index (PSI)

  • Statistical problems

    • Sampling bias

    • Survivorship bias

    • Disproportionately high model weightsT

    • Autoregressive and lagged terms that do not capture macroeconomic effects

    • spurious correlation

    • Smoothing methods that alter data integrity

    • simple linear models in nonlinear relationships

Validation of Regulatory and Economic Capital for Credit Risk

Module 30: Economic Capital Models

  • Definition and Objective

  • Temporal horizon

  • Default Correlation

  • asset mapping

  • unexpected loss

  • Regulatory capital models

  • ASRF Economic Capital Models

  • Business Models

  • Multifactorial Models

  • Economic Capital for retail using Charge off

  • Dependency modeling using copulas

  • VaR estimation and Expected Shortfall

  • Economic capital management

  • Exercise 72: Default correlation matrix in SAS

  • Exercise 73: Correlation of default: consumer portfolios in R

  • Exercise 74: Correlation of assets with EMV and observable data in R

  • Exercise 75: Creditrisk + in R

  • Exercise 76: One-factor model in Excel and R

  • Exercise 77: Multifactorial Model in Excel

  • Exercise 78: T-student in Excel in Excel

  • Exercise 79: Copulas in R

Module 31: Economic capital validation

  • Regulatory Capital Models

  • Economic Capital Models

  • Validation of Credit Portfolio Models

    • Model Design

    • Model Output

    • Processes, data and test of use

  • Loss aggregation validation

  • Validation of Basel models

  • Testing distributions using Berkowitz test

  • Credit Loss Distribution

  • Simulation of the critical chi-square value

  • Berkowitz test in subportfolios

  • power assessment

  • Scope and limits of the test

  • Model risk in economic capital due to uncertainty

  • Exercise 80: Implementation of the Berkowitz test in economic and regulatory capital models

  • Exercise 81: Simulation of losses and model risk in regulatory and economic capital

Validation of Stress Testing Credit Risk

 

Module 32: Forecasting Models

  • Data processing

    • Non-Stationary Series

    • Dickey-Fuller test

    • Cointegration Tests

  • Econometric Models

    • ARIMA models

    • VAR Autoregressive Vector Models

    • ARCH models

    • GARCH models

    • Linear regression

  • ​Machine Learning Models

    • Supported Vector Machine

    • neural network

  • Exercise 82: Nonstationary Series and Cointegration Tests in R and Python

  • Exercise 83: macroeconomic variables with VAR in R

  • Exercise 84: Garch modeling market variables R

  • Exercise 85: Machine Learning SPV and NN modeling in Python

​​

Module 33: Validation of econometric models

  • Review of assumptions of econometric models

  • Review of the coefficients and standard errors of the models

  • Model reliability measures

  • Error management

  • not normal

  • heteroscedasticity

  • Outliers

  • autocorrelation

  • multicollinearity

  • Exercise 86: Measuring Logistic Regression Collinearity

Module 34: Stress Testing Consumer Credit Risk

  • Temporal horizon

  • Multi-period approach

  • Impact on P&L, RWA and Capital

  • Macroeconomic Stress Scenarios in consumption

  • Stress Testing of PD, LGD and EAD

  • Stress Testing of the Transition Matrix

  • Chage Off Stress Testing

  • ​Losses from new impaired assets

  • Losses on old impaired assets

  • ​Exercise 87: Stress Testing PD in Excel and R multifactorial model

  • Exercise 88: Stress Testing PD Multiyear Approach

  • Exercise 89: Stress test of PD and Autoregressive Vectors

  • Exercise 90: Net Charge Off Stress Test

  • Exercise 91: LGD Stress Test

Module 35: Stress Testing Credit Risk Portfolios Corporate

  • Stress Test Methodology for corporate portfolios

  • Creditmetrics and Transition Matrices

  • Credit Index and PD

  • PD simulation and transition matrices

  • Exercise 92: Corporate portfolio stress test

Module 36: Stress Testing of ECL IFRS 9

 

  • Stress testing IFRS 9 and COVID-19

  • Pandemic scenarios applied to the ECL calculation

  • Stress Testing of IFRS 9 parameters

  • EBA Stress Testing 2021

  • Treatment of the moratorium

  • Possible regulatory scenarios

  • Impact on P&L

  • PIT starting parameters

  • PIT projected parameters

  • Calculation of non-productive assets and impairments

  • Changes in the stock of provisions

  • Changes in the stock of provisions for exposures phase S1

  • Changes in the stock of provisions for exposures phase S2

  • Changes in the stock of provisions of exposures phase S3

  • Sovereign Exposure Impairment Losses

  • Impact on capital

  • Internal Stress Testing Model for ECL IFRS 9

  • Exercise 93: Stress Testing of the ECL using matrices and time series R and Excel

​​

Module 37: Validation of Stress Testing

  • Validation of Stress Testing

  • Validation of the Best Case and adverse scenarios

  • Stationarity of variables

  • The signs of economically intuitive coefficients

  • Statistical significance of coefficients

  • confidence levels

  • residual diagnoses

  • Model performance metrics

    • goodness of fit

    • Risk classification

    • cumulative error measures

  • Industry Accepted Thresholds

  • Intuitive sort order

  • Generalized Cross Validation

  • Squared Correlation

  • Root Mean Squared Error

  • Cumulative Percentage Error

  • Akaike Information Criterion

  • Exercise 94: Validation tests of stress testing VAR vs MARS

Agent AI for Model Validation 

Automating the construction and calibration of

Credit Risk models with AI

 

Module 38: Build Automation and Calibration

  • What is modeling automation?

  • that is automated

  • Automation of machine learning processes

  • Optimizers and Evaluators

  • Modeling Automation Workflow Components

    • Summary

    • Indicted

    • Feature engineering

    • Model generation

    • Assessment

  • Hyperparameter optimization

  • Reconstruction or recalibration of credit scoring

  • Credit Scoring Modeling

    • Main milestones

    • Evaluation and optimization

    • Possible Issues

  • PD calibration modeling

    • Evaluation and optimization

    • backtesting

    • Discriminating Power

    • Stability Tests

  • Global evaluation of modeling automation

  • Implementation of modeling automation in banking

  • Technological requirements

  • available tools

  • Benefits and possible ROI estimation

  • Main Issues

  • Model Risk

  • Exercise 95: Automation of modeling and optimization and validation of credit scoring hyperparameters

Module 40: Expanded Module: Agent AI for Model Validation

1. What is an Agent AI in Model Risk?

  • An autonomous or semi-autonomous AI system that uses RAG (retrieval-augmented generation) + monitoring scripts + reporting templates to assist model validation teams.

  • Unlike a static validation script, an Agent AI can:

    • Listen: continuously monitor inputs/outputs of models.

    • Think: apply rules, statistical tests, and supervisory requirements.

    • Act: generate reports, flag anomalies, and escalate issues.

  • Goal: augment human validators, not replace them.

 

2. Functions Across the Validation Lifecycle

 

A. Monitoring

  • Continuous performance tracking (AUC, Gini, KS, Hosmer–Lemeshow).

  • Alerts for drift in input data (Population Stability Index, feature distribution shifts).

  • Watchdog for compliance with “use test” (ensuring model is applied consistently).

 

B. Review

  • Automated documentation checks: ensures PD/LGD/EAD methodologies are properly documented.

  • Cross-references model register vs. current version (alerts on undocumented changes).

  • Reviews MoC justification in light of validation outcomes.

 

C. Analysis

  • Runs challenger models: compares ML models with logistic regression benchmarks.

  • Conducts explainability tests: SHAP, LIME, surrogate linear models.

  • Identifies “black box” risks — where variables drive predictions but lack economic meaning.

 

D. Validation

  • Executes quantitative validation suite:

    • Discrimination: ROC, AUC, Gini.

    • Calibration: binomial test, Hosmer–Lemeshow, calibration curves.

    • Stability: PSI, time stability metrics.

  • Automates out-of-time and stress testing (e.g. COVID dataset vs. pre-COVID).

  • Suggests MoC adjustments when systematic errors are detected.

 

E. Reporting

  • Auto-generates validation reports with:

    • Executive summary (traffic-light system).

    • Tables/graphs of metrics.

    • Supervisory compliance checklist (ECB Guide sections).

  • Creates alerts to governance bodies (Board, Risk Committee).

  • Stores audit logs for ECB/JST review.

 

3. Architecture of an AI Validation Agent

  • Data Ingestion Layer: connects to model outputs, validation datasets, and logs.

  • Validation Engine: runs statistical tests, ML explainability, benchmarking.

  • Rule-based Layer: supervisory compliance checks (ECB validation requirements).

  • RAG Knowledge Layer: retrieves supervisory text (e.g. ECB Guide, SR 11-7, TRIM).

  • LLM Interaction Layer: generates human-readable validation reports.

  • Interface/Dashboard: alerts, monitoring, drill-down analysis.

4. Benefits of Agent AI for Banks

  • Efficiency: reduces manual workload in recurring tests.

  • Consistency: applies the same framework across multiple models and portfolios.

  • Auditability: provides structured logs and transparent documentation.

  • Early warning: detects issues (drift, bias, calibration failures) before supervisors find them.

  • Scalability: supports dozens of models simultaneously across retail, wholesale, IRB portfolios.

 

5. Practical Exercise: Build a Prototype

AI Agent for Validation

 

Objective: simulate an Agent AI that monitors and validates a PD model.

  • Understand how AI agents can support regulatory-aligned validation.

  • Be able to design automated workflows for monitoring and validation.

  • Gain hands-on skills in building a prototype Agent AI in Python/R.

  • Appreciate the governance, benefits, and limitations of automation in model risk.

  1. Dataset: credit scoring dataset (default = 0/1).

  2. Models:

    • Logistic Regression (benchmark).

    • Gradient Boosting (ML challenger).

  3. Agent AI Tasks (to code in Python/R):

    • Monitor: calculate AUC, PSI, KS monthly.

    • Analyse: run SHAP values to explain ML model.

    • Validate: compare calibration (predicted PDs vs. actual defaults).

    • Report: generate an automated validation summary (PDF/Markdown).

  4. Output:

    • Traffic-light dashboard:

      • 🟢 if AUC > 0.70 and PSI < 0.1.

      • 🟡 if moderate drift or small calibration issues.

      • 🔴 if serious underperformance, drift, or poor calibration.

    • A short written validation report auto-produced by the Agent AI.

 

bottom of page