🎯 Course Objective

The objective of this course is to provide banking and risk management professionals with a comprehensive understanding of internal models for credit risk, their validation frameworks, and the management of model risk in line with ECB guide to internal models July 2025. The course also explores the challenges and opportunities of applying artificial intelligence (AI) and machine learning (ML) within internal models, with a focus on governance, explainability, and regulatory compliance.

📘 Course Description

Internal Models for Credit Risk: Validation and Model Risk under AI & ML is a specialized program designed for professionals involved in credit risk modeling, model validation, model risk management, and regulatory compliance.

The course covers the design, validation, and oversight of internal models for Probability of Default (PD), Loss Given Default (LGD), and Exposure at Default (EAD) under the IRB approach, with emphasis on supervisory principles from the ECB Guide to Internal Models (2025). It also covers the same parameters for IFRS 9 impairment.

Participants will learn how to:

Implement robust model governance and risk management frameworks.
Conduct independent validation using statistical tests, benchmarking, back-testing, and challenger models.
Apply margin of conservatism (MoC) in response to model deficiencies.
Integrate AI and ML techniques into internal models while addressing supervisory concerns about complexity, explainability, bias, and drift.
Utilize explainable AI (XAI) methods such as SHAP and LIME to validate complex ML models.
Understand the role of validation, audit, and governance in mitigating model risk throughout the model lifecycle.
Explain the pioneering model risk directive SR 11-7 in the US, the internal model review directive, TRIM, in the European Union, EU, and other important model risk and validation directives such as the estimation directive of PD and LGD and treatment of EBA default exposures.
Explain the use of artificial intelligence for model validation.
Techniques are shown to achieve the automation of the Construction and Calibration of Models through Artificial Intelligence.
Indicate the best practices for validation of internal credit risk models using Machine Learning.
Show model risk quantification techniques in credit scoring models, PD, LGD and EAD parameters and regulatory and economic capital.
Explore credit scoring validation techniques, and others such as discriminant power, stability tests and backtesting.
Offer a very significant number of econometric and machine learning methodologies to develop credit scoring, PD, LGD and EAD models under the IRB and IFRS9 approaches.
Explain methodologies to develop models of economic capital and stress testing.
Present validation techniques for economic and regulatory capital models.
Show a significant number of validation techniques for econometric models and time series used in stress testing.
Modeling the stress testing of the PD, LGD and transition matrices of consumer and corporate portfolios.
Show innovative stress testing validation techniques.
Explain and detect model risk in stress testing.

Through a combination of theory, regulatory insights, and hands-on coding exercises in Python and R, participants will gain the skills to design, validate, and manage both traditional and ML-driven internal models for credit risk.

By the end of the course, learners will be equipped to bridge regulatory requirements and innovation, ensuring that credit risk models remain compliant, explainable, and fit for purpose in a rapidly evolving banking environment.

WHO SHOULD ATTEND?

This program is aimed at managers, analysts and credit risk consultants. Particularly, to professionals of model risk, model validation and model auditing. For a better understanding of the topics, it is recommended that the participant have knowledge of statistics. The course contains exercises in SAS, R and Excel.

fondo-azul-degradado-lujo-abstracto-azul-oscuro-liso-banner-estudio-vineta-negra.jpg

Schedules:

Europe: Mon-Fri, CEST 16-19 h

America: Mon-Fri, CDT 18-21 h

Asia: Mon-Fri, IST 18-21 h

Price: 7 900 €

Online Live Course

Price: 4 900 €

Self-Paced MasterClass

Level: Advanced

Duration: 40 h

Material:

Presentations PDF

Exercises in R, Python, SAS and Excel

gesto-de-hombre-de-negocios-que-protege-el-grafico-y-el-grafico-de-inversion-virtual-cada-

Internal Models for Credit Risk: Validation and Model Risk

under AI & ML

Syllabus

Internal Models for Credit Risk: Validation and Model Risk under AI & ML

Anchor 12

CREDIT RISK INTERNAL MODELS

Module 1: Credit Risk Internal Models &

Model Risk in Machine Learning

ECB guide to internal models July 2025
Introduction and Supervisory Foundations
Foreword & purpose of the ECB Guide.
Legal framework: CRR, CRR2/CRR3, CRD, RTS/ITS.
TRIM legacy and harmonisation of practices.
Role of JSTs and supervisory investigations.
Exercise: Map EU regulations to internal model approval steps.
Overarching Principles of Internal Models
Group-wide implementation & consistency.
Documentation requirements & model register.
Data governance expectations (DORA, BCBS 239).
Model risk management framework (definition, policies, lifecycle).
Role of management body & senior management.
Exercise: Build a model risk framework checklist.
Internal Validation Principles
Independence & proportionality of validation.
Frequency of initial and annual validations.
Separation of duties between development and validation.
Resource & skill requirements.
Case Study: Validation independence in a medium-sized bank.
Internal Audit and Oversight
Scope of internal audit in internal models.
Independence, resources, and reporting lines.
Interaction with validation.
Audit documentation and follow-up.
Climate-Related and Environmental Risks
ECB guidance on integrating climate/ESG in internal models.
Materiality assessments.
Examples of climate risk transmission channels.
Exercise: Extend an LGD model with climate-related variables.
Machine Learning in Internal Models
Supervisory stance on ML models (complexity, opacity, drift).
Classification as “highly complex” or “dynamic” models.
Governance and skill requirements across lines of defence.
Data quality, bias, explainability, and ethics.
Lab: Train an ML PD model (Python) & apply SHAP for explainability.
Credit Risk Models – Governance and Validation
Scope & roll-out of IRB models.
Internal governance structures.
Model use tests and limitations.
Change management for IRB models.
Data maintenance and documentation.
Exercise: Create an IRB model governance flowchart.
Definition of Default
Consistency in default definition.
Days past due criterion.
Unlikeliness to pay indicators.
Return to non-defaulted status.
Adjustments after definition changes.
Case Study: Recalibrating PDs after default definition revision.
Estimation of Credit Risk Parameters
Use of data for estimation.
Probability of Default (PD): calibration, discriminatory power.
Loss Given Default (LGD): downturn, discounting, recovery data.
EAD/Conversion Factors: estimation, credit lines, derivatives.
Margin of Conservatism (MoC): principles and application.
Review and recalibration of estimates.
Maturity calculation for non-retail exposures.
Lab: Calibrate PD and LGD using historical data in Python/R.
Model Risk Governance & ICAAP Integration
Classification of model complexity/materiality.
MoC and its impact on capital.
Link between validation, internal audit, and ICAAP/ILAAP.
Supervisory review process (SREP).
Third-party involvement and outsourcing risks.

MODEL RISK

Module 2: Model Risk Process

Model validation process
Definition of the objective and methodology of the model
Review of model memories
validation plan
Validation conclusions
Model risk sub-risks:
- Model risk in the data
- Model risk in the methodology
- Model risk in implementation
- Risk in model results
Model governance
Controls at each sub-risk level
Analysis and validation of model documentation
Purpose of the model
Data
Design and methodology
Specification and estimate
Evidence
Implementation
monitoring
operational controls
reporting
Control panel
Solutions and technology necessary for model risk management
Case study 3: Model risk management for credit risk models, validation and documentation review

EU and US Model Risk Regulation

Module 3: Model Risk Management Directive SR 11-7 in USA

Introduction
Scope and purpose
Model Risk Management
Model development, implementation and use
Model validation
Concept strength assessment and development testing
Permanent monitoring, verification of processes and Benchmarking
Analysis of the results, including Backtesting
Governance, Policies and Controls
Politics and procedures
Roles and responsibilities
Internal audit
model inventory

Module 4: Guide for the Targeted Review of Internal Models (TRIM) EU

Scope and objectives of the guide
General principles of internal models Roll-out and PPU
governance
Internal audit
internal validation
Model Usage
Management of model changes
Data quality
Third parties Participation
Credit risk
- Scope of the credit risk guide
- Data requirements
Probability of default (PD)
- Structure of PD models
- main drivers
- Pool distribution
- Rating Philosophy
- Calculation of the default rate
- Calculation of long-term mean PD
Loss Given Default (LGD)
Credit conversion factor (CCF)
Model-related conservatism margin
Review of estimates

Module 5: Validation of Models in practice

Lessons learned from the financial crisis on validation
Validation Framework
Validation Definition
Validation principles
Roles and responsibilities
Scope and frequency
Validation Process
Internal Governance
Validation of IRB Models
Qualitative Validation
- model design
- data quality
- Use Test
Quantitative Validation
- backtesting
- Discriminating Power
- Stability Tests
Technological infrastructure
Required documentation
Internal validation department and team
Audit department and team
artificial intelligence
- Autonomous validation, reconstruction and recalibration
- Model Validation using Machine Learning
- Artificial intelligence for model risk
- Artificial intelligence to validate models

Module 6: Model Risk Management

Governance in model risk
- Role of the board and senior management
- Role of the risk department
- Responsible and creators of the model
- Model risk committees
- Internal audit
- Policy definition
Organization
- Three lines of defense
- Internal and external communication
Model Life Cycle Phases
- ID
- Model Inventory
- Classification
- Levels or "Tiering" according to materiality, sophistication and impact
- Planning and development
- internal validation
- Documentation
- pattern approval
- Implementation and use of the model
- Model monitoring
- International best practice risk management model
Model Risk Mitigation

Data quality test

Validation and audit

Benchmarks

What-if sensitivity tests

Stress testing

backtesting

Model risk appetite
- Model risk appetite statement
- risk tolerance
Qualitative measurement of model risk
- Creation of the Model Risk Scorecard
- Definition of scale and ranges
- International Scorecard Best Practices
Case study: Scorecard for model risk

Quantification of Model Risk

Module 7: Quantitative Measurement of Model Risk

The importance of quantifying model risk
Best international classifications
Optimal and non-optimal models
Categories of model risk:
Data deficiency
Model parameter uncertainty
Inappropriate use of the model
Model specification
Change in the dynamics of the financial and economic environment
Incorrect model implementation, misinterpretation of model output, and other errors
Model risk aggregation

Module 8: Models for Quantifying Model Risk

Why does this type of model risk occur?
Examples
Magnitude of model error related to human error
Qualitative measurement of model risk
Scorecard model
Creation of the model risk scorecard
Definition of scale and ranges
International best practices for scorecards
LDA approach to model risk measurement
Loss Distribution Approach (LDA)
Available data
Scenario Based Approach (SBA)
Scenario generation
Scenario assessment
Data quality and validation
Parameter determination
Frequency Distributions
Severity Distributions
Distribution of losses due to model risk
Monte Carlo Simulation
Panjer Approach
FFT
Bottom-up model uncertainty approach
Sources of information
Exercise 1: Estimation of model risk scorecard, establishment of scale and ranges.
Exercise 2: Estimation of model risk using the following frequency and severity distributions:
Frequency
- Poisson and Negative binomial
Severity
- Lognormal, Gamma, Weibull, Inverse Gaussian
- GDP EVT, G-H 4 parameter, Mixture of lognormals
- Lognormal-EVT, Poisson-Gamma Bayesian Approach, Lognormal Partition and GDP
Scenarios with Expert Criteria
Exercise 3: Selection of the best distribution using Cramer Von Misses, AD, and KS goodness-of-fit tests
Exercise 4: Comparison of model risk using Recursive Panjer, Fast Fourier Transformation, and Monte Carlo Simulation

Model Risk in Credit Scoring

Module 9: Model risk in Credit scoring

Dimension and use of materiality
Classification of scoring models by importance within the financial institution
Impact of the model on the entity
model dependency
Model limitations
Model governance
Documentation and review
Implementation
operational controls
Decision tree to assess rating and scoring models
Casuistry in expert Credit Rating models
Casuistry in statistical credit scoring models
Model risk in big data
Risk model by machine learning and black box

Module 10: Exploratory data analysis EDA and Binning

Exploratory data analysis
Data typology
transactional data
Unstructured data embedded in text documents
Social Media Data
data sources
Data review
Target definition
Time horizon of the target variable
Sampling
- Random Sampling
- Stratified Sampling
- Rebalanced Sampling
Exploratory Analysis:
- histograms
- Q Q Plot
- Moment analysis
- boxplot
Treatment of Missing values
- imputation
- Delete
- Keep
Advanced Outlier detection and treatment techniques
- Z-Score
- Mahalanobis Distance
Data Standardization
Binning
Variable categorization
- Equal Interval Binning
- Equal Frequency Binning
- Chi-Square Test
binary coding
WOE Coding
- WOE Definition
- Univariate Analysis with Target variable
- Variable Selection
- Treatment of Continuous Variables
- Treatment of Categorical Variables
- Gini and KS
- Information Value
- Optimization of continuous variables
- Optimization of categorical variables
Exercise 5: Exploratory Analysis in R
Exercise 6: Detection and treatment of Outliers in R
Exercise 7: Missing imputation techniques in R
Exercise 8: Stratified and Random Sampling
Exercise 9: Analysis of the Weight of Evidence in Excel
Exercise 10: Univariate analysis in percentiles in R
Exercise 11: Continuous variable optimal univariate analysis in R
Exercise 12: KS, Gini and IV validation of each variable
Exercise 14: Optimizing categorical variables in R
Exercise 15: Univariate Analysis with decision trees in R
Exercise 16: Segmentation using K means Clustering in R

MACHINE LEARNING MODELS

Module 11: Machine Learning

Unsupervised models
K Means
Principal Component Analysis (PCA)
Advanced PCA Visualization
Supervised Models
Ensemble Learning
Bagging trees
Random Forest
Boosting
Adaboost
Gradient Boosting Trees

Deep Learning

Module 12: Deep Learning

Feed Forward Neural Networks
Single Layer Perceptron
Multiple Layer Perceptron
Neural network architectures
Activation function
Back propagation
- Directional derivatives
- gradients
- Jacobians
- Chain rule
- Optimization and local and global minima
Deep Learning Convolutional Neural Networks CNN
- CNN for pictures
- Design and architectures
- convolution operation
- filters
- strider
- padding
- Subsampling
- pooling
- fully connected
- Credit Scoring using CNN
- Recent CNN studies applied to credit risk and scoring

Deep Learning Recurrent Neural Networks RNN
- Long Term Short Term Memory (LSTM)
- Hopfield
- Bidirectional associative memory
- descending gradient
- Global optimization methods
- RNN and LSTM for credit scoring
- One-way and two-way models
Generative Adversarial Networks (GANs)
- Generative Adversarial Networks (GANs)
- Fundamental components of the GANs
- GAN architectures
- Bidirectional GAN
- Training generative models
- Credit Scoring using GANs

Module 15: Generative AI

Introducing generative AI
What is Generative AI?
Generative AI Models
- Generative Pre- trained Transformer (GPT)
Text generation, Image generation, Music generation, Video generation
Generating text
Generating Code
Ability to solve logic problems
Enterprise Use Cases for Generative AI
Overview of Large Language Models (LLMs)
- Transformer Architecture
- Types of LLMs
- Open-Source vs. Commercial LLMs
- Key Concepts of LLMs
Prompts
Tokens
Embeddings
Model configuration
Prompt Engineering
Model adaptation
Specifying multiple Dataframes to ChatGPT
Exercise 17: Embeddings for words, sentences, question answers
Exercise 18: Embeddings on Large Dataset
Exercise 19: Advanced Prompting Techniques
Exercise 20: Large Language Models in Credit Rating

Model risk in Credit Scoring

Module 16: Multivariate models and Machine Learning

Multivariate Models
- Logistic regression
- Cox Regression
- Model Risk
Machine Learning
- decision trees
- neural networks
- SVM
- Ensemble Learning
- bagging
- Boosting
- Random Forest
Deep Learning
- CNN
- LSTM
- GAN
- LLM
Model Risk in Machine Learning
- overfitting
- Transparency
- failed sampling
- important variables
Exercise 21: Logistic Regression, stepwise method in R
Exercise 22: Piecewise Regression in Excel and R
Exercise 23: Decision trees in R
Exercise 24: Support Vector Machine in R
Exercise 25: Neural Networks in R
Exercise 26: Ensemble models in R
Exercise 27: Random Forest in R
Exercise 28: Bagging in R
Exercise 29: Credit scoring using deep learning CNN Python
Exercise 30: Credit Scoring using Deep Learning LSTM Python
Exercise 31: Credit Scoring using GANs Python
Exercise 32: Comparison of models of discriminant power between models: Neural Networks, Logistic Regression, Panel Data Logistic Regression and Cox Regression
Exercise 33: Model Risk using Confidence Intervals of Logistic Regression Coefficients

Module 14: Tuning Hyperparameters in Deep Learning

Hyperparameterization
Grid search
Random search
Bayesian Optimization
Train test split ratio
Learning rate in optimization algorithms (e.g. gradient descent)
Selection of optimization algorithm (e.g., gradient descent, stochastic gradient descent, or Adam optimizer)
Activation function selection in a (nn) layer neural network (e.g. Sigmoid, ReLU, Tanh)
Selection of loss, cost and custom function
Number of hidden layers in an NN
Number of activation units in each layer
The drop-out rate in nn (dropout probability)
Number of iterations (epochs) in training a nn
Number of clusters in a clustering task
Kernel or filter size in convolutional layers
Pooling size
Batch size
Interpretation of the Shap model
Exercise 34: Tuning hyperparameters in Xboosting, Random forest and SVM models for credit scoring
Exercise 35: Tuning hyperparameters in Deep Learning model for credit scoring

Module 17: Model Risk in the Scorecard

Scoring assignment
Scorecard Classification
- Scorecard WOE
- Binary Scorecard
- Continuous Scorecard
Scorecard Rescaling
- Factor and Offset Analysis
- Scorecard WOE
- Binary Scorecard
Reject Inference Techniques
- cut-off
- parceling
- Fuzzy Augmentation
Advanced Cut Point Techniques
- Cut-off optimization using ROC curves
Model risk by cut point decision
Model risk due to lack of data
Model Risk for not updating or recalibrating
Exercise 36: Construction of Scorecard in Excel
Exercise 37: Optimum cut-off point estimation in Excel and model risk by cut-off point selection
Exercise 38: Confusion matrix to verify Type 1 and Type 2 Error in Excel with and without variables
Exercise 39: Model risk in credit scoring due to not recalibrating on time

Model Validation

Validation of Credit Scoring models

Module 18: Stability tests

Model stability index
Factor stability index
Xi-square test
K-S test
Exercise 40: Stability tests of models and factors

Module 19: Validation of traditional and

Machine Learning models

Out of Sample and Out of time validation
Checking p-values in regressions
R squared, MSE, MAD
Waste diagnosis
Goodness of Fit Test
- deviation
- Bayesian Information Criterion (BIC)
- Akaike Information Criterion
Multivariate Multicollinearity
Cross validation
Error bootstrapping
Binary case confusion matrix
Multinomial case confusion matrix
Main discriminant power tests:
- KS, ROC Curve, Gini Index, Cumulative Accuracy Profile, Kullback-Leibler Distance, Pietra Index, 1-Ph, Conditional Entropy, Information Value, Kendall Tau, Brier Score, Divergence
confidence intervals
Jackknifing with discriminant power test
Bootstrapping with discriminant power test
Kappa statistic
K-Fold Cross Validation
Traffic Light Analysis
Exercise 41: Cross validation in R
Exercise 42: Gini Estimation, Information Value, Brier Score, Lift Curve, CAP, ROC, Divergence in Excel
Exercise 43: Bootstrapping of R parameters
Exercise 44: Gini/ROC Bootstrapping in R
Exercise 45: Kappa estimation
Exercise 46: K-Fold Cross Validation in R
Exercise 47: Out of time traffic light validation (horizon 6 years) of Logistics and Machine Learning models

Explainable Artificial Intelligence

Module 20: Explainable Artificial Intelligence XAI

Interpretability problem
model risk
Regulation of the General Data Protection Regulation GDPR
EBA discussion paper on machine learning for IRB models
- 1. The challenge of interpreting the results,
- 2. The challenge of ensuring that management functions adequately understand the models, and
- 3. The challenge of justifying the results to supervisors
Black Box Models vs. Transparent and Interpretable Algorithms
interpretability tools
Shap, Shapley Additive explanations
- Global Explanations
- Dependency Plot
- Decision Plot
- Local Explanations Waterfall Plot
Lime, agnostic explanations of the local interpretable model
Explainer Dashboard
Other advanced tools
Exercise 48: XAI interpretability of credit scoring

Advanced Validation

Module 21: Advanced Validation of AI Models

Integration of state-of-the-art methods in interpretable machine learning and model diagnosis.
Data Pipeline
Feature Selection
Black-box Models
Post-hoc Explainability
Global Explainability
Local Explainability
Model Interpretability
Diagnosis: Accuracy, WeakSpot, Overfit, Reliability, Robustness, Resilience, Fairness
Model comparison
- Comparative for Regression and Classification
- Fairness Comparison
Exercise 49: Validation and diagnosis of advanced credit scoring models

MODEL VALIDATION

PD, LGD and EAD validation IRB and IFRS 9

Module 22: Directive on the estimation of PD and LGD IRB and defaulted exposures issued by EBA (updated ECB Guide to Internal Models July 2025)

European Directive on estimation of PD and LGD, and exposures in default
Why is it advisable to consider it in Latin America?
Parameter variability reduction
Homogenization of the calculation of PD and LGD
Implementation dates in European banks
Data quality
Representativeness of the data for the development of the model and for the calibration of the risk parameters
Human judgment for parameter estimation
Treatment of deficiencies and margin of conservatism (Moc)
PD estimation
Model development
Data requirement
Risk drivers and rating criteria
Treatment of external ratings
Rating philosophy
Pool Treatment
PD Calibration
Data requirement
One Year Default Rate Calculation
Calculation and use of the average observed default rate
Long-term default rate
Calibration of the long-term default rate
LGD estimation
Methodologies for estimating PD
Data requirement
Recoveries from collaterals
Model development
Risk drivers
Collateral Eligibility
Inclusion of collaterals
LGD Calibration
Definition of economic loss and realized loss
Treatment of commissions, interest and other withdrawals after default
Discount rate
Direct and indirect costs
long-term LGD
Long Term LGD Calibration
Estimation of risk parameters for exposures in default
Estimation and calibration of the Expected Loss Best Estimate
Estimation and calibration of LGD in-default
Application of risk parameters
Review of estimates
Accompanying Documents
impact assessment
Identification of the problem
Policy objectives
Baseline scenario
Options considered
Cost-benefit analysis

Module 23: PD templates for IRB and IFRS 9 approach

Introduction to Probability of Default
Default definition
Default Triggers
Effective and robust process to detect default
Technical defaults and technical default filters
Indispensable data model
Single Factor Analysis
Multifactor Analysis
Model Selection
Historical PS
Econometric and Machine Learning Models of PD
Risk factors that affect default
- macroeconomic
- idiosyncratic
PD Logistic Regression
PD COX Regression
PD Log-log Complementary
PD Logistic Regression Data Panel
Machine Learning to estimate PD
PD Calibration
Introduction to Calibration
Anchor Point Estimate
Mapping from Score to PD
Temporal structure of the PD
- Marginal PD
- Forward PD
- Cumulative PD
Techniques for Mapping PD's to temporary structure
Vintages or vintages of PD
Bayesian PD
Expert Judgment
Prior and posterior distribution
Markov Chain Monte Carlo
probit model
Adjustment to the PD Economic Cycle
Introduction of Adjustment to the Economic Cycle
Directives on the economic cycle in the PD models
PD Trough The Cycle (PD TTC) Models
Considerations of the Adjustment to the “Scalar Variable” approach cycle
PD on Low Default Portfolios
PD estimation without correlations
PD estimation with correlations
LDP Calibration Using CAP Curves
Bayesian PD estimation for LDP
Default correlation
Correlation of defaults and multiperiod
Neutral Bayesian and Conservative Bayesian
Transition and PD Matrices
Properties of transition matrices
Multi-year transition matrix
- discrete time
- continuous time
- Generating Matrix
- Exponential of a matrix
duration method
Cohort method
error management
IFRS 9 PD Modeling
IFRS 9 requirements
Probability Weighted
Forward Looking
Lifetime PD modeling
PD Forecasting Modeling
PD Point in Time Forecasting
Markov chains
Exercise 50: Calibration of PD with COX regression in R
Exercise 51: PD calibration with log-log complementary in R
Exercise 52: PD calibration with logistic model in R
Exercise 53: PD calibration logistic Bayesian regression
Exercise 54: Calibration of PD regression panel logistic data
Exercise 55: Calibration of PD Lasso Regression
Exercise 56: Calibration of PD Bayesian Probit regression in R
Exercise 57: Transition matrices in Excel and R
Exercise 58: Multinomial Regression to estimate PD Lifetime
Exercise 59: Multi stage Markov chains in R

Module 24: Backtesting PD IRB and IFRS9

Validation of the PD in IRB PIT and TTC
Validation of PD Lifetime and PD12m in IFRS 9
Backtesting PS
PD Calibration Validation
Hosmer Lameshow test
normal test
Binomial Test
Spiegelhalter test
Redelmeier Test
Traffic Light Approach
Traffic Light Analysis and PD Dashboard
PS Stability Test
Forecasting PD vs Real PD in time
Validation with Monte Carlo simulation
Exercise 60: Backtesting PD IRB and PD IFRS 9
Exercise 61: Forecasting Estimated PD and Actual PD in Excel
Exercise 62: Validation using Monte Carlo Simulation

Module 25: LGD Models for IRB and IFRS 9 Approach

BASEL III and EBA LGD
LGD estimation and calibration in practice
Default LGD estimation
LGD Econometric and Machine Learning Models
Advantages and disadvantages of LGD Predictive Models
Forward Looking models incorporating Macroeconomic variables
Parametric and non-parametric models and transformation regressions
Linear regression and Beta transformation
Linear Regression and Logit Transformation
Linear regression and Box Cox transformation
Logistic and Linear Regression
Logistic and nonlinear regression
Censored Regression
Generalized Additive Model
neural networks
SVM
Beta regression
Inflated beta regression
Fractional Response Regression
LGD for IFRS 9
Comparison of regulatory LGD vs. IFRS 9
LGD adjustments
Selection of Interest Rates
Allocation of Costs
floors
Treatment of collateral over time
Marginal LGD
PIT LGD
Loss Lifetime Concept
exposure treatment
Exercise 63: Logistic and linear regression LGD in R
Exercise 64: Neural Networks and SVM LGD
Exercise 65: Generalized Additived Model LGD in R
Exercise 66: Beta Regression Model LGD in R
Exercise 67: Long-term LGD calibration
Exercise 68: Inflated Beta Regression in SAS
Exercise 69: Comparison of the performance of the models using Calibration and precision tests

Module 26: LGD Backtesting

LGD backtesting in IRB
LGD Backtesting in IFRS 9
Accuracy ratio
absolute accuracy indicator
Confidence Intervals
transition analysis
RR Analysis using Triangles
Advanced LGD Backtesting with a vintage approach
Backtesting for econometric models
ROC, Gini and K-S curve
Exercise 70: Comparison of the performance of the models using Calibration and precision tests.

Module 27: EAD Models

Guidelines for estimating CCF
Guidelines for Estimating CCF Downturn
Temporal horizon
Transformations to model the CCF
Approaches to Estimating CCF
Fixed Horizon approach
Cohort Approach
Variable focus time horizon
CCF Econometric and Machine Learning Models
- Linear regression
- Logistic regression
- Generalized Additive Model
- neural networks
- SVM
- Beta regression
- Inflated beta regression
- Fractional Response Regression
EAD for IFRS 9
- Comparison of regulatory EAD vs. IFRS 9
- Adjustments in the EAD
- Interest Accrual
- CCF PIT Estimate
- Modeling of available lifetime
Exercise 56: Estimation and adjustments for EAD IFRS 9 in excel and R
Exercise 57: Neural Networks and SVM CCF
Exercise 58: Generalized Additived Model CCF in R
Exercise 59: Beta Regression Model CCF in R

Module 28: EAD Backtesting

EAD validation
CCF validation
Backtesting EAD and CCF IRB
Backtesting of the EAD and CCF IFRS 9
r squared
Pearson coefficient
Spearman correlation
Validation using ROC, K-S and Gini
Exercise 71: Comparison of the performance of EAD models

IFRS 9 Expected Credit Loss (ECL) Validation

Module 29: ECL Validation

Initial validation
Periodic validation
Monitoring
Main milestones of qualitative validation
- Data quality
- Default Definition
- Relevance of the qualification process
- Override Analysis
- environmental dynamics
- user test
Main milestones of quantitative validation
- Samples used for validation purposes
- Discriminating Power
- population stability
- Characteristic Stability
- concentration analysis
- Staging analysis
- Parameter Calibration
- ECL backtesting
Principle 5 – Validation of the ECL model in Basel III
- Governance
- Model inputs
- model design
- Model output/performance
Validation metrics
- Bayesian/Akaike/Schwartz/Deviance information criteria
- Receiver operating characteristic (ROC) curve or AUROC statistic
- Lorenz curve, Gini coefficient, Kolmogorov-Smirnov test
- T-tests, F-tests, Wald test, log likelihood test
- RMSE, MAPE, MAD
- R-squared, Adjusted R-squared
- Out-of-sample testing
- benchmarking
- Population stability index (PSI)
Statistical problems
- Sampling bias
- Survivorship bias
- Disproportionately high model weightsT
- Autoregressive and lagged terms that do not capture macroeconomic effects
- spurious correlation
- Smoothing methods that alter data integrity
- simple linear models in nonlinear relationships

Validation of Regulatory and Economic Capital for Credit Risk

Module 30: Economic Capital Models

Definition and Objective
Temporal horizon
Default Correlation
asset mapping
unexpected loss
Regulatory capital models
ASRF Economic Capital Models
Business Models
Multifactorial Models
Economic Capital for retail using Charge off
Dependency modeling using copulas
VaR estimation and Expected Shortfall
Economic capital management
Exercise 72: Default correlation matrix in SAS
Exercise 73: Correlation of default: consumer portfolios in R
Exercise 74: Correlation of assets with EMV and observable data in R
Exercise 75: Creditrisk + in R
Exercise 76: One-factor model in Excel and R
Exercise 77: Multifactorial Model in Excel
Exercise 78: T-student in Excel in Excel
Exercise 79: Copulas in R

Module 31: Economic capital validation

Regulatory Capital Models
Economic Capital Models
Validation of Credit Portfolio Models
- Model Design
- Model Output
- Processes, data and test of use
Loss aggregation validation
Validation of Basel models
Testing distributions using Berkowitz test
Credit Loss Distribution
Simulation of the critical chi-square value
Berkowitz test in subportfolios
power assessment
Scope and limits of the test
Model risk in economic capital due to uncertainty
Exercise 80: Implementation of the Berkowitz test in economic and regulatory capital models
Exercise 81: Simulation of losses and model risk in regulatory and economic capital

Validation of Stress Testing Credit Risk

Module 32: Forecasting Models

Data processing
- Non-Stationary Series
- Dickey-Fuller test
- Cointegration Tests
Econometric Models
- ARIMA models
- VAR Autoregressive Vector Models
- ARCH models
- GARCH models
- Linear regression
Machine Learning Models
- Supported Vector Machine
- neural network
Exercise 82: Nonstationary Series and Cointegration Tests in R and Python
Exercise 83: macroeconomic variables with VAR in R
Exercise 84: Garch modeling market variables R
Exercise 85: Machine Learning SPV and NN modeling in Python

Module 33: Validation of econometric models

Review of assumptions of econometric models
Review of the coefficients and standard errors of the models
Model reliability measures
Error management
not normal
heteroscedasticity
Outliers
autocorrelation
multicollinearity
Exercise 86: Measuring Logistic Regression Collinearity

Module 34: Stress Testing Consumer Credit Risk

Temporal horizon
Multi-period approach
Impact on P&L, RWA and Capital
Macroeconomic Stress Scenarios in consumption
Stress Testing of PD, LGD and EAD
Stress Testing of the Transition Matrix
Chage Off Stress Testing
Losses from new impaired assets
Losses on old impaired assets
Exercise 87: Stress Testing PD in Excel and R multifactorial model
Exercise 88: Stress Testing PD Multiyear Approach
Exercise 89: Stress test of PD and Autoregressive Vectors
Exercise 90: Net Charge Off Stress Test
Exercise 91: LGD Stress Test

Module 35: Stress Testing Credit Risk Portfolios Corporate

Stress Test Methodology for corporate portfolios
Creditmetrics and Transition Matrices
Credit Index and PD
PD simulation and transition matrices
Exercise 92: Corporate portfolio stress test

Module 36: Stress Testing of ECL IFRS 9

Stress testing IFRS 9 and COVID-19
Pandemic scenarios applied to the ECL calculation
Stress Testing of IFRS 9 parameters
EBA Stress Testing 2021
Treatment of the moratorium
Possible regulatory scenarios
Impact on P&L
PIT starting parameters
PIT projected parameters
Calculation of non-productive assets and impairments
Changes in the stock of provisions
Changes in the stock of provisions for exposures phase S1
Changes in the stock of provisions for exposures phase S2
Changes in the stock of provisions of exposures phase S3
Sovereign Exposure Impairment Losses
Impact on capital
Internal Stress Testing Model for ECL IFRS 9
Exercise 93: Stress Testing of the ECL using matrices and time series R and Excel

Module 37: Validation of Stress Testing

Validation of Stress Testing
Validation of the Best Case and adverse scenarios
Stationarity of variables
The signs of economically intuitive coefficients
Statistical significance of coefficients
confidence levels
residual diagnoses
Model performance metrics
- goodness of fit
- Risk classification
- cumulative error measures
Industry Accepted Thresholds
Intuitive sort order
Generalized Cross Validation
Squared Correlation
Root Mean Squared Error
Cumulative Percentage Error
Akaike Information Criterion
Exercise 94: Validation tests of stress testing VAR vs MARS

Agent AI for Model Validation

Automating the construction and calibration of

Credit Risk models with AI

Module 38: Build Automation and Calibration

What is modeling automation?
that is automated
Automation of machine learning processes
Optimizers and Evaluators
Modeling Automation Workflow Components
- Summary
- Indicted
- Feature engineering
- Model generation
- Assessment
Hyperparameter optimization
Reconstruction or recalibration of credit scoring
Credit Scoring Modeling
- Main milestones
- Evaluation and optimization
- Possible Issues
PD calibration modeling
- Evaluation and optimization
- backtesting
- Discriminating Power
- Stability Tests
Global evaluation of modeling automation
Implementation of modeling automation in banking
Technological requirements
available tools
Benefits and possible ROI estimation
Main Issues
Model Risk
Exercise 95: Automation of modeling and optimization and validation of credit scoring hyperparameters

Module 40: Expanded Module: Agent AI for Model Validation

1. What is an Agent AI in Model Risk?

An autonomous or semi-autonomous AI system that uses RAG (retrieval-augmented generation) + monitoring scripts + reporting templates to assist model validation teams.
Unlike a static validation script, an Agent AI can:
- Listen: continuously monitor inputs/outputs of models.
- Think: apply rules, statistical tests, and supervisory requirements.
- Act: generate reports, flag anomalies, and escalate issues.
Goal: augment human validators, not replace them.

2. Functions Across the Validation Lifecycle

A. Monitoring

Continuous performance tracking (AUC, Gini, KS, Hosmer–Lemeshow).
Alerts for drift in input data (Population Stability Index, feature distribution shifts).
Watchdog for compliance with “use test” (ensuring model is applied consistently).

B. Review

Automated documentation checks: ensures PD/LGD/EAD methodologies are properly documented.
Cross-references model register vs. current version (alerts on undocumented changes).
Reviews MoC justification in light of validation outcomes.

C. Analysis

Runs challenger models: compares ML models with logistic regression benchmarks.
Conducts explainability tests: SHAP, LIME, surrogate linear models.
Identifies “black box” risks — where variables drive predictions but lack economic meaning.

D. Validation

Executes quantitative validation suite:
- Discrimination: ROC, AUC, Gini.
- Calibration: binomial test, Hosmer–Lemeshow, calibration curves.
- Stability: PSI, time stability metrics.
Automates out-of-time and stress testing (e.g. COVID dataset vs. pre-COVID).
Suggests MoC adjustments when systematic errors are detected.

E. Reporting

Auto-generates validation reports with:
- Executive summary (traffic-light system).
- Tables/graphs of metrics.
- Supervisory compliance checklist (ECB Guide sections).
Creates alerts to governance bodies (Board, Risk Committee).
Stores audit logs for ECB/JST review.

3. Architecture of an AI Validation Agent

Data Ingestion Layer: connects to model outputs, validation datasets, and logs.
Validation Engine: runs statistical tests, ML explainability, benchmarking.
Rule-based Layer: supervisory compliance checks (ECB validation requirements).
RAG Knowledge Layer: retrieves supervisory text (e.g. ECB Guide, SR 11-7, TRIM).
LLM Interaction Layer: generates human-readable validation reports.
Interface/Dashboard: alerts, monitoring, drill-down analysis.

4. Benefits of Agent AI for Banks

Efficiency: reduces manual workload in recurring tests.
Consistency: applies the same framework across multiple models and portfolios.
Auditability: provides structured logs and transparent documentation.
Early warning: detects issues (drift, bias, calibration failures) before supervisors find them.
Scalability: supports dozens of models simultaneously across retail, wholesale, IRB portfolios.

5. Practical Exercise: Build a Prototype

AI Agent for Validation

Objective: simulate an Agent AI that monitors and validates a PD model.

Understand how AI agents can support regulatory-aligned validation.
Be able to design automated workflows for monitoring and validation.
Gain hands-on skills in building a prototype Agent AI in Python/R.
Appreciate the governance, benefits, and limitations of automation in model risk.

Dataset: credit scoring dataset (default = 0/1).
Models:
- Logistic Regression (benchmark).
- Gradient Boosting (ML challenger).
Agent AI Tasks (to code in Python/R):
- Monitor: calculate AUC, PSI, KS monthly.
- Analyse: run SHAP values to explain ML model.
- Validate: compare calibration (predicted PDs vs. actual defaults).
- Report: generate an automated validation summary (PDF/Markdown).
Output:
- Traffic-light dashboard:
  - 🟢 if AUC > 0.70 and PSI < 0.1.
  - 🟡 if moderate drift or small calibration issues.
  - 🔴 if serious underperformance, drift, or poor calibration.
- A short written validation report auto-produced by the Agent AI.

📘 Course Description

​

Internal Models for Credit Risk: Validation and Model Risk under AI & ML

ECB guide to internal models July 2025

Introduction and Supervisory Foundations

Overarching Principles of Internal Models

Internal Validation Principles

Internal Audit and Oversight

Climate-Related and Environmental Risks

Machine Learning in Internal Models

Credit Risk Models – Governance and Validation

Definition of Default

Estimation of Credit Risk Parameters

Model Risk Governance & ICAAP Integration

Specifying multiple Dataframes to ChatGPT

Exercise 17: Embeddings for words, sentences, question answers

Exercise 18: Embeddings on Large Dataset

Exercise 19: Advanced Prompting Techniques

​