Global ETD Search

Return to search

Credit risk modelling and prediction: Logistic regression versus machine learning boosting algorithms

The use of machine learning methods in credit risk modelling has been proven to yield good results in terms of increasing the accuracy of the risk score as- signed to customers. In this thesis, the aim is to examine the performance of the machine learning boosting algorithms XGBoost and CatBoost, with logis- tic regression as a benchmark model, in terms of assessing credit risk. These methods were applied to two different data sets where grid search was used for hyperparameter optimization of XGBoost and CatBoost. The evaluation metrics used to examine the classification accuracy of the methods were model accuracy, ROC curves, AUC and cross validation. According to our results, the machine learning boosting methods outperformed logistic regression on the test data for both data sets and CatBoost yield the highest results in terms of both accuracy and AUC.

http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-465641

Probability Theory and Statistics

Sannolikhetsteori och statistik

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:uu-465641
Date	January 2022
Creators	Machado, Linnéa, Holmer, David
Publisher	Uppsala universitet, Statistiska institutionen
Source Sets	DiVA Archive at Upsalla University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.0017 seconds

Credit risk modelling and prediction: Logistic regression versus machine learning boosting algorithms

Description

Links & Downloads

Tags

Additional Fields