In this thesis, we present the use of logistic regression method to develop a credit scoring modelusing the raw data of 4447 customers of a bank. The data of customers is collected under 14independent explanatory variables and 1 default indicator. The objective of this thesis is toidentify optimal coefficients. In order to clean data, the raw data set was put through variousdata calibration techniques such as Kurtosis, Skewness, Winsorization to eliminate outliers.On this winsorized dataset, LOGIT analysis is applied in two rounds with multiple statisticaltests. These tests aim to estimate the significance of each independent variable and modelfitness. The optimal coefficients can be used to obtain the credit scores for new customers witha new data set and rank them according to their credit risk.
Identifer | oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:mdh-64893 |
Date | January 2023 |
Creators | Hara Khanam, Iftho |
Publisher | Mälardalens universitet, Akademin för utbildning, kultur och kommunikation |
Source Sets | DiVA Archive at Upsalla University |
Language | English |
Detected Language | English |
Type | Student thesis, info:eu-repo/semantics/bachelorThesis, text |
Format | application/pdf |
Rights | info:eu-repo/semantics/openAccess |
Page generated in 0.0022 seconds