應用大數據於信用評等之模型探討 / The Application of Big Data on Credit Scoring Model

Credit risk, or credit default, refers to the probability that a financial institution is not repaid for the services it extends to its customers. It is studied intensively in the field of bank lending decisions because it matters greatly to financial institutions and, for commercial banks in particular, is often hard to interpret or control. Thanks to advances in technology, financial institutions can now develop robust and refined systems and models for predicting and managing credit risk at a lower cost than in the past. Since the arrival of the big data era, every customer, even a student, leaves traces that can be scored. This study starts from the basic dimensions that earlier work has tested or relied on, such as customers' personal information, financial status, and past and present borrowing records with the institution, and supplements them with information from open data. It proposes additional variables that may affect the default rate, such as records that external regulators hold on the customer, and tests whether they open a new direction for modeling and, if so, how strong their effect is.
Many methods have been proposed for predicting credit default, and the choice among them depends on the complexity and size of the financial institution and on the type of loan. The most common is discriminant analysis, but it imposes strict assumptions on the variables. Neural networks, a newer approach, overcome this limitation and predict well, but they return only a prediction while the computation behind it stays opaque, which does not help in understanding the relationships among variables. This study therefore adopts logistic regression, which can predict a binary outcome and also shows the relationship between the dependent and independent variables through the model coefficients. Following prior research on logistic regression in credit risk, the study gives a complete account of how the data are cleaned, how the variables are transformed, how the models are built, and how the final model is selected. Although the procedure is long, the results validate the original hypothesis: the new dimension does have an impact on credit risk, and the model coefficients show the size of that impact. To highlight the information that logistic regression provides about the variables, the results are finally presented in a visual form that is easier to read than text.
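The interpretability argument above can be illustrated with a minimal sketch. It is not the thesis's model or data: the two features (`income`, `external_records`), the synthetic data generator, and the helper names are all hypothetical, and the fit uses plain gradient descent in pure Python only to keep the example self-contained.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def make_synthetic(n=400, seed=0):
    # Hypothetical data: two standardized features and a binary default flag.
    # The "true" effects (-1.5 and +1.0) are assumptions made for illustration.
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        income = rng.gauss(0, 1)            # higher income lowers default odds
        external_records = rng.gauss(0, 1)  # more external records raise them
        logit = -1.0 - 1.5 * income + 1.0 * external_records
        y.append(1 if rng.random() < sigmoid(logit) else 0)
        X.append([income, external_records])
    return X, y


def fit_logistic(X, y, lr=0.5, epochs=1000):
    # Batch gradient descent on the average logistic (cross-entropy) loss.
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


X, y = make_synthetic()
w, b = fit_logistic(X, y)

# exp(coefficient) is the multiplicative change in the odds of default
# for a one-unit increase in that feature, holding the other fixed.
odds_ratios = [math.exp(wj) for wj in w]
for name, wj, ratio in zip(["income", "external_records"], w, odds_ratios):
    print(f"{name}: coefficient={wj:.2f}, odds ratio={ratio:.2f}")
```

The sign of each coefficient gives the direction of a variable's effect on default, and its exponential gives the odds ratio, which is the kind of information a neural network's prediction alone does not expose.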

Identifier: oai:union.ndltd.org:CHENGCHI/G0105354001
Creators: 林瑀甯
Publisher: 國立政治大學 (National Chengchi University)
Source Sets: National Chengchi University Libraries
Language: Chinese
Detected Language: English
Type: text
Rights: Copyright © nccu library on behalf of the copyright holders