A preliminary study of variable selection in penalized logistic regression with rare events data / 在稀少事件下邏輯式迴歸於三種懲罰項的變數篩選能力之初步探討

Master's thesis / National Cheng Kung University / Department of Statistics / 107 / It is well known that the accuracy of the maximum likelihood estimate (MLE) of the regression coefficients in a logistic regression model is seriously affected by rare events. Less attention has been given to the performance of variable selection in logistic regression with rare events. Therefore, this thesis studies the performance of three variable selection methods, LASSO (Least Absolute Shrinkage and Selection Operator), SCAD (Smoothly Clipped Absolute Deviation), and Adaptive LASSO, when the event rate is low and the number of explanatory variables is much larger than the sample size.
A simulation study is conducted to compare how accurately the three methods select the important explanatory variables of a logistic regression model. Based on a limited set of simulation scenarios, the results favor Adaptive LASSO for selecting important explanatory variables when the event rate is as low as 0.05. Consequently, Adaptive LASSO is recommended for variable selection and prediction with rare events data.
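
For readers unfamiliar with the three penalties named in the abstract, the following is a minimal sketch of their textbook forms, not the thesis's own code. The function names, the SCAD shape parameter `a = 3.7`, the adaptive weight exponent `gamma = 1`, and the small constant `eps` are all assumptions chosen for illustration; the abstract does not state which parametrization or tuning values the author used.

```python
def lasso_penalty(beta, lam):
    """LASSO penalty: lam * sum of |beta_j|."""
    return lam * sum(abs(b) for b in beta)


def scad_penalty(beta, lam, a=3.7):
    """SCAD penalty in its usual piecewise form (requires a > 2).

    Linear like LASSO near zero, quadratic in the middle range, and
    constant for large coefficients, so large effects are not shrunk further.
    a = 3.7 is a commonly used default, assumed here for illustration.
    """
    total = 0.0
    for b in beta:
        t = abs(b)
        if t <= lam:
            total += lam * t
        elif t <= a * lam:
            total += (2 * a * lam * t - t * t - lam * lam) / (2 * (a - 1))
        else:
            total += lam * lam * (a + 1) / 2
    return total


def adaptive_lasso_penalty(beta, lam, beta_init, gamma=1.0, eps=1e-8):
    """Adaptive LASSO penalty: lam * sum of w_j * |beta_j|.

    The weights w_j = 1 / |beta_init_j|**gamma come from an initial
    estimate beta_init (hypothetical input here); eps only guards
    against division by zero.
    """
    weights = [1.0 / (abs(b0) ** gamma + eps) for b0 in beta_init]
    return lam * sum(w * abs(b) for w, b in zip(weights, beta))
```

In each method the chosen penalty is added to the negative log-likelihood of the logistic model, and coefficients estimated as exactly zero are treated as unselected variables. The sketch only evaluates the penalties; it does not fit a model or reproduce the simulation.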

Identifier oai:union.ndltd.org:TW/107NCKU5337022
Date January 2019
Creators Ding-Huang Lin, 林鼎晃
Contributors Yun-Chan Chi, 嵇允嬋
Source Sets National Digital Library of Theses and Dissertations in Taiwan
Language zh-TW
Detected Language English
Type 學位論文 ; thesis
Format 33
