Comparison of multiple imputation methods for missing data : A simulation study

Even in well-designed and controlled studies, missing values are consistently present in research. It is well established that disregarding missingness by analyzing complete cases only reduces statistical power and can bias parameter estimates. Traditional methods of imputing missing data do not account for the uncertainty of the imputed values and can therefore give a misleading representation of the data. Research shows that traditional methods such as single imputation often underestimate the variance. This problem can be bypassed by imputing each missing value multiple times, so that the uncertainty of the imputation is properly taken into consideration. In this thesis a simulation study is conducted to compare two multiple imputation models: a defined linear stochastic regression model and a non-defined, flexible neural network model in which the validation MSE loss is used to account for variance in the imputed values. Three simulated data sets are sampled from a multiple bivariate linear regression model in which some of the values in Y2 are missing at random (MAR) given the Y1 variable. When a neural network is applied 30 times to each of the data sets with 25, 50 and 75 percent missing values and the results of the regression analysis on the completed data are pooled, almost all confidence intervals for the intercept cover the expected value. The only exception is the case of 75 percent missingness. When multiple imputation by chained equations (MICE) is applied to the data sets, the true intercept is covered by all confidence intervals. When 25 percent of the data is missing, both models yield unbiased results.
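The workflow summarized in the abstract (simulate data with MAR values in Y2, impute 30 times, analyze each completed data set, pool the results) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the thesis's code: the true intercept of 1 and slope of 2, the sample size of 500, the logistic missingness weights, and all function names are hypothetical. The imputation step is a plain Bayesian stochastic regression draw rather than the thesis's MICE or neural network implementation, and the confidence interval uses a normal approximation instead of Rubin's t reference distribution.

```python
import numpy as np

def simulate_mar(n, missing_frac, rng):
    """Simulate (Y1, Y2) and delete Y2 values with selection weights that depend only on Y1."""
    y1 = rng.normal(0.0, 1.0, n)
    y2 = 1.0 + 2.0 * y1 + rng.normal(0.0, 1.0, n)  # assumed true intercept 1, slope 2
    weights = 1.0 / (1.0 + np.exp(-y1))            # larger Y1 -> more likely to be missing
    k = int(round(missing_frac * n))
    idx = rng.choice(n, size=k, replace=False, p=weights / weights.sum())
    miss = np.zeros(n, dtype=bool)
    miss[idx] = True
    y2 = y2.copy()
    y2[miss] = np.nan
    return y1, y2, miss

def impute_once(y1, y2, miss, rng):
    """One stochastic regression imputation with parameter draws (normal linear model)."""
    X = np.column_stack([np.ones(y1.size), y1])
    Xo, yo = X[~miss], y2[~miss]
    beta_hat, *_ = np.linalg.lstsq(Xo, yo, rcond=None)
    resid = yo - Xo @ beta_hat
    df = Xo.shape[0] - Xo.shape[1]
    sigma2 = resid @ resid / rng.chisquare(df)     # draw the residual variance
    cov = sigma2 * np.linalg.inv(Xo.T @ Xo)
    beta = rng.multivariate_normal(beta_hat, cov)  # draw the coefficients
    completed = y2.copy()
    noise = rng.normal(0.0, np.sqrt(sigma2), miss.sum())
    completed[miss] = X[miss] @ beta + noise       # prediction plus random residual
    return completed

def fit_intercept(y1, y2):
    """OLS of Y2 on Y1; return the intercept estimate and its sampling variance."""
    X = np.column_stack([np.ones(y1.size), y1])
    beta, *_ = np.linalg.lstsq(X, y2, rcond=None)
    resid = y2 - X @ beta
    s2 = resid @ resid / (y1.size - X.shape[1])
    return beta[0], s2 * np.linalg.inv(X.T @ X)[0, 0]

def pool(estimates, variances):
    """Rubin's rules: pooled estimate and total variance over m imputations."""
    m = len(estimates)
    q_bar = np.mean(estimates)
    within = np.mean(variances)
    between = np.var(estimates, ddof=1)
    return q_bar, within + (1.0 + 1.0 / m) * between

rng = np.random.default_rng(0)
y1, y2, miss = simulate_mar(500, 0.25, rng)
fits = [fit_intercept(y1, impute_once(y1, y2, miss, rng)) for _ in range(30)]
est, total_var = pool([f[0] for f in fits], [f[1] for f in fits])
half_width = 1.96 * np.sqrt(total_var)             # normal approximation
print(f"pooled intercept: {est:.3f}, "
      f"95% CI: ({est - half_width:.3f}, {est + half_width:.3f})")
```

Changing the second argument of `simulate_mar` to 0.50 or 0.75 reproduces the other missingness levels mentioned in the abstract. The between-imputation term in `pool` is what single imputation leaves out, which is why it tends to underestimate the variance.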

Identifier: oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:umu-187318
Date: January 2021
Creators: Schelhaas, Sjoerd
Publisher: Umeå universitet, Statistik
Source Sets: DiVA Archive at Upsalla University
Language: English
Detected Language: English
Type: Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format: application/pdf
Rights: info:eu-repo/semantics/openAccess