Global ETD Search

1	Seven methods of handling missing data using samples from a national data base / Witta, Eleanor Lea, January 1992 (has links) Thesis (Ph. D.)--Virginia Polytechnic Institute and State University, 1992. / Vita. Abstract. Includes bibliographical references (leaves 66-72). Also available via the Internet. Missing observations (Statistics)
2	Impact of missing data on building prognostic models and summarizing models across studies Munshi, Mahtab R. McGee, Daniel. January 2005 (has links) Thesis (Ph. D.)--Florida State University, 2005. / Advisor: Daniel McGee, Sr. , Florida State University, College of Arts and Sciences, Dept. of Statistics. Title and description from dissertation home page (viewed Jan. 24, 2006). Document formatted into pages; contains xi, 124 pages. Includes bibliographical references.
3	Improving classification performance in missing insurance data Duma, Mlungisi Sizwe 27 May 2013 (has links) D.Phil. (Electrical and Electronic Engineering) / The ubiquitous missing data and its pervasiveness in large scale datasets (such as insurance datasets) have inspired research conducted on this thesis to focus on techniques that sustain high accuracies and robustness. It is a consensus in research and in practice that missing data reduces the quality of data and negatively affects the accuracy in classification. The increase in pervasiveness of missing data affects the accuracy and robustness (or resilience) of classifiers. This effectively impacts decision making and calculation of premiums. The goal of the thesis is to present methods that will improve the accuracy and/or robustness of classifiers in the presence of missing data in insurance datasets. The first contribution in this thesis is a comprehensive comparative study of machine learning techniques (classifiers) in the presence of increasing missing data. The study explores and scrutinises their performance and robustness. The classifiers are the repeated incremental pruning to produce error reduction (RIPPER), naïve Bayes (NB), k-nearest neighbour (k-NN), logistic discriminant analysis (LgDA) and support vector machines (SVM). The study reveals that the sensitivity of the classifiers decreases with increasing missing data rate. The RIPPER shows better performance overall, whilst the NB shows better robustness as the quality of the data deteriorates. A second contribution presented in this thesis is a novel relevance determination (ARD) ensemble for effective attribute selection in insurance datasets with large number of attributes and contains missing data. ARD ensemble applies the Bayesian neural networks and evidence framework to find and order attributes based on their relevance to the target outcome. The data is partitioned into numerical and nominal subsets. Each ARD in the ensemble is then constructed using each of the subsets. The combined outcome of each ARD is scrutinised using a confidence factor and the most relevant attributes are selected. Missing data imputation is performed using the mean-mode imputation. The performance of the ARD ensemble is compared to that of the principal component analysis (PCA). The results show that classifiers that use the ARD ensemble achieve high accuracies and sustain robustness than when applied using the PCA. Missing observations (Statistics) Insurance
4	A comparison and selection of methods for handling missing data in data mining / Zou, Ying. January 2004 (has links) Thesis (M.Sc.)--York University, 2004. Graduate Programme in Computer Science. / Typescript. Includes bibliographical references (leaves 104-109). Also available on the Internet. MODE OF ACCESS via web browser by entering the following URL:http://gateway.proquest.com/openurl?url%5Fver=Z39.88-2004&res%5Fdat=xri:pqdiss&rft%5Fval%5Ffmt=info:ofi/fmt:kev:mtx:dissertation&rft%5Fdat=xri:pqdiss:MQ99411
5	The performance of missing data treatments for longitudinal data with a time-varying covariate Adachi, Eishi 28 August 2008 (has links) Not available / text Longitudinal method Missing observations (Statistics)
6	Classification in the missing data Zhang, Xin Unknown Date No description available. Missing observations (Statistics) Research -- Methodology
7	Estimation for counting processes with incomplete data / Zhang, Ying, January 1998 (has links) Thesis (Ph. D.)--University of Washington, 1998. / Vita. Includes bibliographical references (p. [121]-127).
8	Forecasting of work in process quality using Holt-Winters method for missing observations Kayande, Sarang R. January 1999 (has links) Thesis (M.S.)--West Virginia University, 1999. / Title from document title page. Document formatted into pages; contains x, 110 p. : ill. (some col.) Includes abstract. Includes bibliographical references (p. 90-91).
9	Random feature subspace ensemble based approaches for the analysis of data with missing features / Mohammed, Hussein Syed. January 2006 (has links) Thesis (M.S.)--Rowan University, 2006. / Typescript. Includes bibliographical references.
10	Visualizing incomplete data in multidimensional databases Peterson, Nina Marie. January 2006 (has links) (PDF) Thesis (M.S.)--Washington State University, May 2006. / Includes bibliographical references (p. 81-83).

Search results