Mixture model cluster analysis under different covariance structures using information complexity

This thesis develops and presents a mixture-model cluster analysis technique under different covariance structures of the component densities. The technique captures the compactness, orientation, shape, and volume of the component clusters in a single expert system for heterogeneous, high-dimensional Gaussian data sets, adding flexibility to currently practiced cluster analysis techniques. Two approaches to parameter estimation are considered and compared: one using the Expectation-Maximization (EM) algorithm and another following a Bayesian framework using the Gibbs sampler. We develop and score several forms of the ICOMP criterion of Bozdogan (1994, 2004) as our fitness function to choose the number of component clusters, to choose the correct component covariance matrix structure among nine candidate covariance structures, and to select the optimal parameters and the best-fitting mixture model. We demonstrate our approach on simulated data sets and on a large real data set, focusing on early detection of breast cancer. We show that our approach reduces the probability of classification error relative to existing methods.
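
As a rough illustration of the "information complexity" idea behind ICOMP, the sketch below computes a first-order covariance complexity measure, assuming the commonly cited form C1(Sigma) = (s/2) log(tr(Sigma)/s) - (1/2) log|Sigma| for an s-by-s covariance matrix. The function names and the example matrices are hypothetical and are not taken from the thesis; the ICOMP variants actually scored there may differ.

```python
import math


def log_det_spd(m):
    """Log-determinant of a symmetric positive-definite matrix via Cholesky."""
    n = len(m)
    chol = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = m[i][j] - sum(chol[i][k] * chol[j][k] for k in range(j))
            if i == j:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                chol[i][j] = math.sqrt(s)
            else:
                chol[i][j] = s / chol[j][j]
    # det(m) is the squared product of the Cholesky diagonal.
    return 2.0 * sum(math.log(chol[i][i]) for i in range(n))


def c1_complexity(cov):
    """Assumed first-order complexity: (s/2) log(tr/s) - (1/2) log det."""
    s = len(cov)
    trace = sum(cov[i][i] for i in range(s))
    return 0.5 * s * math.log(trace / s) - 0.5 * log_det_spd(cov)


# Hypothetical 2-D component covariances (illustration only).
spherical = [[2.0, 0.0], [0.0, 2.0]]    # equal variances, no correlation
correlated = [[2.0, 1.5], [1.5, 2.0]]   # same variances, strong correlation

print(c1_complexity(spherical))
print(c1_complexity(correlated))
```

Under this assumed form, the measure is zero when all eigenvalues of the covariance matrix are equal and grows as the variables become more correlated or unequally scaled, which is why a criterion built on it can penalize an overly elaborate covariance structure.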

http://trace.tennessee.edu/utk_gradthes/968

Gaussian mixture

model-based clustering

information complexity

Gibbs sampler

eigenvalue decomposition

Multivariate Analysis

Statistical Models

Identifier	oai:union.ndltd.org:UTENN/oai:trace.tennessee.edu:utk_gradthes-2096
Date	01 August 2011
Creators	Erar, Bahar
Publisher	Trace: Tennessee Research and Creative Exchange
Source Sets	University of Tennessee Libraries
Detected Language	English
Type	text
Format	application/pdf
Source	Masters Theses
