Return to search

Comparing Multivariate Regression Methods For Compositional Data : Through Simulation Studies & Applications

Compositional data, where measurements are vectors with each component constituting a percentage of a whole, is abundant throughout many disciplines of science. Consequently, there is a strong need to establish valid statistical procedures for this type of data. In this work the basic theory of the compositional sample space is presented and through simulation studies and a case study on data from industrial applications, the current available methods for regression as applied to compositional data are evaluated. The main focus of this work is to establish linear regression in a way compatible with compositional data sets and compare this approach with the alternative of applying standard multivariate regression methods on raw compositional data. It is found that for several data sets, the difference between 'naive' multivariate linear regression and compositional linear regression is negligible; while for others (in particular where the dependence of covariates is not strictly linear) the compositional regression methods are shown to be stronger.

Identiferoai:union.ndltd.org:UPSALLA1/oai:DiVA.org:umu-138463
Date January 2017
CreatorsLångström, Christoffer
PublisherUmeå universitet, Institutionen för matematik och matematisk statistik
Source SetsDiVA Archive at Upsalla University
LanguageEnglish
Detected LanguageEnglish
TypeStudent thesis, info:eu-repo/semantics/bachelorThesis, text
Formatapplication/pdf
Rightsinfo:eu-repo/semantics/openAccess

Page generated in 0.0141 seconds