1 |
Latent variable models for longitudinal twin dataDominicus, Annica January 2006 (has links)
<p>Longitudinal twin data provide important information for exploring sources of variation in human traits. In statistical models for twin data, unobserved genetic and environmental factors influencing the trait are represented by latent variables. In this way, trait variation can be decomposed into genetic and environmental components. With repeated measurements on twins, latent variables can be used to describe individual trajectories, and the genetic and environmental variance components are assessed as functions of age. This thesis contributes to statistical methodology for analysing longitudinal twin data by (i) exploring the use of random change point models for modelling variance as a function of age, (ii) assessing how nonresponse in twin studies may affect estimates of genetic and environmental influences, and (iii) providing a method for hypothesis testing of genetic and environmental variance components. The random change point model, in contrast to linear and quadratic random effects models, is shown to be very flexible in capturing variability as a function of age. Approximate maximum likelihood inference through first-order linearization of the random change point model is contrasted with Bayesian inference based on Markov chain Monte Carlo simulation. In a set of simulations based on a twin model for informative nonresponse, it is demonstrated how the effect of nonresponse on estimates of genetic and environmental variance components depends on the underlying nonresponse mechanism. This thesis also reveals that the standard procedure for testing variance components is inadequate, since the null hypothesis places the variance components on the boundary of the parameter space. The asymptotic distribution of the likelihood ratio statistic for testing variance components in classical twin models is derived, resulting in a mixture of chi-square distributions. Statistical methodology is illustrated with applications to empirical data on cognitive function from a longitudinal twin study of aging. </p>
|
2 |
Latent variable models for longitudinal twin dataDominicus, Annica January 2006 (has links)
Longitudinal twin data provide important information for exploring sources of variation in human traits. In statistical models for twin data, unobserved genetic and environmental factors influencing the trait are represented by latent variables. In this way, trait variation can be decomposed into genetic and environmental components. With repeated measurements on twins, latent variables can be used to describe individual trajectories, and the genetic and environmental variance components are assessed as functions of age. This thesis contributes to statistical methodology for analysing longitudinal twin data by (i) exploring the use of random change point models for modelling variance as a function of age, (ii) assessing how nonresponse in twin studies may affect estimates of genetic and environmental influences, and (iii) providing a method for hypothesis testing of genetic and environmental variance components. The random change point model, in contrast to linear and quadratic random effects models, is shown to be very flexible in capturing variability as a function of age. Approximate maximum likelihood inference through first-order linearization of the random change point model is contrasted with Bayesian inference based on Markov chain Monte Carlo simulation. In a set of simulations based on a twin model for informative nonresponse, it is demonstrated how the effect of nonresponse on estimates of genetic and environmental variance components depends on the underlying nonresponse mechanism. This thesis also reveals that the standard procedure for testing variance components is inadequate, since the null hypothesis places the variance components on the boundary of the parameter space. The asymptotic distribution of the likelihood ratio statistic for testing variance components in classical twin models is derived, resulting in a mixture of chi-square distributions. Statistical methodology is illustrated with applications to empirical data on cognitive function from a longitudinal twin study of aging.
|
3 |
Cluster-based lack of fit tests for nonlinear regression modelsMunasinghe, Wijith Prasantha January 1900 (has links)
Doctor of Philosophy / Department of Statistics / James W. Neill / Checking the adequacy of a proposed parametric nonlinear regression model is important
in order to obtain useful predictions and reliable parameter inferences. Lack of fit is said to
exist when the regression function does not adequately describe the mean of the response
vector. This dissertation considers asymptotics, implementation and a comparative performance
for the likelihood ratio tests suggested by Neill and Miller (2003). These tests use
constructed alternative models determined by decomposing the lack of fit space according to
clusterings of the observations. Clusterings are selected by a maximum power strategy and a
sequence of statistical experiments is developed in the sense of Le Cam. L2 differentiability
of the parametric array of probability measures associated with the sequence of experiments
is established in this dissertation, leading to local asymptotic normality. Utilizing contiguity,
the limit noncentral chi-square distribution under local parameter alternatives is then
derived. For implementation purposes, standard linear model projection algorithms are
used to approximate the likelihood ratio tests, after using the convexity of a class of fuzzy
clusterings to form a smooth alternative model which is necessarily used to approximate the
corresponding maximum optimal statistical experiment. It is demonstrated empirically that
good power can result by allowing cluster selection to vary according to different points along
the expectation surface of the proposed nonlinear regression model. However, in some cases,
a single maximum clustering suffices, leading to the development of a Bonferroni adjusted
multiple testing procedure. In addition, the maximin clustering based likelihood ratio tests
were observed to possess markedly better simulated power than the generalized likelihood
ratio test with semiparametric alternative model presented by Ciprian and Ruppert (2004).
|
Page generated in 0.0909 seconds