1 |
MULTI-STATE MODELS WITH MISSING COVARIATESLou, Wenjie 01 January 2016 (has links)
Multi-state models have been widely used to analyze longitudinal event history data obtained in medical studies. The tools and methods developed recently in this area require the complete observed datasets. While, in many applications measurements on certain components of the covariate vector are missing on some study subjects. In this dissertation, several likelihood-based methodologies were proposed to deal with datasets with different types of missing covariates efficiently when applying multi-state models.
Firstly, a maximum observed data likelihood method was proposed when the data has a univariate missing pattern and the missing covariate is a categorical variable. The construction of the observed data likelihood function is based on the model of a joint distribution of the response longitudinal event history data and the discrete covariate with missing values.
Secondly, we proposed a maximum simulated likelihood method to deal with the missing continuous covariate when applying multi-state models. The observed data likelihood function was approximated by using the Monte Carlo simulation method.
At last, an EM algorithm was used to deal with multiple missing covariates when estimating the parameters of multi-state model. The EM algorithm would be able to handle multiple missing discrete covariates in general missing pattern efficiently.
All the proposed methods are justified by simulation studies and applications to the datasets from the SMART project, a consortium of 11 different high-quality longitudinal studies of aging and cognition.
|
2 |
Outils et modèles pour l'étude de quelques risques spatiaux et en réseaux : application aux extrêmes climatiques et à la contagion en finance / Tools and models for the study of some spatial and network risks : application to climate extremes and contagion in financeKoch, Erwan 02 July 2014 (has links)
Cette thèse s’attache à développer des outils et modèles adaptés a l’étude de certains risques spatiaux et en réseaux. Elle est divisée en cinq chapitres. Le premier consiste en une introduction générale, contenant l’état de l’art au sein duquel s’inscrivent les différents travaux, ainsi que les principaux résultats obtenus. Le Chapitre 2 propose un nouveau générateur de précipitations multi-site. Il est important de disposer de modèles capables de produire des séries de précipitations statistiquement réalistes. Alors que les modèles précédemment introduits dans la littérature concernent essentiellement les précipitations journalières, nous développons un modèle horaire. Il n’implique qu’une seule équation et introduit ainsi une dépendance entre occurrence et intensité, processus souvent considérés comme indépendants dans la littérature. Il comporte un facteur commun prenant en compte les conditions atmosphériques grande échelle et un terme de contagion auto-regressif multivarié, représentant la propagation locale des pluies. Malgré sa relative simplicité, ce modèle reproduit très bien les intensités, les durées de sècheresse ainsi que la dépendance spatiale dans le cas de la Bretagne Nord. Dans le Chapitre 3, nous proposons une méthode d’estimation des processus maxstables, basée sur des techniques de vraisemblance simulée. Les processus max-stables sont très adaptés à la modélisation statistique des extrêmes spatiaux mais leur estimation s’avère délicate. En effet, la densité multivariée n’a pas de forme explicite et les méthodes d’estimation standards liées à la vraisemblance ne peuvent donc pas être appliquées. Sous des hypothèses adéquates, notre estimateur est efficace quand le nombre d’observations temporelles et le nombre de simulations tendent vers l’infini. Cette approche par simulation peut être utilisée pour de nombreuses classes de processus max-stables et peut fournir de meilleurs résultats que les méthodes actuelles utilisant la vraisemblance composite, notamment dans le cas où seules quelques observations temporelles sont disponibles et où la dépendance spatiale est importante / This thesis aims at developing tools and models that are relevant for the study of some spatial risks and risks in networks. The thesis is divided into five chapters. The first one is a general introduction containing the state of the art related to each study as well as the main results. Chapter 2 develops a new multi-site precipitation generator. It is crucial to dispose of models able to produce statistically realistic precipitation series. Whereas previously introduced models in the literature deal with daily precipitation, we develop a hourly model. The latter involves only one equation and thus introduces dependence between occurrence and intensity; the aforementioned literature assumes that these processes are independent. Our model contains a common factor taking large scale atmospheric conditions into account and a multivariate autoregressive contagion term accounting for local propagation of rainfall. Despite its relative simplicity, this model shows an impressive ability to reproduce real intensities, lengths of dry periods as well as the spatial dependence structure. In Chapter 3, we propose an estimation method for max-stable processes, based on simulated likelihood techniques. Max-stable processes are ideally suited for the statistical modeling of spatial extremes but their inference is difficult. Indeed the multivariate density function is not available and thus standard likelihood-based estimation methods cannot be applied. Under appropriate assumptions, our estimator is efficient as both the temporal dimension and the number of simulation draws tend towards infinity. This approach by simulation can be used for many classes of max-stable processes and can provide better results than composite-based methods, especially in the case where only a few temporal observations are available and the spatial dependence is high
|
3 |
A new estimation approach for modeling activity-travel behavior : applications of the composite marginal likelihood approach in modeling multidimensional choicesFerdous, Nazneen 04 November 2011 (has links)
The research in the field of travel demand modeling is driven by the need to understand individuals’ behavior in the context of travel-related decisions as accurately as possible. In this regard, the activity-based approach to modeling travel demand has received substantial attention in the past decade, both in the research arena as well as in practice. At the same time, recent efforts have been focused on more fully realizing the potential of activity-based models by explicitly recognizing the multi-dimensional nature of activity-travel decisions. However, as more behavioral elements/dimensions are added, the dimensionality of the model systems tends to explode, making the estimation of such models all but infeasible using traditional inference methods. As a result, analysts and practitioners often trade-off between recognizing attributes that will make a model behaviorally more representative (from a theoretical viewpoint) and being able to estimate/implement a model (from a practical viewpoint).
An alternative approach to deal with the estimation complications arising from multi-dimensional choice situations is the technique of composite marginal likelihood (CML). This is an estimation technique that is gaining substantial attention in the statistics field, though there has been relatively little coverage of this method in transportation and other fields. The CML approach is a conceptually and pedagogically simpler simulation-free procedure (relative to traditional approaches that employ simulation techniques), and has the advantage of reproducibility of the results. Under the usual regularity assumptions, the CML estimator is consistent, unbiased, and asymptotically normally distributed.
The discussion above indicates that the CML approach has the potential to contribute in the area of travel demand modeling in a significant way. For example, the approach can be used to develop conceptually and behaviorally more appealing models to examine individuals’ travel decisions in a joint framework. The overarching goal of the current research work is to demonstrate the applicability of the CML approach in the area of activity-travel demand modeling and to highlight the enhanced features of the choice models estimated using the CML approach. The goal of the dissertation is achieved in three steps as follows: (1) by evaluating the performance of the CML approach in multivariate situations, (2) by developing multidimensional choice models using the CML approach, and (3) by demonstrating applications of the multidimensional choice models developed in the current dissertation. / text
|
Page generated in 0.0519 seconds