Global ETD Search

1	ENVELOPE MODEL FOR MULTIVARIATE LINEAR REGRESSION WITH ELLIPTICAL ERROR Alkan, Gunes, 0000-0001-9356-2173 January 2021 (has links) In recent years, the need for models which can accommodate higher order covariates have increased greatly. We first consider linear regression with vector-valued response Y and tensor-valued predictors X. Envelope models (Cook et al., 2010) can significantly improve the estimation efficiency of the regression coefficients by linking the regression mean with the covariance of the regression error. Most existing tensor regression models assume that the conditional distribution of Y given X follows a normal distribution, which may be violated in practice. In Chapter 2, we propose an envelope multivariate linear regression model with tensor-valued predictors and elliptically contoured error distributions. The proposed estimator is more robust to violations of the error normality assumption, and it is more efficient than the estimators without considering the underlying envelope structure. We compare the new proposal with existing estimators in extensive simulation studies. In Chapter 3, we explore how the missing data problem can be addressed for multivariate linear regression setting with envelopes and elliptical error. A popular and efficient approach, multiple imputation is implemented with bootstrapped expectation-maximization (EM) algorithm to fill the missing data, which is then followed with an adjustment in estimating regression coefficients. Simulations with synthetic data as well as real data are presented to establish the superiority of the adjusted multiple imputation method proposed. / Statistics Statistics Elliptical distributions Envelope model Tensor
2	Modelos multivariados binários com funções de ligação assimétricas / Multivariate binary regression models with asymmetric link functions Farias, Rafael Braz Azevedo 25 May 2012 (has links) Conjuntos de dados com respostas multivariadas aparecem frequentemente em pesquisas em que os dados são provenientes de questionários. Exemplos mais comuns são pesquisas de opinião, mais especificamente, pesquisas de marketing em que a preferência do consumidor em potencial é avaliado: pelo produto, marca, preço, praça, promoção e etc. Um tipo pesquisa de opinião que ganha grande destaque no Brasil de dois em dois anos são as pesquisas eleitorais de intenção de votos. Nós introduzimos nesta tese uma classe de modelos de regressão multivariados com funções de ligação assimétricas para o ajuste de conjuntos de dados com respostas multivariadas binárias. As funções de ligação consideradas são bastante flexíveis e robustas, contemplando funções de ligação simétricas como casos particulares. Devido a complexidade do modelo, nós discutimos a sua identificabilidade. A abordagem Bayesiana foi considerada e alguns algoritmos de Monte Carlo via Cadeia de Markov (MCMC) foram desenvolvidos. Nós descrevemos algumas ferramentas de seleção de modelos, os quais incluem o Critério de Informação da Deviance (DIC), a Pseudo-Verossimilhança Marginal e o Pseudo-Fator de Bayes. Adicionalmente, um estudo de simulação foi desenvolvido com dois objetivos; i) verificar a qualidade dos algoritmos desenvolvidos e ii) verificar a importância da escolha da função de ligação . No final da tese uma aplicação em um conjunto de dados real é considerada com o objetivo de ilustrar as metodologias e técnicas apresentadas. / Data sets with multivariate responses often appear in surveys where the data came from questionnaires. Opinion poll, sometimes simply referred to as a poll, are common examples of studies in which the responses are multivariate. One type poll that gain great prominence in Brazil in election years, is the survey of vote intent. However, despite the higher visibility of prognostic studies of election, opnion polls is a tool widely used to detect trends and positions of different social segments on various topics, be they political, social or governmental. We introduce in this work a class of multivariate regression models with asymmetric link functions to fit data sets with multivariate binary responses. The link functions here considered are quite flexible and robust, contemplating symmetrical link functions as special cases. Due to the complexity of the model, we discuss its identifiability. The Bayesian approach was considered and some Monte Carlo Markov Chain (MCMC) algorithms have been developed. Simulation studies have been developed with two objectives: i) verify the quality of the algorithms developed and ii) to verify the importance of choosing the link function. At the end of this work an application in a real data set is considered in order to illustrate the methodologies and techniques presented. binary models modelos binários modelos elípticos-assimétricos multivariate regression models regressão multivariada skew-elliptical distributions
3	Modelos multivariados binários com funções de ligação assimétricas / Multivariate binary regression models with asymmetric link functions Rafael Braz Azevedo Farias 25 May 2012 (has links) Conjuntos de dados com respostas multivariadas aparecem frequentemente em pesquisas em que os dados são provenientes de questionários. Exemplos mais comuns são pesquisas de opinião, mais especificamente, pesquisas de marketing em que a preferência do consumidor em potencial é avaliado: pelo produto, marca, preço, praça, promoção e etc. Um tipo pesquisa de opinião que ganha grande destaque no Brasil de dois em dois anos são as pesquisas eleitorais de intenção de votos. Nós introduzimos nesta tese uma classe de modelos de regressão multivariados com funções de ligação assimétricas para o ajuste de conjuntos de dados com respostas multivariadas binárias. As funções de ligação consideradas são bastante flexíveis e robustas, contemplando funções de ligação simétricas como casos particulares. Devido a complexidade do modelo, nós discutimos a sua identificabilidade. A abordagem Bayesiana foi considerada e alguns algoritmos de Monte Carlo via Cadeia de Markov (MCMC) foram desenvolvidos. Nós descrevemos algumas ferramentas de seleção de modelos, os quais incluem o Critério de Informação da Deviance (DIC), a Pseudo-Verossimilhança Marginal e o Pseudo-Fator de Bayes. Adicionalmente, um estudo de simulação foi desenvolvido com dois objetivos; i) verificar a qualidade dos algoritmos desenvolvidos e ii) verificar a importância da escolha da função de ligação . No final da tese uma aplicação em um conjunto de dados real é considerada com o objetivo de ilustrar as metodologias e técnicas apresentadas. / Data sets with multivariate responses often appear in surveys where the data came from questionnaires. Opinion poll, sometimes simply referred to as a poll, are common examples of studies in which the responses are multivariate. One type poll that gain great prominence in Brazil in election years, is the survey of vote intent. However, despite the higher visibility of prognostic studies of election, opnion polls is a tool widely used to detect trends and positions of different social segments on various topics, be they political, social or governmental. We introduce in this work a class of multivariate regression models with asymmetric link functions to fit data sets with multivariate binary responses. The link functions here considered are quite flexible and robust, contemplating symmetrical link functions as special cases. Due to the complexity of the model, we discuss its identifiability. The Bayesian approach was considered and some Monte Carlo Markov Chain (MCMC) algorithms have been developed. Simulation studies have been developed with two objectives: i) verify the quality of the algorithms developed and ii) to verify the importance of choosing the link function. At the end of this work an application in a real data set is considered in order to illustrate the methodologies and techniques presented. modelos binários modelos elípticos-assimétricos regressão multivariada binary models multivariate regression models skew-elliptical distributions
4	Dependency Measures and Copulas for Multivariate Infinitely Divisible Distributions Maddox, Wesley J. 02 June 2017 (has links) No description available. Statistics
5	Extrêmes multivariés et spatiaux : approches spectrales et modèles elliptiques / Multivariate and spatial extremes : spectral approaches and elliptical models Opitz, Thomas 30 October 2013 (has links) Cette thèse présente des contributions à la modélisation multivariée et spatiale des valeurs extrêmes. Au travers d'une extension de la représentation par coordonnées pseudo-polaires, représentation très utilisée en théorie des valeurs extrêmes, une approche unifiée et générale pour la modélisation en valeurs extrêmes est proposée. La variable radiale de ces coordonnées est donnée par une fonction non négative et homogène dite fonction d'agrégation permettant d'agréger un vecteur dans un scalaire. La loi de la variable d'angle est caractérisée par une mesure dite angulaire ou spectrale. Nous définissons les lois radiales de Pareto et une version inversée de ces lois, toutes deux motivées dans le cadre de la variation régulière multivariée. Cette classe de modèles est assez souple et permet de modéliser les valeurs extrêmes de vecteurs aléatoires dont la variable agrégée est à décroissance de type Pareto ou Pareto inversé. Dans le cadre spatial, nous mettons l'accent sur les lois bivariées à l'instar des méthodes couramment utilisées. Des approches inférentielles originales sont développées, fondées sur un nouvel outil de représentation appelé spectrogramme. Le spectrogramme est constitué des mesures spectrales caractérisant le comportement extrémalbivarié. Enfin, la construction dite spectrale du processus limite max-stable des processus elliptiques, à savoir le processus t-extrémal, est présentée. Par ailleurs, nous énonçons des méthodesd'inférence et explorons des méthodes de simulation des processus de type max-stable et de type Pareto. L'intérêt pratique des modèles et méthodes proposés est illustré au travers d'applications à des données environnementales et financières. / This PhD thesis presents contributions to the modelling of multivariate andspatial extreme values. Using an extension of commonly used pseudo-polar representations inextreme value theory, we propose a general unifying approachto modelling of extreme value dependence. The radial variable of such coordinates is obtained from applying a nonnegative and homogeneous function, called aggregation function, allowing us to aggregate a vector into a scalar value. The distribution of the angle component is characterized by a so-called angular or spectral measure. We define radial Pareto distribution and an inverted version of thesedistributions, both motivated within the framework of multivariateregular variation. This flexible class of models allows for modelling of extreme valuesin random vectors whose aggregated variable shows tail decay of thePareto or inverted Pareto type. For the purpose of spatial extreme value analysis, we follow standard methodology in geostatistics of extremes and put the focus on bivariatedistributions. Inferentialapproaches are developed based on the notion of a spectrogram,a tool composed of thespectral measures characterizing bivariate extreme value behavior. Finally, the so-called spectral construction of the max-stable limit processobtained from elliptical processes, known as extremal-t process, ispresented. We discuss inference and explore simulation methods for the max-stableprocess and the corresponding Pareto process. The utility of the proposed models and methods is illustrated throughapplications to environmental and financial data. Extrêmes multivariés Extrêmes spatiaux Lois elliptiques Mesure spectrale Processus de Pareto Multivariate extremes Spatial extremes Elliptical distributions Spectral measure Pareto process
6	Rank Estimation in Elliptical Models : Estimation of Structured Rank Covariance Matrices and Asymptotics for Heteroscedastic Linear Regression Kuljus, Kristi January 2008 (has links) This thesis deals with univariate and multivariate rank methods in making statistical inference. It is assumed that the underlying distributions belong to the class of elliptical distributions. The class of elliptical distributions is an extension of the normal distribution and includes distributions with both lighter and heavier tails than the normal distribution. In the first part of the thesis the rank covariance matrices defined via the Oja median are considered. The Oja rank covariance matrix has two important properties: it is affine equivariant and it is proportional to the inverse of the regular covariance matrix. We employ these two properties to study the problem of estimating the rank covariance matrices when they have a certain structure. The second part, which is the main part of the thesis, is devoted to rank estimation in linear regression models with symmetric heteroscedastic errors. We are interested in asymptotic properties of rank estimates. Asymptotic uniform linearity of a linear rank statistic in the case of heteroscedastic variables is proved. The asymptotic uniform linearity property enables to study asymptotic behaviour of rank regression estimates and rank tests. Existing results are generalized and it is shown that the Jaeckel estimate is consistent and asymptotically normally distributed also for heteroscedastic symmetric errors. elliptical distributions multivariate ranks rank covariance matrix linear rank regression heteroscedastic errors linear rank statistics Mathematical statistics Matematisk statistik
7	Aplicações estatísticas na área industrial / Statistical applications in the industrial area Silva, Gecirlei Francisco da 10 June 2009 (has links) Apresentamos algumas aplicações de ferramentas estatísticas que são comumente utilizadas na melhoria da qualidade de processos industriais. Inicialmente, desenvolveu-se procedimentos para testar a competência de laboratórios que participam de programas de ensaios de proficiência. Em situações onde os laboratórios medem várias vezes no mesmo ponto, utilizou-se o modelo de erros de medição, proposto por Jaech [39](1985). Além disso, a inferência sobre os parâmetros de tendência aditiva foi generalizada para a classe de distribuições elípticas. A competência dos laboratórios é avaliada pelo teste da razão de verossimilhança generalizada, do qual, obtemos a distribuição exata para a estatística proposta. Em situações onde os laboratórios medem várias vezes em vários pontos e a variável em análise apresenta variações naturais, utilizou-se o modelo com erro nas variáveis. Diante disso, vamos estender o modelo estrutural definido em Barnett [13] (1969) para o modelo ultra-estrutural com réplicas. Neste caso, vamos avaliar não somente a tendência aditiva, mas também, a tendência multiplicativa, ou seja, avaliar a linearidade das medições. As estimativas dos parâmetros foram obtidas via procedimento do algorítmo EM, com isso, desenvolvemos os teste de Wald, razão de verossimilhança e escore para avaliar a competência dos laboratórios. Nos dois modelos propostos, generalizamos o erro normalizado (En) sugerido pelo Guia 43 [37] para testar a competência dos laboratórios participantes de programas de ensaio de proficiência. Apresentamos também, um procedimento para calcular índices de performance para processos univariados e multivariados. Nestes casos, consideramos que a distribuição dos dados segue uma distribuição Normal assimétrica. Além disso, apresentamos uma análise de simulação onde concluímos que a presença de assimetria nos dados pode causar interpretações erradas sobre o processo, quando a distribuição assumida para os dados é a Normal / We present some applications of statistical tools that are used in the improvement of the quality of industrial processes. Initially, we develop procedures to test the ability of laboratories that participate of programs of proficiency test. In situations where the laboratories measure several times in the same point, we use the model of errors of measurement, considered for Jaech [39](1985). Moreover, the inference on the parameters additive bias was generalized for the class of elliptical distributions. The ability of the laboratories is evaluated by the generalized likelihood ratio test, of which, we get the accurate distribution for the statistics proposal. In situations where the laboratories measure some times in some points and the variable in analysis presents natural variations, uses the model with error in the variable. With this, we go to extend the model structural defined in Barnett [13] (1969) for the ultrastructural model with replicate. In this case, we go to not only evaluate the bias additive, but also, the bias multiplicative, that is, to evaluate the linearity of the measurements. The estimates of the parameters had been gotten by the procedure of the EM algorithm, with this, develop of Wald, likelihood ratio and score test to evaluate the ability of the laboratories. In the two considered models, we generalize the normalized error (En) suggested for Guide 43 [37] to test the ability of the participant laboratories of programs of proficiency test. We also present, a procedure to calculate index of performance for univariate and multivariate processes. In these cases, we consider that the distribution of the data follows a skew Normal distribution. Moreover, we present a simulation analysis where we conclude that the presence of asymmetry in the data can cause interpretations missed on the process, when the distribution assumed for the data is the Normal Análise da capacidade de processos Analysis of the capacity of process Comparações interlaboratoriais Distribuição normal assimétrica Distribuições elípticas Elliptical distributions Ensaios de proficiência Interlaboratorial comparisons Skew normal distributions Test of proficiency
8	Modelos mistos aditivos semiparamétricos de contornos elípticos / Elliptical contoured semiparametric additive mixed models. Pulgar, Germán Mauricio Ibacache 14 August 2009 (has links) Neste trabalho estendemos os modelos mistos semiparamétricos propostos por Zhang et al. (1998) para uma classe mais geral de modelos, a qual denominamos modelos mistos aditivos semiparamétricos com erros de contornos elípticos. Com essa nova abordagem, flexibilizamos a curtose da distribuição dos erros possibilitando a escolha de distribuições com caudas mais leves ou mais pesadas do que as caudas da distribuição normal padrão. Funções de verossimilhança penalizadas são aplicadas para a obtenção das estimativas de máxima verossimilhança com os respectivos erros padrão aproximados. Essas estimativas, sob erros de caudas pesadas, são robustas no sentido da distância de Mahalanobis contra observações aberrantes. Curvaturas de influência local são obtidas segundo alguns esquemas de perturbação e gráficos de diagnóstico são propostos. Exemplos ilustrativos são apresentados em que ajustes sob erros normais são comparados, através das metodologias de sensibilidade desenvolvidas no trabalho, com ajustes sob erros de contornos elípticos. / In this work we extend the models proposed by Zhang et al. (1998) to a more general class of models, know as semiparametric additive mixed models with elliptical errors in order to allow distributions with heavier or lighter tails than the normal ones. Penalized likelihood equations are applied to derive the maximum likelihood estimates which appear to be robust against outlying observations in the sense of the Mahalanobis distance. In order to study the sensitivity of the penalized estimates under some usual perturbation schemes in the model or data, the local influence curvatures are derived and some diagnostic graphics are proposed. Motivating examples preliminary analyzed under normal errors are reanalyzed under some appropriate elliptical errors. The local influence approach is used to compare the sensitivity of the model estimates. Distribuições elípticas Elliptical distributions estimativas robustas influência local. local influence method. modelos não paramétricos non-parametric models penalized likelihood estimates robust estimates
9	Análise bayesiana do modelo fatorial dinâmico para um vetor de séries temporais usando distribuições elípticas. / Bayesian Analysis of the dynamic factorial models for a time series vector using elliptical distribuitions. Borges, Livia Costa 27 May 2008 (has links) A análise fatorial é uma importante ferramenta estatística que tem amplas aplicações práticas e explica a correlação entre um grande número de variáveis observáveis em termos de um pequeno número de variáveis não observáveis, conhecidas como variáveis latentes. A proposta deste trabalho é fazer a análise Bayesiana, que incorpora à análise o conhecimento que se tenha sobre os parâmetros antes da coleta dos dados, do modelo fatorial dinâmico na classe de modelos elípticos multivariados, assumindo que a um vetor de q séries temporais pode-se ajustar um modelo fatorial com k < q fatores mais um ruído branco, e que a parte latente segue um modelo vetorial auto-regressivo. A classe de modelos elípticos citada acima é rica em distribuições simétricas com caudas mais pesadas que as da distribuição normal, característica importante na análise de séries financeiras. Essa classe inclui as distribuições t de Student, exponencial potência, normal contaminada, entre outras. A inferência sobre os parâmetros foi feita utilizando métodos de Monte Carlo via Cadeias de Markov, com os algoritmos Metropolis-Hastings e Griddy-Gibbs, através da obtenção das distribuições a posteriori dos parâmetros e dos fatores. A determinação da convergência do processo foi feita por técnicas gráficas e pelos métodos de Geweke (1992), de Heidelberger e Welch (1983) e Half-Width. O método foi ilustrado usando dados reais e simulados. / The factor analysis is an important statistical tool that has wide practical applications and it explains the correlation among a large number of observable variables in terms of a small number of unobservable variables, known as latent variables. The proposal of this work is the Bayesian analysis, which incorporates the information we have concerning the parameters before collecting data into the analysis of a dynamical factor model in the class of multivariate elliptical models, where the factors follow a multivariate autoregressive model, assuming that a vector of q time series can be adjusted with k < q factors and a white noise. The class of elliptical models is rich in symmetrical distributions with heavier tails than the normal distribution, which is an important characteristic in financial series analysis. This class includes t-Student, power exponential, contaminated normal and other distributions. The parameters inference was made through Monte Carlo Markov Chain methods, with Metropolis-Hastings and Griddy-Gibbs algorithms, by obtaining the parameters and factors posteriori distributions. The convergence process was made through graphical technics and by Geweke (1992) and by Heidelberger and Welch (1983) and Half- Width methods. The method was illustrated using simulated and real data. Anállise Bayesiana Baysian analysis distribuições elípticas elliptical distributions factorial model Metropolis-Hastings Metropolis-Hastings< modelo fatorial séries temporais time series
10	Estimation de mesures de risque pour des distributions elliptiques conditionnées / Estimation of risk measures for conditioned elliptical distributions Usseglio-Carleve, Antoine 26 June 2018 (has links) Cette thèse s'intéresse à l'estimation de certaines mesures de risque d'une variable aléatoire réelle Y en présence d'une covariable X. Pour cela, on va considérer que le vecteur (X,Y) suit une loi elliptique. Dans un premier temps, on va s'intéresser aux quantiles de Y sachant X=x. On va alors tester d'abord un modèle de régression quantile assez répandu dans la littérature, pour lequel on obtient des résultats théoriques que l'on discutera. Face aux limites d'un tel modèle, en particulier pour des niveaux de quantile dits extrêmes, on proposera une nouvelle approche plus adaptée. Des résultats asymptotiques sont donnés, appuyés par une étude numérique puis par un exemple sur des données réelles. Dans un second chapitre, on s'intéressera à une autre mesure de risque appelée expectile. La structure du chapitre est sensiblement la même que celle du précédent, à savoir le test d'un modèle de régression inadapté aux expectiles extrêmes, pour lesquels on propose une approche méthodologique puis statistique. De plus, en mettant en évidence le lien entre les quantiles et expectiles extrêmes, on s'aperçoit que d'autres mesures de risque extrêmes sont étroitement liées aux quantiles extrêmes. On se concentrera sur deux familles appelées Lp-quantiles et mesures d'Haezendonck-Goovaerts, pour lesquelles on propose des estimateurs extrêmes. Une étude numérique est également fournie. Enfin, le dernier chapitre propose quelques pistes pour traiter le cas où la taille de la covariable X est grande. En constatant que nos estimateurs définis précédemment étaient moins performants dans ce cas, on s'inspire alors de quelques méthodes d'estimation en grande dimension pour proposer d'autres estimateurs. Une étude numérique permet d'avoir un aperçu de leurs performances / This PhD thesis focuses on the estimation of some risk measures for a real random variable Y with a covariate vector X. For that purpose, we will consider that the random vector (X,Y) is elliptically distributed. In a first time, we will deal with the quantiles of Y given X=x. We thus firstly investigate a quantile regression model, widespread in the litterature, for which we get theoretical results that we discuss. Indeed, such a model has some limitations, especially when the quantile level is said extreme. Therefore, we propose another more adapted approach. Asymptotic results are given, illustrated by a simulation study and a real data example.In a second chapter, we focus on another risk measure called expectile. The structure of the chapter is essentially the same as that of the previous one. Indeed, we first use a regression model that is not adapted to extreme expectiles, for which a methodological and statistical approach is proposed. Furthermore, highlighting the link between extreme quantiles and expectiles, we realize that other extreme risk measures are closely related to extreme quantiles. We will focus on two families called Lp-quantiles and Haezendonck-Goovaerts risk measures, for which we propose extreme estimators. A simulation study is also provided. Finally, the last chapter is devoted to the case where the size of the covariate vector X is tall. By noticing that our previous estimators perform poorly in this case, we rely on some high dimensional estimation methods to propose other estimators. A simulation study gives a visual overview of their performances Distributions elliptiques Quantiles extrêmes Expectiles Régression quantile Régression expectile Théorie des valeurs extrêmes Elliptical distributions Extreme quantiles Expectiles Quantile regression Expectile regression Extreme value theory 510

Search results