Global ETD Search

1	Contributions to the analysis of dispersed count data / Contribuições à análise de dados de contagem Ribeiro Junior, Eduardo Elias 18 February 2019 (has links) In many agricultural and biological contexts, the response variable is a nonnegative integer value which we wish to explain or analyze in terms of a set of covariates. Unlike the Gaussian linear model, the response variable is discrete with a distribution that places probability mass at natural numbers only. The Poisson regression is the standard model for count data. However, assumptions of this model forces the equality between mean and variance, which may be implausible in many applications. Motivated by experimental data sets, this work intended to develop more realistic methods for the analysis of count data. We proposed a novel parametrization of the COM-Poisson distribution and explored the regression models based on it. We extended the model to allow the dispersion, as well as the mean, depending on covariates. A set of count statistical models, namely COM-Poisson, Gamma-count, discrete Weibull, generalized Poisson, double Poisson and Poisson-Tweedie, was reviewed and compared, considering the dispersion, zero-inflation, and heavy tail indexes, together with the results of data analyzes. The computational routines developed in this dissertation were organized in two R packages available on GitHub. / Em diversos estudos agrícolas e biológicos, a variável resposta é um número inteiro não negativo que desejamos explicar ou analisar em termos de um conjunto de covariáveis. Diferentemente do modelo linear Gaussiano, a variável resposta é discreta com distribuição de probabilidade definida apenas em valores do conjunto dos naturais. O modelo Poisson é o modelo padrão para dados em forma de contagens. No entanto, as suposições desse modelo forçam que a média seja igual a variância, o que pode ser implausível em muitas aplicações. Motivado por conjuntos de dados experimentais, este trabalho teve como objetivo desenvolver métodos mais realistas para a análise de contagens. Foi proposta uma nova reparametrização da distribuição COM-Poisson e explorados modelos de regressão baseados nessa distribuição. Uma extensão desse modelo para permitir que a dispersão, assim como a média, dependa de covariáveis, foi proposta. Um conjunto de modelos para contagens, nomeadamente COM-Poisson, Gamma-count, Weibull discreto, Poisson generalizado, duplo Poisson e Poisson-Tweedie, foi revisado e comparado, considerando os índices de dispersão, inflação de zero e cauda pesada, juntamente com os resultados de análises de dados. As rotinas computacionais desenvolvidas nesta dissertação foram organizadas em dois pacotes R disponíveis no GitHub. Count data Dados de contagens Discrete probability models Dispersão variável Inferência baseada em verossimilhança Likelihood-based inference Modelos probabilísticos discretos Overdispersion Subdispersão Superdipersão Underdispersion Varying dispersion
2	Profitabilita životních smluv a složené GLM / Profitability of life policies and compound GLM Kostka, Ján January 2022 (has links) Life insurance policies are not equally profitable is sense of expected value. In practice, profitability is an output of complex cash flow models, which need utilizing special systems and the run time of such calculation can be significant if number of policies is high. Therefore we consider variables, which change most frequently, stimulate the profitability model with several values of these variables and then we search for a regression model to explain the changes. We apply Gamma regression on the data. But what if there exist some policies which are negative? Then we determine these policies with logistic regression applied on data censored to the binary form. Loss of these policies is modelled using symmetrical Gamma model. These three models, when considered together, can be viewed as a single model, which is a generalization of the well known zero inflated count model. The most interesting part of inference in such model is diagnostics. We show that the basic types of residuals - Pearson, deviance and quantile - can be defined. We also build an ordinary linear model and we compare utility of these two approaches. While building models, we meet various statistical issues like dimension reduction of yield curve or dispersion proportional to sum insured. 1

Search results

Contributions to the analysis of dispersed count data / Contribuições à análise de dados de contagem

Profitabilita životních smluv a složené GLM / Profitability of life policies and compound GLM