Global ETD Search

41	Estimação dos parâmetros da distribuição beta bivariada: aplicações em severidade de doenças em plantas / Parameters estimation of beta bivariate distribution: applications in disease severity in plants Barros, Otávio Akira de 04 December 2015 (has links) A distribuição beta é apropriada para analisar dados de variáveis medidas no intervalo (0, 1), como taxas e proporções, como por exemplo a proporção de severidade de doenças em plantas. Portanto, dados que são pares observações de taxas e proporções, naturalmente pensa-se numa distribuição beta bivariada com suporte (0, 1)2. O objetivo deste trabalho constitui-se em encontrar a melhor distribuição beta bivariada na literatura para este caso e, além disso, tentar encontrar estimadores para seus parâmetros, a fim de verificar se esta distribuição escolhida se ajusta bem aos dados. Foi criada uma metodologia para a estimação dos parâmetros, utilizando aquela distribuição que consideramos a mais adequada. Posteriormente foram feitas simulações para avaliar a qualidade desses estimadores e, por fim, foram utilizados três bancos de dados com a finalidade de exemplificar esta metodologia. / Beta distribution is suitable for analyzing variable data measured in the range (0, 1), as rates and proportions, such as the proportion of disease severity in plants. Therefore, data that are paired observations rates and proportions naturally thinks in a bivariate distribution beta supported (0, 1)2. The objective of this work is on finding the best beta bivariate distribution in the literature for this case and, furthermore, try to find estimators for its parameters in order to verify that this chosen distribution fits the data well. A methodology was created for the estimation of parameters using that distribution we consider the most appropriate. Later simulations were performed to evaluate the quality of these estimators and, finally, we use three databases in order to illustrate this methodology. Beta bivariate distribution Disease severity in plants Distribuição beta bivariada Estimação Estimation Severidade de doença em plantas
42	A Sensitivity Analysis of a Nonignorable Nonresponse Model Via EM Algorithm and Bootstrap Zong, Yujie 15 April 2011 (has links) The Slovenian Public Opinion survey (SPOS), which carried out in 1990, was used by the government of Slovenia as a benchmark to prepare for an upcoming plebiscite, which asked the respondents whether they support independence from Yugoslavia. However, the sample size was large and it is quite likely that the respondents and nonrespondents had divergent viewpoints. We first develop an ignorable nonresponse model which is an extension of a bivariate binomial model. In order to accommodate the nonrespondents, we then develop a nonignorable nonresponse model which is an extension of the ignorable model. Our methodology uses an EM algorithm to fit both the ignorable and nonignorable nonresponse models, and estimation is carried out using the bootstrap mechanism. We also perform sensitivity analysis to study different degrees of departures of the nonignorable nonresponse model from the ignorable nonresponse model. We found that the nonignorable nonresponse model is mildly sensitive to departures from the ignorable nonresponse model. In fact, our finding based on the nonignorable model is better than an earlier conclusion about another nonignorable nonresponse model fitted to these data. Bivariate binomial distribution Bootstrap EM algorithm Missing not at random Multinomial model 2X2 categorical tables
43	Análise de contagens multivariadas. / Multivariate count analysis. Ho, Linda Lee 15 September 1995 (has links) Este trabalho apresenta uma análise estatística de contagens multivariadas proveniente de várias populações através de modelos de regressão. Foram considerados casos onde os vetores respostas obedeçam às distribuições Poisson multivariada e Poisson log-normal multivariada. Esta distribuição admite correlação de ambos sinais entre componentes do vetor resposta, enquanto que as distribuições mais usuais para dados de contagens (como a Poisson multivariada) admitem apenas correlação positiva entre as componentes do vetor resposta. São discutidos métodos de estimação e testes de hipóteses sobre os parâmetros do modelo para o caso bivariado. Estes modelos de regressão foram aplicados a um conjunto de dados referentes a contagens de dois tipos de defeitos em 100 gramas de fibras têxteis de quatro máquinas craqueadeiras, sendo duas de um fabricante e as outras de um segundo fabricante. Os resultados obtidos nos diferentes modelos de regressão foram comparados. Para estudar o comportamento das estimativas dos parâmetros de uma distribuição Poisson Log-Normal, amostras foram simuladas segundo esta distribuição. / Regression models are presented to analyse multivariate counts from many populations. Due to the random vector characteristic, we consider two classes of probability models: Multivariate Poisson distribution and Multivariate Poisson Log-Normal distribution. The last distribution admits negative and positive correlations between two components of a random vector under study, while other distributions (as Multivariate Poisson) admit only positive correlation. Estimation methods and test of hypothese on the parameters in bivariate case are discussed. The proposed techniques are illustrated by numerical examples, considering counts of two types of defects in 100g of textile fibers produced by four machines, two from one manufacturer and the other two from another one. The results from different regression models are compared. The empirical distribution of Poisson Log-Normal parameter estimations are studied by simulated samples. Bivariate Poisson distribution Engenharia de produção Modelo de regressão Poisson bivariada Regression model for counts
44	Modelos de regressão bivariada: uma aplicação em equações mincerianas de rendimento / Bivariate regression models: an application to mincerian earnings equations Cunha, Danúbia Rodrigues da 08 February 2018 (has links) Submitted by Franciele Moreira (francielemoreyra@gmail.com) on 2018-03-28T12:09:11Z No. of bitstreams: 2 Dissertação - Danúbia Rodrigues da Cunha - 2018.pdf: 7944363 bytes, checksum: f0229888d666e1d8367f8c31ee27a238 (MD5) license_rdf: 0 bytes, checksum: d41d8cd98f00b204e9800998ecf8427e (MD5) / Approved for entry into archive by Luciana Ferreira (lucgeral@gmail.com) on 2018-03-29T11:30:21Z (GMT) No. of bitstreams: 2 Dissertação - Danúbia Rodrigues da Cunha - 2018.pdf: 7944363 bytes, checksum: f0229888d666e1d8367f8c31ee27a238 (MD5) license_rdf: 0 bytes, checksum: d41d8cd98f00b204e9800998ecf8427e (MD5) / Made available in DSpace on 2018-03-29T11:30:21Z (GMT). No. of bitstreams: 2 Dissertação - Danúbia Rodrigues da Cunha - 2018.pdf: 7944363 bytes, checksum: f0229888d666e1d8367f8c31ee27a238 (MD5) license_rdf: 0 bytes, checksum: d41d8cd98f00b204e9800998ecf8427e (MD5) Previous issue date: 2018-02-08 / Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES / In this work, bivariate regression models based on the bivariate normal, t and Birnbaum-Saunders distributions are used to analyze labor market data. In special, the objective is to model the dependent variable of the Mincerian earnings equation separately, namely, the variable hourly earnings (which is obtained by dividing gross monthly earnings by hours worked) is modeled in two parts, earnings and hours worked. The bivariate regression models are used to model these two parts in order to try to capture the correlation between them and the different effects, that is, remuneration or premium for labor effort, and the labor supply or the time that the worker offers to the market. In order to accomplish this, data from the Brazilian National Household Sample Survey (PNAD) for the years 2013, 2014 and 2015 are used. The parameters of the models are estimated using the maximum likelihood method. The results show that the bivariate regression model based on the bivariate t distribution has the best fit for the data, and that the presence of correlation between earnings and hours worked indicates that the bivariate model is more adequate than the univariate model. / Nessa dissertação, modelos de regressão bivariada baseados nas distribuições bivariadas normal, t e Birnbaum-Saunders são usados para analisar dados do mercado de trabalho. Em especial, o objetivo é modelar a variável dependente da equação de rendimento minceriana de forma separada, ou seja, o rendimento-hora é modelado em duas partes, rendimento e horas trabalhadas. Os modelos de regressão bivariada são utilizados para modelar essas duas partes de forma a tentar captar a correlação entre elas e os distintos efeitos, ou seja, remuneração ou prêmio pelo esforço desprendido pela mão de obra, e oferta de trabalho ou o tempo que o trabalhador disponibiliza ao mercado. Para tal, usa-se dados da Pesquisa Nacional por Amostra de Domicílios (PNAD) para os anos de 2013, 2014 e 2015. Os parâmetros dos modelos são estimados usando o método da máxima verossimilhança. Os resultados mostram que o modelo de regressão bivariada baseada na distribuição bivariada t tem o melhor ajuste para os dados, e que a presença de correlação entre rendimento e horas trabalhadas indica que o modelo bivariado é mais adequado que o univariado. Equação de salário minceriana Modelo de regressão bivariada Mincerian earnings equation Bivariate regression model CIENCIAS SOCIAIS APLICADAS::ECONOMIA
45	D- and Ds-optimal Designs for Estimation of Parameters in Bivariate Copula Models Liu, Hua-Kun 27 July 2007 (has links) For current status data, the failure time of interest may not be observed exactly. The type of this data consists only of a monitoring time and knowledge of whether the failure time occurred before or after the monitoring time. In order to be able to obtain more information from this data, so the monitoring time is very important. In this work, the optimal designs for determining the monitoring times such that maximum information may be obtained in bivariate copula model (Clayton) are investigated. Here, the D- optimal criterion is used to decide the best monitoring time Ci (i = 1; ¢ ¢ ¢ ; n), then use these monitoring times Ci to estimate the unknown parameters simultaneously by maximizing the corresponding likelihood function. Ds-optimal designs for estimation of association parameter in the copula model are also discussed. Simulation studies are presented to compare the performance of using monitoring time C¤D and C¤Ds to do the estimation. Ds-optimal design monitoring time Bivariate current status data Copula model D-optimal design
46	Interval Censoring and Longitudinal Survey Data Pantoja Galicia, Norberto January 2007 (has links) Being able to explore a relationship between two life events is of great interest to scientists from different disciplines. Some issues of particular concern are, for example, the connection between smoking cessation and pregnancy (Thompson and Pantoja-Galicia 2003), the interrelation between entry into marriage for individuals in a consensual union and first pregnancy (Blossfeld and Mills 2003), and the association between job loss and divorce (Charles and Stephens 2004, Huang 2003 and Yeung and Hofferth 1998). Establishing causation in observational studies is seldom possible. Nevertheless, if one of two events tends to precede the other closely in time, a causal interpretation of an association between these events can be more plausible. The role of longitudinal surveys is crucial, then, since they allow sequences of events for individuals to be observed. Thompson and Pantoja-Galicia (2003) discuss in this context several notions of temporal association and ordering, and propose an approach to investigate a possible relationship between two lifetime events. In longitudinal surveys individuals might be asked questions of particular interest about two specific lifetime events. Therefore the joint distribution might be advantageous for answering questions of particular importance. In follow-up studies, however, it is possible that interval censored data may arise due to several reasons. For example, actual dates of events might not have been recorded, or are missing, for a subset of (or all) the sampled population, and can be established only to within specified intervals. Along with the notions of temporal association and ordering, Thompson and Pantoja-Galicia (2003) also discuss the concept of one type of event "triggering" another. In addition they outline the construction of tests for these temporal relationships. The aim of this thesis is to implement some of these notions using interval censored data from longitudinal complex surveys. Therefore, we present some proposed tools that may be used for this purpose. This dissertation is divided in five chapters, the first chapter presents a notion of a temporal relationship along with a formal nonparametric test. The mechanisms of right censoring, interval censoring and left truncation are also overviewed. Issues on complex surveys designs are discussed at the end of this chapter. For the remaining chapters of the thesis, we note that the corresponding formal nonparametric test requires estimation of a joint density, therefore in the second chapter a nonparametric approach for bivariate density estimation with interval censored survey data is provided. The third chapter is devoted to model shorter term triggering using complex survey bivariate data. The semiparametric models in Chapter 3 consider both noncensoring and interval censoring situations. The fourth chapter presents some applications using data from the National Population Health Survey and the Survey of Labour and Income Dynamics from Statistics Canada. An overall discussion is included in the fifth chapter and topics for future research are also addressed in this last chapter. Interval censoring Complex sampling designs Bivariate density estimation Local likelihood Complex surveys Statistics
47	Interval Censoring and Longitudinal Survey Data Pantoja Galicia, Norberto January 2007 (has links) Being able to explore a relationship between two life events is of great interest to scientists from different disciplines. Some issues of particular concern are, for example, the connection between smoking cessation and pregnancy (Thompson and Pantoja-Galicia 2003), the interrelation between entry into marriage for individuals in a consensual union and first pregnancy (Blossfeld and Mills 2003), and the association between job loss and divorce (Charles and Stephens 2004, Huang 2003 and Yeung and Hofferth 1998). Establishing causation in observational studies is seldom possible. Nevertheless, if one of two events tends to precede the other closely in time, a causal interpretation of an association between these events can be more plausible. The role of longitudinal surveys is crucial, then, since they allow sequences of events for individuals to be observed. Thompson and Pantoja-Galicia (2003) discuss in this context several notions of temporal association and ordering, and propose an approach to investigate a possible relationship between two lifetime events. In longitudinal surveys individuals might be asked questions of particular interest about two specific lifetime events. Therefore the joint distribution might be advantageous for answering questions of particular importance. In follow-up studies, however, it is possible that interval censored data may arise due to several reasons. For example, actual dates of events might not have been recorded, or are missing, for a subset of (or all) the sampled population, and can be established only to within specified intervals. Along with the notions of temporal association and ordering, Thompson and Pantoja-Galicia (2003) also discuss the concept of one type of event "triggering" another. In addition they outline the construction of tests for these temporal relationships. The aim of this thesis is to implement some of these notions using interval censored data from longitudinal complex surveys. Therefore, we present some proposed tools that may be used for this purpose. This dissertation is divided in five chapters, the first chapter presents a notion of a temporal relationship along with a formal nonparametric test. The mechanisms of right censoring, interval censoring and left truncation are also overviewed. Issues on complex surveys designs are discussed at the end of this chapter. For the remaining chapters of the thesis, we note that the corresponding formal nonparametric test requires estimation of a joint density, therefore in the second chapter a nonparametric approach for bivariate density estimation with interval censored survey data is provided. The third chapter is devoted to model shorter term triggering using complex survey bivariate data. The semiparametric models in Chapter 3 consider both noncensoring and interval censoring situations. The fourth chapter presents some applications using data from the National Population Health Survey and the Survey of Labour and Income Dynamics from Statistics Canada. An overall discussion is included in the fifth chapter and topics for future research are also addressed in this last chapter. Interval censoring Complex sampling designs Bivariate density estimation Local likelihood Complex surveys Statistics
48	Effect Of Estimation In Goodness-of-fit Tests Eren, Emrah 01 September 2009 (has links) (PDF) In statistical analysis, distributional assumptions are needed to apply parametric procedures. Assumptions about underlying distribution should be true for accurate statistical inferences. Goodness-of-fit tests are used for checking the validity of the distributional assumptions. To apply some of the goodness-of-fit tests, the unknown population parameters are estimated. The null distributions of test statistics become complicated or depend on the unknown parameters if population parameters are replaced by their estimators. This will restrict the use of the test. Goodness-of-fit statistics which are invariant to parameters can be used if the distribution under null hypothesis is a location-scale distribution. For location and scale invariant goodness-of-fit tests, there is no need to estimate the unknown population parameters. However, approximations are used in some of those tests. Different types of estimation and approximation techniques are used in this study to compute goodness-of-fit statistics for complete and censored samples from univariate distributions as well as complete samples from bivariate normal distribution. Simulated power properties of the goodness-of-fit tests against a broad range of skew and symmetric alternative distributions are examined to identify the estimation effects in goodness-of-fit tests. The main aim of this thesis is to modify goodness-of-fit tests by using different estimators or approximation techniques, and finally see the effect of estimation on the power of these tests. HA Statistics 36161
49	Generalized rank tests for univariate and bivariate interval-censored failure time data Sun, De-Yu 20 June 2003 (has links) In Part 1 of this paper, we adapt Turnbull¡¦s algorithm to estimate the distribution function of univariate interval-censored and truncated failure time data. We also propose four non-parametric tests to test whether two groups of the data come from the same distribution. The powers of proposed test statistics are compared by simulation under different distributions. The proposed tests are then used to analyze an AIDS study. In Part 2, for bivariate interval-censored data, we propose some models of how to generate the data and several methods to measure the correlation between the two variates. We also propose several nonparametric tests to determine whether the two variates are mutually independent or whether they have the same distribution. We demonstrate the performance of these tests by simulation and give an application to AIDS study¡]ACTG 181¡^. Turnbull's algorithm non-parametric tests simulation
50	A Latent Mixture Approach to Modeling Zero-Inflated Bivariate Ordinal Data Kadel, Rajendra 01 January 2013 (has links) Multivariate ordinal response data, such as severity of pain, degree of disability, and satisfaction with a healthcare provider, are prevalent in many areas of research including public health, biomedical, and social science research. Ignoring the multivariate features of the response variables, that is, by not taking the correlation between the errors across models into account, may lead to substantially biased estimates and inference. In addition, such multivariate ordinal outcomes frequently exhibit a high percentage of zeros (zero inflation) at the lower end of the ordinal scales, as compared to what is expected under a multivariate ordinal distribution. Thus, zero inflation coupled with the multivariate structure make it difficult to analyze such data and properly interpret the results. Methods that have been developed to address the zero-inflated data are limited to univariate-logit or univariate-probit model, and extension to bivariate (or multivariate) probit models has been very limited to date. In this research, a latent variable approach was used to develop a Mixture Bivariate Zero-Inflated Ordered Probit (MBZIOP) model. A Bayesian MCMC technique was used for parameter estimation. A simulation study was then conducted to compare the performances of the estimators of the proposed model with two existing models. The simulation study suggested that for data with at least a moderate proportion of zeros in bivariate responses, the proposed model performed better than the comparison models both in terms of lower bias and greater accuracy (RMSE). Finally, the proposed method was illustrated with a publicly-available drug-abuse dataset to identify highly probable predictors of: (i) being a user/nonuser of marijuana, cocaine, or both; and (ii), conditional on user status, the level of consumption of these drugs. The results from the analysis suggested that older individuals, smokers, and people with a prior criminal background have a higher risk of being a marijuana only user, or being the user of both drugs. However, cocaine only users were predicted on the basis of being younger and having been engaged in the criminal-justice system. Given that an individual is a user of marijuana only, or user of both drugs, age appears to have an inverse effect on the latent level of consumption of marijuana as well as cocaine. Similarly, given that a respondent is a user of cocaine only, all covariates--age, involvement in criminal activities, and being of black race--are strong predictors of the level of cocaine consumption. The finding of older age being associated with higher drug consumption may represent a survival bias whereby previous younger users with high consumption may have been at elevated risk of premature mortality. Finally, the analysis indicated that blacks are likely to use less marijuana, but have a higher latent level of cocaine given that they are user of both drugs. Bayesian approach bivariate ordered probit MCMC WinBUGS zero-inflation Biostatistics Statistics and Probability

Search results