• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 101
  • 11
  • 9
  • 9
  • 5
  • 5
  • 3
  • 2
  • 2
  • 2
  • 2
  • 1
  • 1
  • 1
  • 1
  • Tagged with
  • 174
  • 174
  • 66
  • 53
  • 53
  • 38
  • 27
  • 26
  • 22
  • 21
  • 19
  • 18
  • 17
  • 16
  • 16
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
51

Modelos de regressão quantílica / Quantile Regression Models

Bruno Ramos dos Santos 02 March 2012 (has links)
Este trabalho trata de modelos de regressão quantílica. Foi feita uma introdução a essa classe de modelos para motivar a discussão. Em seguida, conceitos inferenciais, como estimação, intervalos de confiança, testes de hipóteses para os parâmetros são discutidos, acompanhados de alguns estudos de simulação. Para analisar a qualidade do ajuste, são apresentados o coeficiente de determinação e um teste de falta de ajuste para modelos de regressão quantílica. Também é proposta a utilização de gráficos para análise da qualidade do ajuste considerando a distribuição Laplace Assimétrica. Uma aplicação utilizando um banco de dados com informação sobre renda no Brasil foi utilizado para exemplificar os tópicos discutidos durante o texto. / This work is about quantile regression models. An introduction was made to this class of models to motivate the discussion. Then, inferential concepts, like estimation, confidence intervals, tests of hypothesis for the parameters are discussed, followed by some simulation studies. To analyse goodness of fit, a coefficient of determination and a lack-of-fit test for quantile regression models are presented. Its also proposed the use of graphs for the goodness of fit analysis considering the Asymmetric Laplace Distribution. An application using a data base with information about income in Brazil was used to exemplify the topics discussed during the text.
52

The Power of Categorical Goodness-Of-Fit Statistics

Steele, Michael C., n/a January 2003 (has links)
The relative power of goodness-of-fit test statistics has long been debated in the literature. Chi-Square type test statistics to determine 'fit' for categorical data are still dominant in the goodness-of-fit arena. Empirical Distribution Function type goodness-of-fit test statistics are known to be relatively more powerful than Chi-Square type test statistics for restricted types of null and alternative distributions. In many practical applications researchers who use a standard Chi-Square type goodness-of-fit test statistic ignore the rank of ordinal classes. This thesis reviews literature in the goodness-of-fit field, with major emphasis on categorical goodness-of-fit tests. The continued use of an asymptotic distribution to approximate the exact distribution of categorical goodness-of-fit test statistics is discouraged. It is unlikely that an asymptotic distribution will produce a more accurate estimation of the exact distribution of a goodness-of-fit test statistic than a Monte Carlo approximation with a large number of simulations. Due to their relatively higher powers for restricted types of null and alternative distributions, several authors recommend the use of Empirical Distribution Function test statistics over nominal goodness-of-fit test statistics such as Pearson's Chi-Square. In-depth power studies confirm the views of other authors that categorical Empirical Distribution Function type test statistics do not have higher power for some common null and alternative distributions. Because of this, it is not sensible to make a conclusive recommendation to always use an Empirical Distribution Function type test statistic instead of a nominal goodness-of-fit test statistic. Traditionally the recommendation to determine 'fit' for multivariate categorical data is to treat categories as nominal, an approach which precludes any gain in power which may accrue from a ranking, should one or more variables be ordinal. The presence of multiple criteria through multivariate data may result in partially ordered categories, some of which have equal ranking. This thesis proposes a modification to the currently available Kolmogorov-Smirnov test statistics for ordinal and nominal categorical data to account for situations of partially ordered categories. The new test statistic, called the Combined Kolmogorov-Smirnov, is relatively more powerful than Pearson's Chi-Square and the nominal Kolmogorov-Smirnov test statistic for some null and alternative distributions. A recommendation is made to use the new test statistic with higher power in situations where some benefit can be achieved by incorporating an Empirical Distribution Function approach, but the data lack a complete natural ordering of categories. The new and established categorical goodness-of-fit test statistics are demonstrated in the analysis of categorical data with brief applications as diverse as familiarity of defence programs, the number of recruits produced by the Merlin bird, a demographic problem, and DNA profiling of genotypes. The results from these applications confirm the recommendations associated with specific goodness-of-fit test statistics throughout this thesis.
53

Estimation and testing the effect of covariates in accelerated life time models under censoring

Liero, Hannelore January 2010 (has links)
The accelerated lifetime model is considered. To test the influence of the covariate we transform the model in a regression model. Since censoring is allowed this approach leads to a goodness-of-fit problem for regression functions under censoring. So nonparametric estimation of regression functions under censoring is investigated, a limit theorem for a L2-distance is stated and a test procedure is formulated. Finally a Monte Carlo procedure is proposed.
54

On the Measurement of Model Fit for Sparse Categorical Data

Kraus, Katrin January 2012 (has links)
This thesis consists of four papers that deal with several aspects of the measurement of model fit for categorical data. In all papers, special attention is paid to situations with sparse data. The first paper concerns the computational burden of calculating Pearson's goodness-of-fit statistic for situations where many response patterns have observed frequencies that equal zero. A simple solution is presented that allows for the computation of the total value of Pearson's goodness-of-fit statistic when the expected frequencies of response patterns with observed frequencies of zero are unknown. In the second paper, a new fit statistic is presented that is a modification of Pearson's statistic but that is not adversely affected by response patterns with very small expected frequencies. It is shown that the new statistic is asymptotically equivalent to Pearson's goodness-of-fit statistic and hence, asymptotically chi-square distributed. In the third paper, comprehensive simulation studies are conducted that compare seven asymptotically equivalent fit statistics, including the new statistic. Situations that are considered concern both multinomial sampling and factor analysis. Tests for the goodness-of-fit are conducted by means of the asymptotic and the bootstrap approach both under the null hypothesis and when there is a certain degree of misfit in the data. Results indicate that recommendations on the use of a fit statistic can be dependent on the investigated situation and on the purpose of the model test. Power varies substantially between the fit statistics and the cause of the misfit of the model. Findings indicate further that the new statistic proposed in this thesis shows rather stable results and compared to the other fit statistics, no disadvantageous characteristics of the fit statistic are found. Finally, in the fourth paper, the potential necessity of determining the goodness-of-fit by two sided model testing is adverted. A simulation study is conducted that investigates differences between the one sided and the two sided approach of model testing. Situations are identified for which two sided model testing has advantages over the one sided approach.
55

A Novel Approach to the Analysis of Nonlinear Time Series with Applications to Financial Data

Lee, Jun Bum 2012 May 1900 (has links)
The spectral analysis method is an important tool in time series analysis and the spectral density plays a crucial role on the spectral analysis. However, one of limitations of the spectral density is that the spectral density reflects only the covariance structure among several dependence measures in the time series data. To overcome this restriction, we define two spectral densities, the quantile spectral density and the association spectral density. The quantile spectral density can model the pairwise dependence structure and provide identification of nonlinear time series and the association spectral density allows detecting periodicities on different parts of the domain of the time series. We propose the estimators for the quantile spectral density and the association spectral density and derive their sampling properties including asymptotic normality. Furthermore, we use the quantile spectral density to develop a goodness-of-fit tests for time series and explain how this test can be used for comparing the sequential dependence structure of two time series. The asymptotic sampling properties of the test statistic are derived under the null and alternative hypothesis, and a bootstrap procedure is suggested to obtain finite sample approximation. The method is illustrated with simulations and some real data examples. Besides the exploration of the new spectral densities, we consider general quadratic forms of alpha-mixing time series and derive asymptotic normality of these forms under the relatively weak assumptions.
56

Frequentist-Bayes Goodness-of-fit Tests

Wang, Qi 2011 August 1900 (has links)
In this dissertation, the classical problems of testing goodness-of-fit of uniformity and parametric families are reconsidered. A new omnibus test for these problems is proposed and investigated. The new test statistics are a combination of Bayesian and score test ideas. More precisely, singletons that contain only one more parameter than the null describing departures from the null model are introduced. A Laplace approximation to the posterior probability of the null hypothesis is used, leading to test statistics that are weighted sums of exponentiated squared Fourier coefficients. The weights depend on prior probabilities and the Fourier coefficients are estimated based on score tests. Exponentiation of Fourier components leads to tests that can be exceptionally powerful against high frequency alternatives. Comprehensive simulations show that the new tests have good power against high frequency alternatives and perform comparably to some other well-known omnibus tests at low frequency alternatives. Asymptotic distributions of the proposed test are derived under null and alternative hypotheses. An application of the proposed test to an interesting real problem is also presented.
57

Analysing stochastic call demand with time varying parameters

Li, Song 25 November 2005
In spite of increasingly sophisticated workforce management tools, a significant gap remains between the goal of effective staffing and the present difficulty predicting the stochastic demand of inbound calls. We have investigated the hypothesized nonhomogeneous Poisson process model of modem pool callers of the University community. In our case, we tested if the arrivals could be approximated by a piecewise constant rate over short intervals. For each of 1 and 10-minute intervals, based on the close relationship between the Poisson process and the exponential distribution, the test results did not show any sign of homogeneous Poisson process. We have examined the hypothesis of a nonhomogeneous Poisson process by a transformed statistic. Quantitative and graphical goodness-of-fit tests have confirmed nonhomogeneous Poisson process. <p>Further analysis on the intensity function revealed that linear rate intensity was woefully inadequate in predicting time varying arrivals. For sinusoidal rate model, difficulty arose in setting the period parameter. Spline models, as an alternative to parametric modelling, had more control of balance between data fitting and smoothness, which was appealing to our analysis on call arrival process.
58

A consolidated study of goodness-of-fit tests

Paul, Ajay Kumar 03 June 2011 (has links)
An important problem in statistical inference is to check the adequacy of models upon which inferences are based. Some valuable tools are available for examining a model's suitability of which the most widely used is the goodness-of-fit test. The pioneering work in this area is by Karl Pearson (1900). Since then, a considerable amount of work has been done so far and investigation is still going on in this field due to its importance in the hypothesis testing problem.This thesis contains an expository discussion of the goodness-of-fit tests, intended for the users of the statistical theory. An attempt is made here to give a complete coverage of the historical development, present status and other current problems related to this topic. Numerical examples are provided to best explain the test procedures. The contents, taken as a whole, constitute a unified presentation of some of the most important aspects of goodness-of-fit tests.Ball State UniversityMuncie, IN 57406
59

Analysing stochastic call demand with time varying parameters

Li, Song 25 November 2005 (has links)
In spite of increasingly sophisticated workforce management tools, a significant gap remains between the goal of effective staffing and the present difficulty predicting the stochastic demand of inbound calls. We have investigated the hypothesized nonhomogeneous Poisson process model of modem pool callers of the University community. In our case, we tested if the arrivals could be approximated by a piecewise constant rate over short intervals. For each of 1 and 10-minute intervals, based on the close relationship between the Poisson process and the exponential distribution, the test results did not show any sign of homogeneous Poisson process. We have examined the hypothesis of a nonhomogeneous Poisson process by a transformed statistic. Quantitative and graphical goodness-of-fit tests have confirmed nonhomogeneous Poisson process. <p>Further analysis on the intensity function revealed that linear rate intensity was woefully inadequate in predicting time varying arrivals. For sinusoidal rate model, difficulty arose in setting the period parameter. Spline models, as an alternative to parametric modelling, had more control of balance between data fitting and smoothness, which was appealing to our analysis on call arrival process.
60

Generalized score tests for missing covariate data

Jin, Lei 15 May 2009 (has links)
In this dissertation, the generalized score tests based on weighted estimating equations are proposed for missing covariate data. Their properties, including the effects of nuisance functions on the forms of the test statistics and efficiency of the tests, are investigated. Different versions of the test statistic are properly defined for various parametric and semiparametric settings. Their asymptotic distributions are also derived. It is shown that when models for the nuisance functions are correct, appropriate test statistics can be obtained via plugging the estimates of the nuisance functions into the appropriate test statistic for the case that the nuisance functions are known. Furthermore, the optimal test is obtained using the relative efficiency measure. As an application of the proposed tests, a formal model validation procedure is developed for generalized linear models in the presence of missing covariates. The asymptotic distribution of the data driven methods is provided. A simulation study in both linear and logistic regressions illustrates the applicability and the finite sample performance of the methodology. Our methods are also employed to analyze a coronary artery disease diagnostic dataset.

Page generated in 0.1047 seconds