• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 227
  • 71
  • 28
  • 25
  • 21
  • 14
  • 11
  • 6
  • 6
  • 6
  • 5
  • 3
  • 3
  • 2
  • 2
  • Tagged with
  • 511
  • 126
  • 95
  • 88
  • 73
  • 72
  • 70
  • 48
  • 48
  • 43
  • 39
  • 38
  • 36
  • 35
  • 34
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
261

Bayesian analysis of regression models for proportional data in the presence of zeros and ones = Análise bayesiana de modelos de regressão para dados de proporções na presença de zeros e uns / Análise bayesiana de modelos de regressão para dados de proporções na presença de zeros e uns

Galvis Soto, Diana Milena, 1978- 26 August 2018 (has links)
Orientador: Víctor Hugo Lachos Dávila / Tese (doutorado) - Universidade Estadual de Campinas, Instituto de Matemática Estatística e Computação Científica / Made available in DSpace on 2018-08-26T02:34:17Z (GMT). No. of bitstreams: 1 GalvisSoto_DianaMilena_D.pdf: 1208980 bytes, checksum: edbc193912a2a800da4936526ed79fa3 (MD5) Previous issue date: 2014 / Resumo: Dados no intervalo (0,1) geralmente representam proporções, taxas ou índices. Porém, é possível observar situações práticas onde as proporções sejam zero e/ou um, representando ausência ou presença total da característica de interesse. Nesses casos, os modelos que analisam o efeito de covariáveis, tais como a regressão beta, beta retangular e simplex não são convenientes. Com o intuito de abordar este tipo de situações, considera-se como alternativa aumentar os valores zero e/ou um ao suporte das distribuições previamente mencionadas. Nesta tese, são propostos modelos de regressão de efeitos mistos para dados de proporções aumentados de zeros e uns, os quais permitem analisar o efeito de covariáveis sobre a probabilidade de observar ausência ou presença total da característica de interesse, assim como avaliar modelos com respostas correlacionadas. A estimação dos parâmetros de interesse pode ser via máxima verossimilhança ou métodos Monte Carlo via Cadeias de Markov (MCMC). Nesta tese, será adotado o enfoque Bayesiano, o qual apresenta algumas vantagens em relação à inferência clássica, pois não depende da teoria assintótica e os códigos são de fácil implementação, através de softwares como openBUGS e winBUGS. Baseados na distribuição marginal, é possível calcular critérios de seleção de modelos e medidas Bayesianas de divergência q, utilizadas para detectar observações discrepantes / Abstract: Continuous data in the unit interval (0,1) represent, generally, proportions, rates or indices. However, zeros and/or ones values can be observed, representing absence or total presence of a carachteristic of interest. In that case, regression models that analyze the effect of covariates such as beta, beta rectangular or simplex are not appropiate. In order to deal with this type of situations, an alternative is to add the zero and/or one values to the support of these models. In this thesis and based on these models, we propose the mixed regression models for proportional data augmented by zero and one, which allow analyze the effect of covariates into the probabilities of observing absence or total presence of the interest characteristic, besides of being possivel to deal with correlated responses. Estimation of parameters can follow via maximum likelihood or through MCMC algorithms. We follow the Bayesian approach, which presents some advantages when it is compared with classical inference because it allows to estimate the parameters even in small size sample. In addition, in this approach, the implementation is straightforward and can be done using software as openBUGS or winBUGS. Based on the marginal likelihood it is possible to calculate selection model criteria as well as q-divergence measures used to detect outlier observations / Doutorado / Estatistica / Doutora em Estatística
262

Survival modelling and analysis of HIV/AIDS patients on HIV care and antiretroviral treatment to determine longevity prognostic factors

Maposa, Innocent January 2016 (has links)
Philosophiae Doctor - PhD / The HIV/AIDS pandemic has been a torment to the African developmental agenda, especially the Southern African Development Countries (SADC), for the past two decades. The disease and condition tends to affect the productive age groups. Children have also not been spared from the severe effects associated with the disease. The advent of antiretroviral treatment (ART) has brought a great relief to governments and patients in these regions. More people living with HIV/AIDS have experienced a boost in their survival prospects and hence their contribution to national developmental projects. Survival analysis methods are usually used in biostatistics, epidemiological modelling and clinical research to model time to event data. The most interesting aspect of this analysis comes when survival models are used to determine risk factors for the survival of patients undergoing some treatment or living with a certain disease condition. The purpose of this thesis was to determine prognostic risk factors for patients' survival whilst on ART. The study sought to highlight the risk factors that impact the survival time negatively at different survival time points. The study utilized a sample of paediatric and adult datasets from Namibia and Zimbabwe respectively. The paediatric dataset from Katutura hospital (Namibia) comprised of the adolescents and children on ART, whilst the adult dataset from Bulawayo hospital (Zimbabwe) comprised of those patients on ART in the 15 years and above age categories. All datasets used in this thesis were based on retrospective cohorts followed for some period of time. Different methods to reduce errors in parameter estimation were employed to the datasets. The proportional hazards, Bayesian proportional hazards and the censored quantile regression models were utilized in this study. The results from the proportional hazards model show that most of the variables considered were not signifcant overall. The Bayesian proportional hazards model shows us that all the considered factors had different risk profiles at the different quartiles of the survival times. This highlights that by using the proportional hazards models, we only get a fixed constant effect of the risk factors, yet in reality, the effect of risk factors differs at different survival time points. This picture was strongly highlighted by the censored quantile regression model which indicated that some variables were significant in the early periods of initiation whilst they did not significantly affect survival time at any other points in the survival time distribution. The censored quantile regression models clearly demonstrate that there are significant insights gained on the dynamics of how different prognostic risk factors affect patient survival time across the survival time distribution compared to when we use proportional hazards and Bayesian propotional hazards models. However, the advantages of using the proportional hazards framework, due to the estimation of hazard rates as well as it's application in the competing risk framework are still unassailable. The hazard rate estimation under the censored quantile regression framework is an area that is still under development and the computational aspects are yet to be incorporated into the mainstream statistical softwares. This study concludes that, with the current literature and computational support, using both model frameworks to ascertain the dynamic effects of different prognostic risk factors for survival in people living with HIV/AIDS and on ART would give the researchers more insights. These insights will then help public health policy makers to draft relevant targeted policies aimed at improving these patients' survival time on treatment.
263

An Improved Framework for Dynamic Origin-Destination (O-D) Matrix Estimation

Chi, Hongbo 09 November 2010 (has links)
This dissertation aims to improve the performance of existing assignment-based dynamic origin-destination (O-D) matrix estimation models to successfully apply Intelligent Transportation Systems (ITS) strategies for the purposes of traffic congestion relief and dynamic traffic assignment (DTA) in transportation network modeling. The methodology framework has two advantages over the existing assignment-based dynamic O-D matrix estimation models. First, it combines an initial O-D estimation model into the estimation process to provide a high confidence level of initial input for the dynamic O-D estimation model, which has the potential to improve the final estimation results and reduce the associated computation time. Second, the proposed methodology framework can automatically convert traffic volume deviation to traffic density deviation in the objective function under congested traffic conditions. Traffic density is a better indicator for traffic demand than traffic volume under congested traffic condition, thus the conversion can contribute to improving the estimation performance. The proposed method indicates a better performance than a typical assignment-based estimation model (Zhou et al., 2003) in several case studies. In the case study for I-95 in Miami-Dade County, Florida, the proposed method produces a good result in seven iterations, with a root mean square percentage error (RMSPE) of 0.010 for traffic volume and a RMSPE of 0.283 for speed. In contrast, Zhou’s model requires 50 iterations to obtain a RMSPE of 0.023 for volume and a RMSPE of 0.285 for speed. In the case study for Jacksonville, Florida, the proposed method reaches a convergent solution in 16 iterations with a RMSPE of 0.045 for volume and a RMSPE of 0.110 for speed, while Zhou’s model needs 10 iterations to obtain the best solution, with a RMSPE of 0.168 for volume and a RMSPE of 0.179 for speed. The successful application of the proposed methodology framework to real road networks demonstrates its ability to provide results both with satisfactory accuracy and within a reasonable time, thus establishing its potential usefulness to support dynamic traffic assignment modeling, ITS systems, and other strategies.
264

Finanční analýza vybrané společnosti / Financial analysis of the selected company

Krumpholc, Martin January 2012 (has links)
The aim of this thesis is to evaluate the performance of the company Philip Morris ČR. The evaluation is done for years 2006 to 2011. The first theoretical part describes methods and principles which are used in practical part. All information in this part have been obtained from literature. The financial analysis in the second part contains horizontal and vertical analysis of the balance sheet and the statement of income, the ration indicators, the balance rules, creditworthy and bankruptcy models, Du Pont analysis, analysis of the Economic Value Added, and company comparisons. In the conclusion, the overall summary is carried out and possible recommendations that could lead to improvements are mentioned.
265

Sobrevivência de mulheres com câncer de mama sob a perspectiva dos modelos de riscos competitivos / Survival of women with breast cancer in the perspective of competing risks models

Ferraz, Rosemeire de Olanda, 1973- 02 November 2015 (has links)
Orientador: Djalma de Carvalho Moreira Filho / Tese (doutorado) - Universidade Estadual de Campinas, Faculdade de Ciências Médicas / Made available in DSpace on 2018-08-26T22:55:22Z (GMT). No. of bitstreams: 1 Ferraz_RosemeiredeOlanda_D.pdf: 2711370 bytes, checksum: b4966f4c4ea3b88daffa54c0576bd307 (MD5) Previous issue date: 2015 / Resumo: O objetivo deste estudo é identificar os fatores associados ao tempo de sobrevida do câncer de mama, como idade, estadiamento e extensão do tumor, utilizando modelos de riscos proporcionais de Cox e de riscos competitivos de Fine-Gray. E também propor um modelo de regressão paramétrico para ajustar o tempo de sobrevida na presença dos riscos competitivos. É um estudo de coorte retrospectivo de base-populacional referente a 524 mulheres diagnosticadas com câncer de mama no período de 1993 a 1995, acompanhadas até 2011, residentes no município de Campinas/SP. Um ponto de corte para a variável contínua da idade foi escolhido utilizando-se modelos de Cox. Nos ajustes de modelos simples e múltiplo de Fine-Gray e de Cox, a idade não foi significativa quando o óbito por câncer de mama foi o evento de interesse. As curvas de sobrevivências estimadas por Kaplan-Meier evidenciaram diferenças expressivas nas probabilidades comparando-se os óbitos por câncer de mama e por riscos competitivos. As curvas de sobrevida por câncer de mama não apresentaram diferenças significativas quando comparadas as categorias de idades, segundo teste de log rank. Os modelos de Fine-Gray e Cox identificaram praticamente as mesmas covariáveis influenciando no tempo de sobrevida para ambos eventos de interesse, óbitos por câncer de mama e óbitos por riscos competitivos. Foram comparados os modelos exponencial, de Weibull e lognormal com o modelo gama generalizada e conclui-se que o modelo de regressão de Weibull foi o mais adequado para ajustar o tempo de sobrevida na presença dos riscos competitivos, conforme resultados dos testes de razões de verossimilhanças / Abstract: The aim of this study is to identify associated factors to time failure survival of breast cancer such as age, stage and extent of the tumor using Cox's proportional hazards and Fine-Gray competing risks models. It is a retrospective cohort study of population-based concerning to 524 women diagnosed with breast cancer in the period 1993-1995, followed until 2011, living in the city of Campinas, São Paulo State, Brazil. The cutoff age variable has been defined using Cox models. In the settings of simple and multiple models of Fine-Gray and Cox age was not significant when the death from breast cancer was the outcome of interest. The survival curves estimated by Kaplan-Meier showed significant differences in the odds comparing the deaths from breast cancer and competing risks. The survival curves for breast cancer showed no significant differences when comparing age groups, according to the logrank test. The Fine-Gray and Cox models identified the same covariates influencing the survival time for both events of interest: deaths from breast cancer and deaths from competing risks. The exponential, Weibull and lognormal regression models were compared with generalized gamma model and it is concluded that the Weibull regression model was the most appropriate to adjust the survival time in the presence of competing risks, according to results of the ratio likelihood tests / Doutorado / Epidemiologia / Doutora em Saúde Coletiva
266

Genetic analysis of longevity in specialized lines of rabbits

El Nagar, Ayman Gamal Fawzy 29 June 2015 (has links)
[EN] The global objective of the present thesis was to study the functional longevity defined as length of productive life (LPL) in five Spanish specialized lines of rabbit (A, V, H and LP). Chapter 3, aimed to check the genetic heterogeneity for longevity between the five lines estimating the additive variance and the corresponding effective heritabilities. As well as to test the genetic importance of time-dependent factors such as positive palpation order (OPP), physiological status (PS) and number of kits born alive (NBA) on the genetics of longevity. This point has been assessed using four different Cox proportional hazard models; the first one (Model 1) included all the previous factors in addition to the year-season effect, the inbreeding coefficient effect and finally the animal effect as random factor. The remaining three models were the same as Model 1 but excluding OPP (Model 2), or PS (Model 3), or NBA (Model 4). The complete data set comprised 15,670 does with records 35.6 % having censoring data, and the full pedigree file involved 19,405 animals. The heritability estimates for longevity in the five lines were low and ranged from 0.02±0.01 to 0.14±0.09, and consequently, it is not recommended to include this trait as selection criteria in rabbit breeding programs. Despite of the large variation of the heritability estimates, the corresponding HPD95% always overlapped and consequently the hypothesis of all lines having the same heritability cannot be discarded. Comparing the additive variance estimates of the four models, it was observed that by correcting for PS 51, 39, 38, 83 and 75% of the additive variance in lines A, V, H, LP and R, respectively, was removed. The risk of death or culling decreases as OPP advanced. Non-pregnant-non-lactating females are those under the higher risk. The does which had zero NBA had the highest risk, apart for this special figure (zero NBA) the risk decreased as NBA increased. Chapter 4 intended to estimate the genetic and environmental correlations between longevity and two prolificacy traits (number of kits born alive (NBA) and number of kits alive at weaning (NW)). Furthermore, to estimate the genetic and environmental correlations between longevity and the percentage of days that the doe spent in the different physiological statuses with respect to its entire productive life. The complete pedigree file comprised 19,405 animals. The datasets included records on 15,670 does which had 58,329 kindlings and 57,927 weanings. In general the genetic correlations between NBA and NW, and the hazard were low to very low, and the only line for which it can be said these genetic correlation to be different from zero was the LP line. Regarding the correlations between longevity and the percentage of days the doe spent in each physiological status, there were evidences of non-negligible genetic correlations between the two traits. Chapter 5 purposed to compare the five lines at their foundation and at fixed time periods during their selection programs. The first comparison was done at the origin of the lines, involving the complete data set, and using a genetic model (CM) including the additive values of the animals, so the effect of selection was considered. For the second comparison the same model as the first comparison was used, but excluding the additive effects from the model of analysis (IM), and involving only the data corresponding to each period, so the differences between the lines included the additive values of the animals. The lines V, H and LP showed at foundation a substantial superiority over line A. The line R had higher risk of death or culling with relevant differences when compared to V, H and LP lines. The maximum relative risks were observed between the lines LP and R (0.239), and between LP and A (0.317). For the comparisons at fixed times, the pattern of the differences between the A line and the others was similar to those observed at foundation. / [ES] El objetivo global de la presente tesis fue estudiar la longevidad funcional en cinco líneas españolas de conejos (A, V, H y LP), el carácter se definió como la longitud de la vida productiva. En el Capítulo 3, dirigido a comprobar la heterogeneidad genética de la longevidad entre las 5 líneas, se estimaron las varianzas aditivas y sus correspondientes heredabilidades efectivas. Y además se evaluó la importancia del orden de la palpación positiva (OPP), el estado fisiológico (PS) y el número de gazapos nacidos vivos (NBA) sobre el determinismo genético de la longevidad. Para ello se utilizaron 4 modelos de Cox de riesgos proporcionales; el primer modelo (Modelo 1) incluyó todos los factores anteriores, además del efecto del año-estación, el efecto de la consanguinidad y, finalmente, el valor aditivo de los animales como efecto aleatorio. Los otros tres modelos fueron igual que el Modelo 1 pero excluyendo OPP (Modelo 2), o PS (Modelo 3), o NBA (Modelo 4). Los datos de longevidad estaban referidos a 15,670 conejas y tuvieron una tasa de censura de 35.6%. La genealogía completa involucró a 19,405 animales. Las estimas de heredabilidad efectiva para la longevidad en las 5 líneas fueron bajas y variaron de 0.02±0.01 a 0.14±0.09. A pesar de la gran variación de las estimas puntuales de heredabilidad, los correspondientes intervalos HPD95% siempre se solaparon y por lo tanto la hipótesis de que todas las líneas tengan la misma heredabilidad no pudo descartase. Se observó que la exclusión de PS incrementó la varianza aditiva aproximadamente, en un 51, 39, 38, 83 y 75% en las líneas A, V, H, LP y R, respectivamente. El riesgo de muerte o eliminación disminuía a medida que avanzaba el OPP, observándose el riesgo más alto durante los primeros dos partos, partos en los que las conejas todavía están creciendo lo que sería un factor de riesgo importante. El nivel No-Gestante-No-Lactante de PS tuvo el mayor riesgo. Este nivel se interpreta como indicador de baja fertilidad y/o problemas de salud de la coneja. Las conejas que tenían cero NBA tuvieron el mayor riesgo de muerte o eliminación, aunque para el resto de niveles de NBA se apreció una disminución del riesgo a medida que aumenta la prolificidad. En el capítulo 4, se estimaron las correlaciones genéticas y ambientales entre la longevidad y dos caracteres de prolificidad [número de gazapos nacidos vivos (NBA) y el número de destetados (NW)]. El fichero de datos incluyó 58,329 partos y 57,927 destetes. También se estimaron las correlaciones entre longevidad y el porcentaje de días que la coneja pasó en los diferentes estados fisiológicos con respecto a la totalidad de su vida productiva. La única línea para la que se puede decir que la correlación genética entre NBA o NW y el riesgo fue significativamente diferente de cero fue la línea LP. Hubo evidencias de correlaciones genéticas no despreciables entre la longevidad y el porcentaje de días que la hembra pasó en cada estado fisiológico los dos caracteres. En el capítulo 5 se compararon las longevidades medias de las 5 líneas en su fundación y en períodos de tiempo determinados. La comparación de las líneas en el origen, utilizó todos los datos y un modelo genético (CM) que incluía los valores aditivos de los animales. Para la comparación en tiempos fijos se utilizó el mismo modelo, pero excluyendo los efectos aditivos del modelo de análisis (IM), utilizando sólo los datos correspondientes a cada período, por lo que las diferencias entre las líneas incluían los cambios debidos a la selección. Las líneas V, H y LP mostraron una superioridad sustancial sobre las líneas A y R. Los riesgos relativos máximos se observaron entre las líneas LP y R (0.239), y entre LP y A (0.317). Con respecto a las comparaciones en tiempos fijos, el patrón de las diferencias entre la línea de A y las otras líneas fue similar a los observados en la fundación. / [CAT] L'objectiu global de la present tesi va ser estudiar la longevitat funcional en cinc línies espanyoles de conills (A, V, H i LP), el caràcter es va definir com la longitud de la vida productiva. Al Capítol 3, dirigit a comprovar l'heterogeneïtat genètica de la longevitat entre les 5 línies, es van estimar les variàncies additives i les seues corresponents heretabilitats efectives. A més a més, es va avaluar la importància de factors dependents del temps, com l'orde de la palpació positiva (OPP) , l'estat fisiològic (PS) i el nombre de llorigons nascuts vius (NBA) sobre el determinisme genètic de la longevitat. Per a això es van utilitzar 4 models de Cox de riscos proporcionals; el primer model (Model 1) va incloure tots els factors anteriorment assenyalats, a més de l'efecte de l'any-estació, l'efecte de la consanguinitat i, finalment, el valor additiu dels animals com a efecte aleatori. Els altres tres models van ser igual que el Model 1 però excloent l'OPP (Model 2) , o PS (Model 3) , o NBA (Model 4) . Les dades de longevitat estaven referides a 15,670 conilles i van tindre una taxa de censura de 35.6%. La genealogia completa va involucrar a 19,405 animals. Les estimes d'heretabilitat efectiva (Model 1) per a la longevitat en les 5 línies van ser baixes i van variar de 0.02±0.01 a 0.14±0.09. A pesar de la gran variació de les estimes puntuals d'heretabilitat, els corresponents intervals HPD95% sempre es van solapar i per tant la hipòtesi que totes les línies tinguen la mateixa heretabilitat no va poder descartar-se. Es va observar que l'exclusió de PS va incrementar la variància additiva, aproximadament, en un 51, 39, 38, 83 i 75% en les línies A, V, H, LP i R, respectivament. El risc de mort o eliminació disminuïa a mesura que avançava l'OPP, observant-se el risc més alt durant els primers dos parts, en què les conilles encara estan creixent el que seria un factor de risc important. El nivell No-Gestant-No-Lactant de PS va tindre el major risc en comparació amb els altres nivells. Les conilles que tenien zero NBA van tindre el major risc de mort o eliminació, encara que per a la resta de nivells de NBA es va apreciar una disminució del risc a mesura que augmentà la prolificitat. Al Capítol 4, es van estimar les correlacions genètiques i ambientals entre la longevitat i dos caràcters de prolificitat [nombre de llorigons nascuts vius (NBA) i el nombre de deslletats (NW)]. El fitxer de dades va incloure 58,329 parts i 57,927 deslletaments. L'única línia per a la que es pot dir que la correlació genètica entre NBA o NW i el risc va ser significativament diferent de zero va ser la línia LP. Evidències de correlacions genètiques no menyspreables entre longevitat i els percentatge de dies que la femella va passar en cada estat fisiològic. Al Capítol 5 es compararen les longevitats mitges de les 5 línies en la seua fundació i en períodes de temps determinats. Per a la comparació de les línies a l'origen, es van utilitzar totes les dades i un model genètic (CM) que incloïa els valors additius dels animals, per la qual cosa es va considerar l'efecte de la selecció a partir de la fundació. En la comparació en temps fixos se va utilitzar el mateix model que en l'anterior, però excloent els efectes additius del model d'anàlisi (IM), utilitzant només les dades corresponents a cada període, per la qual cosa les diferències entre les línies incloïen els canvis deguts a la selecció. Les línies V, H i LP van mostrar una superioritat substancial sobre les línies A i R. Els riscos relatius màxims es van observar entre les línies LP i R (0.239), i entre LP i A (0.317). Respecte a les comparacions en temps fixos, el patró de les diferències entre la línia de A i les altres línies va ser semblant als observats en la fundació. / El Nagar, AGF. (2015). Genetic analysis of longevity in specialized lines of rabbits [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/52390 / TESIS
267

Konstrukce koncového testovacího zařízení elektromagnetu / The construction of electromagnet testing device

Vlasák, František January 2016 (has links)
Master thesis describes a single-purpose machine design, that is used for testing a basic features of proportional solenoid. The design is created in cooperation with company NUVIA a.s. and contains mechanical solution that is processed in respect with ergonomic requirements, safety and overall costs. Machine concept is affected by customer requirements, whose name, as well as real parameters of the product will not be published, because of non-disclosure agreement.
268

Řízení proporcionálního hydraulického ventilu / Control of proportional directional control valves

Hoferek, Martin January 2017 (has links)
The thesis deals with design and implementation of proportional hydraulic valve, which will be integrated to hydraulic system of small hydro in Rájec - Jestřebí. This valve will be used to control one of the wicket gates of double Francis turbine. The thesis is processed for the company Mavel a.s., which is the owner of SH. The goal of this thesis is to create control of the valve according to the client's requirements, its implementation to the control system and commissioning.
269

Energy-efficient multistable valve driven by magnetic shape memory alloys

Schiepp, Thomas, Schnetzler, René, Riccardi, Leonardo, Laufenberg, Markus January 2016 (has links)
Magnetic shape memory alloys are active materials which deform under the application of a magnetic field or an external stress. Due to their internal friction, recognizable from the strain-stress hysteresis, this new material technology allows the design of multistable actuators. This paper describes and characterizes an innovative airflow control valve whose aperture is proportional to the deformation of the active material and thus controllable by the input voltage. The multistability of the material is partially exploited within an airflow control loop to reduce the energy losses of the valve when a specific airflow value must be hold.
270

Computation of estimates in a complex survey sample design

Maremba, Thanyani Alpheus January 2019 (has links)
Thesis (M.Sc. (Statistics)) -- University of Limpopo, 2019 / This research study has demonstrated the complexity involved in complex survey sample design (CSSD). Furthermore the study has proposed methods to account for each step taken in sampling and at the estimation stage using the theory of survey sampling, CSSD-based case studies and practical implementation based on census attributes. CSSD methods are designed to improve statistical efficiency, reduce costs and improve precision for sub-group analyses relative to simple random sample(SRS).They are commonly used by statistical agencies as well as development and aid organisations. CSSDs provide one of the most challenging fields for applying a statistical methodology. Researchers encounter a vast diversity of unique practical problems in the course of studying populations. These include, interalia: non-sampling errors,specific population structures,contaminated distributions of study variables,non-satisfactory sample sizes, incorporation of the auxiliary information available on many levels, simultaneous estimation of characteristics in various sub-populations, integration of data from many waves or phases of the survey and incompletely specified sampling procedures accompanying published data. While the study has not exhausted all the available real-life scenarios, it has outlined potential problems illustrated using examples and suggested appropriate approaches at each stage. Dealing with the attributes of CSSDs mentioned above brings about the need for formulating sophisticated statistical procedures dedicated to specific conditions of a sample survey. CSSD methodologies give birth to a wide variety of approaches, methodologies and procedures of borrowing the strength from virtually all branches of statistics. The application of various statistical methods from sample design to weighting and estimation ensures that the optimal estimates of a population and various domains are obtained from the sample data.CSSDs are probability sampling methodologies from which inferences are drawn about the population. The methods used in the process of producing estimates include adjustment for unequal probability of selection (resulting from stratification, clustering and probability proportional to size (PPS), non-response adjustments and benchmarking to auxiliary totals. When estimates of survey totals, means and proportions are computed using various methods, results do not differ. The latter applies when estimates are calculated for planned domains that are taken into account in sample design and benchmarking. In contrast, when the measures of precision such as standard errors and coefficient of variation are produced, they yield different results depending on the extent to which the design information is incorporated during estimation. The literature has revealed that most statistical computer packages assume SRS design in estimating variances. The replication method was used to calculate measures of precision which take into account all the sampling parameters and weighting adjustments computed in the CSSD process. The creation of replicate weights and estimation of variances were done using WesVar, astatistical computer package capable of producing statistical inference from data collected through CSSD methods. Keywords: Complex sampling, Survey design, Probability sampling, Probability proportional to size, Stratification, Area sampling, Cluster sampling.

Page generated in 0.0727 seconds