Spelling suggestions: "subject:"istatistical cience"" "subject:"istatistical cscience""
11 |
Application of ANOVA for the analysis of temporal and spatial differences in the length of pelagic goby preyed on by Cape fur seals in the coasts of NamibiaAnday, Tekie T January 2005 (has links)
Includes bibliographical references (leaves 62-66). / The Analysis of variance is a robust technique whereby the total variation present in a set of data is partitioned into two or more components (Wayne, 1999). In this thesis, ANOVA was used to uncover the differences in goby length preyed on by three different colonies of fur seals at the Namibian coast. Moreover, ANOVA was used to investigate temporal differences in lengths of goby preyed on by fur seals in each location of the seal colonies. Results of the analysis are shown in the Analysis and results section, and the findings are discussed in the discussion section. But before these two sections, there are three sections of the thesis. The first section is the general introduction that explains about the general situation and the targets of this thesis. The second section gives a general background on the ANOVA technique. The third section explains the nature of the data and gives background information on gobies.
|
12 |
A framework for regime identification and asset allocationKondlo, Mpumelelo January 2016 (has links)
The purpose of this thesis is to examine a regime-based asset allocation strategy and evaluate whether accounting for regime-dependent risk and return of asset classes provides any significant improvement on portfolio performance. The South African market and economy are considered as a proxy for the analysis. Motivation of this thesis stems from the growing body of research by practitioners devoted to models that are reflective of the interdependency between financial assets and the real economy. The asset classes under consideration for the analysis are domestic and foreign cash, domestic and foreign bonds, domestic and foreign equity, inflation linked bonds, property, gold and commodities. In order to evaluate the performance of the regime-based strategy, this thesis proposes a framework based on Principal Component Analysis and Fuzzy Cluster Analysis for regime identification and asset allocation. The performance of the strategy is tested against two strategies that are not cognizant of regime changes. These are an equally weighted portfolio and a buy-and-hold strategy. Furthermore, relative performance analysis was performed by comparing the regime-based strategy proposed in this thesis against the Alexander Forbes Large Manager Watch Index. Due to data limitations, the analysis is done on an in-sample basis without an out-of-sample testing. The results from the analysis showed the extent of outperformance of the proposed regime-based strategy relative to an equally weighted strategy and a buy-and-hold strategy. These results were consistent with existing literature on regime-based strategies. Furthermore, the results provided strong motivation for the use of the regime identification framework together with tactical asset allocation proposed in this thesis.
|
13 |
A comparative evaluation of data mining classification techniques on medical trauma dataRamaboa, Kutlwano K K M January 2004 (has links)
Includes bibliographical references (leaves 109-113). / The purpose of this research was to determine the extent to which a selection of data mining classification techniques (specifically, Discriminant Analysis, Decision Trees, and three artifical neural network models - Backpropogation, Probablilistic Neural Networks, and the Radial Basis Function) are able to correctly classify cases into the different categories of an outcome measure from a given set of input variables (i.e. estimate their classification accuracy) on a common database.
|
14 |
Statistical investigation into academic performance in the Faculty of Science at the University of Cape Town in the period 1990-1997Ronda, Katarzyna January 1999 (has links)
Includes bibliography. / Ultimate academic success at any tertiary institution is affected and partially determined by many factors related to various aspects of individual's life. These factors could be separated into the following distinct categories, namely, educational, biographical, environmental and personal factors. Some of these determinants are used in the admission procedures adopted at tertiary institutions. In South Africa, the results of different final matriculation examinations (referred to as matric or matric exams) written in several educational departments throughout the country are employed to assess the individual's potential to succeed. However, effectiveness of matric results as predictors of successful academic performance has always been controversial. Expressing these concerns and desiring to explore them, the Faculty of Science at the University of Cape Town (UCT) accepted a proposal from the Department of Statistical Sciences to investigate several issues affecting students' performance in the Faculty. The proposal has led to developing this M.Sc. thesis. The major issue of concern in this study is to describe, on a retrospective basis, the extent to which the current selection criteria based on the matric results may have predicted various types of academic performance in the Faculty amongst those selected and admitted. The thesis also exhibits a coherent and fairly complete methodology that is applicable at general or at particular levels of student performance data analysis on a continuing year-to-year basis. The particular statistical methods and techniques in this study have been summarised and discussed in the three Appendices.
|
15 |
The Swift Tern Sterna bergii in Southern Africa : growth and movementLe Roux, Janine January 2006 (has links)
Inlcudes bibliographical references.
|
16 |
Robben Island penguin pressure model: a decision support tool for an ecosystems approach to fisheries managementCecchini, Lee-Anne January 2012 (has links)
Includes bibliographical references. / The African penguin (Spheniscus demersus) population in southern Africa has declined from approximately 575 000 adults at the start of the 20th century to 180 000 adults in the early 1990s. The population is still declining, leading to the International Union for the Conservation of Nature upgrading the status of African penguins to Endangered on the Red List of Threatened Species. This dissertation uses a systems dynamics approach to produce a model incorporating all important pressures. The model is stochastic and spatially explicit, and uses expert opinion where data are not available. The model has been produced and revised with the help of the Penguin Modelling Group, based at the University of Cape Town. The modelling process culminated in a workshop where participants experimented with the model themselves. The model in this dissertation is only applicable to the penguin population on Robben Island and, as such, conclusions drawn cannot necessarily be applied to other penguin colonies.
|
17 |
Modelling growth patterns of bird species using non-linear mixed effects modelsNtirampeba, D January 2008 (has links)
Includes bibliographical references. / The analysis of growth data is important as it allows us to assess how fast things grow and determine various factors that have impact on their growth. In the current study, growth measurements on body features (body mass, wing length, head length, bill (culmen) length, foot length, and tarsus length) for Grey-headed Gulls populating Bonaero Park and Modderfontein Pan in Gauteng province, South Africa, and for Swift Terns on Robben Island were taken. Different methods such as polynomial regressions, non-parametric models and non-linear mixed effects models have been used to fit models to growth data. In recent years, non-linear mixed effects models have become an important tool for growth models. We have fitted univariate inverse exponential, Gompertz, logistic, and Richards non-linear mixed effects models to each of the six body features. We have modeled these six features simultaneously by adding a categorical covariate, which distinguishes between different features, to the model. This approach allows for straightforward comparison of growth between the different body features. In growth studies, the knowledge of the age of each individual is an essential information for growth analysis. For Swift Terns, the exact age of most chicks was unknown, but a small portion of the sample was followed from nestling up to the end of the study period. For chicks with unknown age, we estimated age by fitting the growth curve, obtained from birds with known age, to the mass measurements of the chick with unknown age. It was found that the logistic models were most appropriate to describe the growth of body mass and wing length while the Gompertz models provided best fits for bill, tarsus, head and foot for Grey-headed Gulls. For Swift Terns, the inverse exponential model provided the best univariate fit for four of six features. The logistic model, with a variance function increasing as a power of fitted values, with a different power for each feature and autoregressive correlation structure for within bird errors with errors from different features within the same subject assumed to be independent, gave the best model to describe the growth of all body features taken simultaneously for both Grey-headed Gull and Swift Tern data. It was shown that growth of Grey-headed Gull and Swift Tern chicks occurs in the following order (foot, body mass, tarsus)-(bill, head)-( wing) and (tarsus, foot)-(body mass, bill, head)-(wing) , respectively.
|
18 |
Joint models for nonlinear longitudinal profiles in the presence of informative censoringChatora, Tinashe 18 February 2019 (has links)
Malaria is the parasitic disease which affects the most humans, with Plasmodium falciparum malaria being responsible for the majority of severe malaria and malaria related deaths. The asexual form of the parasite causes the signs and symptoms associated with malaria infection. The sexual form of the parasite, also known as a gametocyte, is the stage responsible for infectivity of the human host (patient) to the mosquito vector, and thus ongoing transmission of malaria and the spread of antimalarial drug resistance. Historically malaria therapeutic efficacy studies have focused mainly on the clearance of asexual parasites. However, malaria in a community can only be truly combated if a treatment program is implemented which is able to clear both asexual and sexual parasites effectively. In this thesis focus will be on the modeling of the key features of gametocytemia. Particular emphasis will be on the modeling of the time to gametocyte emergence, the density of gametocytes and the duration of gametocytemia. It is also of interest to investigate the impact of the administered treatment on the aforementioned features. Gametocyte data has several interesting features. Firstly, the distribution of gametocyte data is zero-inflated with a long tail to the right. The observed longitudinal gametocyte profile also has a nonlinear relationship with time. In addition, since most malaria intervention studies are not designed to optimally measure the evolution of the longitudinal gametocyte profile, there are very few observation points in the time period where the gametocyte profile is expected to peak. Gametocyte data collected from malaria intervention studies are also affected by informative censoring, which leads to incomplete gametocyte profiles. An example of informative censoring is when a patient who experiences treatment failure is “rescued", and withdrawn, from the study in order to receive alternative treatment. This patient can be considered to be in worse health as compared to the patients who remain in this study. There are also competing risks of exit from the study, as a patient can either experience treatment failure or be lost to follow-up. The above mentioned features of gametocyte data make it a statistically appealing dataset to analyze. In literature there are several modeling techniques which can be used to analyze individual features of the data. These techniques include standard survival models for modeling the time to gametocyte emergence and the duration of gametocytemia. The longitudinal nonlinear gametocyte profile would typically be modeled using nonlinear mixed effect models. These nonlinear models could then subsequently be extended to accommodate the zero-inflation in the data, by changing the underlying assumption around the distribution of the response variable. However, it is important to note that these standard techniques do not account for informative censoring. Failure to account for informative censoring leads to bias in parameter estimates. Joint modeling techniques can be used to account for informative censoring. The joint models applied in this thesis combined the longitudinal nonlinear gametocyte densities and the time to censoring due to either lost to follow up or treatment failure. The data analyzed in this thesis were collected from a series of clinical trials conducted be- tween 2002 and 2004 in Mozambique and the Mpumulanga province of South Africa. These trials were a part of the South East African Combination Antimalarial Therapy (SEACAT) evaluation of the phased introduction of combination anti-malarial therapy, nested in the Lubombo Spatial Development Initiative. The aim of these studies was primarily to measure the efficacy of sulfadoxine-pyrimethamine (SP) and a combination of artesunate and sulfadoxine-pyrimethamine (ACT), in eliminating asexual parasites in patients. The patients enrolled in the study had uncomplicated malaria, at a time of increasing resistance to sulfadoxine-pyrimethamine (SP) treatment. Blood samples were taken from patients during the course of 6 weeks on days 0, 1, 2, 3, 7, 14, 21, 28 and 42. Analysis of these blood samples provided longitudinal measurements for asexual 1 parasite densities, gametocyte densities, sulfadoxine drug concentrations and pyrimethamine drug concentrations. The gametocyte data collected in this study was initially analyzed using standard survival modeling techniques. Non-parametric Cox regression models and parametric survival models were applied to the data as part of this initial investigation. These models were used to investigate the factors which affected the time to gametocyte emergence. Subsequently, using the subset of the population which experienced gametocytemia, accelerated failure time models were applied to investigate the factors which affected the duration of gametocytemia. It is evident that the findings from the aforementioned duration investigation would only be able to provide valid duration estimates for patients who were detected to have gametocytemia. This work was extended to allow for population level duration estimates by incorporating the prevalence of gametocytemia into the estimation of duration, for generic patients with specific covariate patterns. The prevalence of gametocytemia was modeled using an underlying binomial distribution. The delta method was subsequently used to derive confidence intervals for the population level duration estimates which were associated with specific covariate patterns. An investigation into the factors affecting the early withdrawal of patients from the study was also conducted. Early exit from the study arose either through loss to follow-up (LTFU) or through treatment failure. The longitudinal gametocyte profile was modeled using joint modeling techniques. The resulting joint model used shared random effects to combine a Weibull survival model, describing the cause- specific hazards of patient exit from the study, with a nonlinear zero-adjusted gamma mixed effect model for the longitudinal gametocyte profile. This model was used to impute the incomplete gametocyte profiles, after adjusting for informative censoring. These imputed profiles were then used to estimate the duration of gametocytemia. It was found, in this thesis, that treatment had a very strong effect on the hazard of gametocyte emergence, density of gametocytes and the duration of gametocytemia. Patients who received a combination of sulfadoxine-pyrimethamine and artesunate were found to have significantly lower hazards of gametocyte emergence, lower predicted durations of gametocytemia and lower predicted longitudinal gametocyte densities as compared to patients who received sulfadoxine-pyrimethamine treatment only.
|
19 |
Selecting the best model for predicting a term deposit product take-up in bankingHlongwane, Rivalani Willie 19 February 2019 (has links)
In this study, we use data mining techniques to build predictive models on data collected by a Portuguese bank through a term savings product campaign conducted between May 2008 and November 2010. This data is imbalanced, given an observed take-up rate of 11.27%. Ling et al. (1998) indicated that predictive models built on imbalanced data tend to yield low sensitivity and high specificity, an indication of low true positive and high true negative rates. Our study confirms this finding. We, therefore, use three sampling techniques, namely, under-sampling, oversampling and Synthetic Minority Over-sampling Technique, to balance the data, this results in three additional datasets to use for modelling. We build the following predictive models: random forest, multivariate adaptive regression splines, neural network and support vector machine on the datasets and we compare the models against each other for their ability to identify customers that are likely to take-up a term savings product. As part of the model building process, we investigate parameter permutations related to each modelling technique to tune the models, we find that this assists in building robust models. We assess our models for predictive performance through the use of the receiver operating characteristic curve, confusion matrix, GINI, kappa, sensitivity, specificity, and lift and gains charts. A multivariate adaptive regression splines model built on over-sampled data is found to be the best model for predicting term savings product takeup.
|
20 |
SÚS a rozvoj statistické vědy v meziválečném období / Czechoslovak State statistical office in the interwar period and statistical science developmentHouska, Lukáš January 2010 (has links)
The thesis focuses on the creation and functioning of the State statistical office and its contribution to the statistical science and theory development. The main goal of the thesis is to make the readers acquainted with the first period of the czechoslovak state statistics and enable them to get a thorough look into the institution's publication activities. In this concept the thesis is divided into three parts. In the first one the "modus operandi" of the statistical office itself is described, the second part comes up with the State statistical office's most influential personalities' biografical data. The third part brings the description and analysis ot books, magasines and other pieces publication. At the conclusion of the third section the key works of the statistical theory are analysed. The enclosure of the thesis implies the attachment with published laws and regulations of the Czechoslovak republic, which are directly tied to the statistical office's activities, and also the list of pieces published int the two key editions of the publication system. The contribution of the thesis is in the complex insight on the topic of the czechoslovak statistics in the interwar period. By now only some fragments have been compiled and described.
|
Page generated in 0.1115 seconds