Global ETD Search

531	Faktory ovlivňující spokojenost doktorandů se zázemím pro studium / Factors influencing the satisfaction with facilities for PhD studies Paul, Miroslav January 2016 (has links) This diploma thesis deals with the satisfaction of PhD students with facilities for the study by means of data gained from DOKTORANDI 2014 survey. The aim of the thesis is to identify factors that influence the satisfaction with facilities for PhD studies and finding similarities among different fields of studies according to satisfaction with facilities. The first part of this thesis contains a description of higher education with a focus on PhD programs and a description of statistical methods that are subsequently used in analytical part and a description of DOKTORANDI 2014 survey. The analytical part aims to answer the questions which factors affect the PhD students´ satisfaction with facilities for study using logistic regression and decision trees. Further it tries to determine the satisfaction similarities of PhD study fields with facilities for studying using cluster analysis.
532	Modelování vývoje úmrtnosti v České republice / Modeling of the Mortality Development in the Czech Republic Hejdová, Martina January 2011 (has links) The thesis deals with a mortality modelling in the Czech Republic. The main aim of this thesis is to describe historical development of mortality in the Czech Republic on the basis of multinomial regression. In a theoretical part the basic knowledge demanded for model construction divided into two chapters can be found. In the first chapter the basic difference between linear and generalized linear model is described. Second chapter is devoted to a logistic regression. Here we proceed from simple, binomial variable and then we generalize it for multinomial variable. Practical part deals with a construction of a described model. In the third chapter data (type and source) are described and in the fourth chapter characterizes all three models itself (two of them are descriptive and the third one is used for simple prediction). The prediction is added rather for completeness than for the importance of the work itself because it was not the main goal.
533	Vývoj situace juniorů a seniorů v ČR / The development of the situation of juniors and seniors Siegelová, Klára January 2011 (has links) The final thesis deals with social situations juniors and seniors in selected countries of the European Union. The thesis monitors changes in social developments primarily in terms of income, education, and especially of unemployment. The selected period is the period from approximately 2005 to 2011, in some cases up to 2013. The aim of this thesis is the statistical analysis of the data set EU-SILC for 2005 and 2010 of Czech Republic, Slovakia, Poland, Germany, France and Spain with focusing on income, education and unemployment among age groups.
534	Využití metod data miningu při analýze kreditních dat / Using data mining methods in the analysis of credit risk data Tvaroh, Tomáš January 2013 (has links) This thesis focuses on comparison of selected data mining methods for solving classification tasks with the method of logistic regression. First part of the thesis briefly introduces data mining as a scientific discipline and classification task is shown in the context of knowledge data discovery. Next part explains the principle of particular methods amongst which, along with logistic regression, artificial neural networks, classification decision trees and Support Vector Machine method were selected. Together with mathematical background of each algorithm, demonstration of how the classification functions for new examples is mentioned. Analytical part of this thesis tests decribed methods on real-world data from the Lending Club company and they are compared based on classification accuracy. Towards the end, an evaluation of logistic regression is made in terms of whether its majority position is due to historical reasons or for its high classification accuracy compared to other methods.
535	Prediktivní modelování v oblasti řízení kreditních rizik / Predictive Modeling in Credit Risk Management Švastalová, Iva January 2012 (has links) The diploma thesis is focused on predictive modeling in credit risk management. Banks and financial institutions are mainly interested in it to estimate the probability of client's default in order to make a decision about which client will be accepted and which client will be rejected. The theoretical part includes an introduction of credit scoring and a description of discrete choice models. The linear probability model, the probit model and the logit model are described in detail. The logit model is afterwards used for the prediction of client's default. The practical part is focused on a statistical description of the dataset and a description of how to work with it before we start with the development of the credit scoring model. After that follows the estimation of the model on testing sample, its testing and the estimation of the model on full sample with a description of individual steps of calculation and outputs of the program SPSS.
536	Faktory ovlivňující finanční situaci studentů doktorských studijních programů v České republice / Factors influencing the financial situation of Ph.D. students in the Czech Republic Zahradníčková, Jana January 2015 (has links) Ph.D. students are an integral part of the tertiary education system. Encouragement for doctoral programs and their students is very important because they are the ones who will participate in research projects in the future and they will contribute to society as a whole. The majority of scholarships for Ph.D. students comes from public sources. An important question to be asked is whether the scholarships are sufficient to finance Ph.D. studies and whether there are differences in the amount depending on gender, field of study or region. This thesis aims to answer these questions by applying statistical methods to the results of the survey DOKTORANDI 2014.
537	Využití statistických metod v data miningu při predikci chování zákazníků internetového obchodu / The use of statistical methods in data mining in predicting consumer behaviour for Internet purchases Podzimková, Michaela January 2015 (has links) Data mining is a new discipline that occurs with increasing amount of stored data and the increasing need to obtain the information hidden in them. It is focused on the mining of potentially useful information from large data sets and it lies at the intersection of statistics, machine learning, artificial intelligence, databases and other areas. The aim of this thesis is to present the process of data mining with an emphasis on its connection with statistics and to describe a selection of statistical methods widely used in this field and which were also used in the applied data mining problem in this thesis. Real data from purchases in the online store show that using different methods gives different results and interesting information about purchasing behavior, and also proves that not all methods are always applicable to all types of tasks.
538	Modelos bayesianos semi-paramétricos para dados binários / Bayesian semi-parametric models for binary data Márcio Augusto Diniz 11 June 2015 (has links) Este trabalho propõe modelos Bayesiano semi-paramétricos para dados binários. O primeiro modelo é uma mistura em escala que permite lidar com discrepâncias relacionadas a curtose do modelo Logístico. É uma extensão relevante a partir do que já foi proposto por Basu e Mukhopadhyay (2000) ao possibilitar a interpretação da distribuição a priori dos parâmetros através de razões de chances. O segundo modelo usufrui da mistura em escala em conjunto com a transformação proposta por \\Yeo e Johnson (2000) possibilitando que a curtose assim como a assimetria sejam ajustadas e um parâmetro informativo de assimetria seja estimado. Esta transformação é muito mais apropriada para lidar com valores negativos do que a transformação de Box e Cox (1964) utilizada por Guerrero e Johnson (1982) e é mais simples do que o modelo proposto por Stukel (1988). Por fim, o terceiro modelo é o mais geral entre todos e consiste em uma mistura de posição e escala tal que possa descrever curtose, assimetria e também bimodalidade. O modelo proposto por Newton et al. (1996), embora, seja bastante geral, não permite uma interpretação palpável da distribuição a priori para os pesquisadores da área aplicada. A avaliação dos modelos é realizada através de medidas de distância de probabilidade Cramér-von Mises, Kolmogorov-Smirnov e Anderson-Darling e também pelas Ordenadas Preditivas Condicionais. / This work proposes semi-parametric Bayesian models for binary data. The first model is a scale mixture that allows handling discrepancies related to kurtosis of Logistic model. It is a more interesting extension than has been proposed by Basu e Mukhopadyay (1998) because this model allows the interpretation of the prior distribution of parameters using odds ratios. The second model enjoys the scale mixture together with the scale transformation proposed by Yeo and Johnson (2000) modeling the kurtosis and the asymmetry such that a parameter of asymmetry is estimated. This transformation is more appropriate to deal with negative values than the transformation of Box e Cox (1964) used by Guerrero e Johnson (1982) and simpler than the model proposed by Stukel (1988). Finally, the third model is the most general among all and consists of a location-scale mixture that can describe kurtosis and skewness also bimodality. The model proposed by Newton et al (1996), although general, does not allow a tangible interpretation of the a priori distribution for reseachers of applied area. The evaluation of the models is performed through distance measurements of distribution of probabilities Cramer-von Mises Kolmogorov-Smirnov and Anderson-Darling and also the Conditional Predictive sorted. Assimetria Dados binários Inferência bayesiana semi-paramétrica Regressão logística Asymmetry Binary data Logistic regression Semi-parametric Bayesian inference
539	Modelagem para probabilidade de frutificação em café Arábica baseado em ocupação de metâmeros / Modeling for probability of fruiting in Arabica coffee based on occupancy metameres Luiza Yoko de Barros 10 February 2017 (has links) Este é um trabalho proveniente de vários encontros com pesquisadores da Embrapa de Campinas, cujo objetivo principal se centrou em investigar a probabilidade de frutificação em árvores de Café Arábica. Para isso foram analisados 20 bancos de dados para árvores de Café Arábica de um cultivar localizado no Instituto Agronômico do Paraná em dois momentos distintos (Junho de 2010 e em Novembro - Dezembro de 2010) e consideradas as seguintes variáveis: \"OCUPAÇÃO\" (quadrática ou retangular), \"ESPAÇAMENTO\" (6.000 plantas/ha e 10.000 plantas/ha) ambas relativas ao cultivo das plantas e \"TAMANHO DO ENTRENÓ\" (medição em cm de uma partição definida da planta). Na busca de um modelo representativo de tal fenômeno, foram estudadas paralelamente tópicos relativos a alometria e assimetria dessas mesmas plantas, os quais permitiram modelar determinadas associações entre algumas estruturas como largura e comprimento de folhas. Os modelos ajustados apresentaram uma grande significância para a variável \"ESPAÇAMENTO\" nos dois tempos estudados, enquanto que a variável \"OCUPAÇÃO\" foi significativa apenas no segundo tempo e variável \"TAMANHO DO ENTRENÓ\" não foi significativa para nenhum dos tempos. A metodologia adotada para investigar essa probabilidade se deu através dos modelos de regressão logístico. Com o intuito de agregar a variável \"TEMPO\", juntou-se os dois bancos de dados em diferentes tempos e, baseado na metodologia de modelos mistos, obteve-se um modelo ajustado com retas paralelas, onde apenas a variável \"ESPAÇAMENTO\" foi considerada significativa. / This work is based on several meetings with Embrapa researchers from Campinas in an integrated study with professors from the ESALQ Department of Statistics and Agronomic Experimentation, whose main objective was to investigate the probability of fruiting in Arabica Coffee trees. In order to do so, it was analyzed 20 databases for Coffee trees of a cultivar located at the Agronomic Institute of Paraná at two different times (June 2010 and November - December 2010), where the following variables were considered: \"OCCUPATION \"(quadratic or rectangular),\"SPACING \"(6,000 plants / ha and 10,000 plants / ha), both related to plant cultivation and \"SIZE OF ENTRENO\"( measured in cm of a defined plant partition). In order to find a representative model of this phenomenon, topics related to allometry and asymmetry of these same plants were studied in parallel, which allowed to model certain associations between some structures such as width and length of leaves. The adjusted models presented a great significance for the variable \"SPACING\"in the two studied times, whereas the variable \"OCCUPATION\"was significant only in the second time and the variable \"SIZE OF THE ENTRENÓ\"was not significant for any of the times. The methodology used to investigate this probability was based on logistic regression models. In order to aggregate the variable \"TIME\", the two databases were combined at different times and, based on the methodology of mixed models, a model adjusted with parallel lines was obtained, where only the variable \"SPACING\"was considered significant. Café Arábica Dados longitudinais Metâmero Modelos mistos Regressão logística Arabica Cofee Logistic regression Longitudinal data Metamer Mixed models
540	Statistická klasifikace pomocí zobecněných lineárních modelů. / Statistical Classification by means of generalized linear models Sladká, Vladimíra January 2010 (has links) The goal of this thesis is introduce the theory of generalized linear models, namely probit and logit model. This models are especially used for medical data processing. In our concrete case these mentioned models are applied to data file obtained in teaching hospital Brno. The aim is statically analyzed immune response of child patients in dependence of twelve selected types of genes and find out which combinations of these genes influence septic state of patients.

Search results