Spelling suggestions: "subject:"multinomial logistic degression"" "subject:"multinomial logistic aregression""
1 |
Classification of Bone Cements Using Multinomial Logistic Regression MethodWei, Jinglun 29 April 2018 (has links)
Bone cement surgery is a new technique widely used in medical field nowadays. In this thesis I analyze 48 bone cement types using their content of 20 elements. My goal is to ?find a method to classify new found bone cement sample into these 48 categories. Here I will use multinomial logistic regression method to see whether it works or not. Due to the lack of observations, I generate enough data by adding white noise in proper scales to the original data again and again, and then I get a data set of over 100 times as many points as the original one. Then I use purposeful variable selection method to pick the covariates I need, rather than stepwise selection. There are 15 covariates left after the selection, and then I use my new data set to fit such a multinomial logistic regression model. The model doesn't perform that good in goodness of ?fit test, but the result is still acceptable, and the diagnostic statistics also indicate a good performance. Combined with clinical experience and prior conditions, this model is helpful in this classification case.
|
2 |
Analys av bortfall i en uppföljningsundersökning av hälsa / Analysis of attrition in a longitudinal health studyUdd, Mattias, Pettersson, Niklas January 2008 (has links)
The LSH-study started in 2003 at the department of Health and Society at the University of Linköping. The purpose of the study was to examine the relationship between life condition, stress and health. A total of 1007 people from ten different health centres in Östergötlands län participated. At the follow up, a couple of years later, 795 of the 1007 participated. 127 of the 212 in the attrition turned down the follow up, twelve people were not invited (for example in case of death) and the rest did not respond at all. The purpose of this paper is to find out in what degree the attrition in the follow up can be predicted using the information from the first survey and which variables are important. The differences between different types of attrition have also been examined. Simple and multiple bi- and multinomial logistic regression have been used in the analysis. In total 34 variables were examined and in the final model six variables remained with a significant relation to the attrition. High BMI, regular smoking, high pulse and lack of daily exercise at the first survey were connected to a higher risk for an individual to not participate at the follow up. It is interesting that these factors are considered as risk factors for unhealthy living. Other factors related to a higher attrition were unemployment in the last year before the first survey and if the individual had parents born in another country than Sweden. The risk for attrition increased gradually when more risk factors were shown by the individual. The factors contributing an individual to turn down the follow up instead of not responding at all was if he or she were in the older age segments in the survey or if they were not active in any type of association.
|
3 |
Factors Affecting Cotton Producers' Choice of Marketing OutletPace, Jason 1979- 14 March 2013 (has links)
In recent years, changes in government policies, supply and demand fundamentals and price patterns in the cotton market have led to several shifts in how producers market their cotton. This thesis examined producer cash marketing choices, including direct and indirect hedging, in four different periods since 2001. Special emphasis was placed on the 2010 season - a season characterized by historically high prices and volatility. Producer marketing behavior was modeled as a discrete choice between four different cash market outlets: forward contracting with a merchant, post-harvest cash contracting, contracting with a merchant pool and contracting with a cooperative pool. Hedging was characterized as a tool that was used in conjunction with one of the four discrete choices. This thesis employed multinomial logit estimation to determine the influence of factors on producers' choice of primary cash marketing decisions. Data were collected from a mail survey of the population of cotton growers in Texas, Oklahoma and Kansas. The most important determinants of cotton cash marketing choices were 1) prior participation in cooperative pools, beliefs about the value of pre-harvest pricing, beliefs about the performance of merchant pools, willingness to accept lower prices to reduce risk, and several socio-economic variables.
|
4 |
Functionality Classification Filter for WebsitesJärvstråt, Lotta January 2013 (has links)
The objective of this thesis is to evaluate different models and methods for website classification. The websites are classified based on their functionality, in this case specifically whether they are forums, news sites or blogs. The analysis aims at solving a search engine problem, which means that it is interesting to know from which categories in a information search the results come. The data consists of two datasets, extracted from the web in January and April 2013. Together these data sets consist of approximately 40.000 observations, with each observation being the extracted text from the website. Approximately 7.000 new word variables were subsequently created from this text, as were variables based on Latent Dirichlet Allocation. One variable (the number of links) was created using the HTML-code for the web site. These data sets are used both in multinomial logistic regression with Lasso regularization, and to create a Naive Bayes classifier. The best classifier for the data material studied was achieved when using Lasso for all variables with multinomial logistic regression to reduce the number of variables. The accuracy of this model is 99.70 %. When time dependency of the models is considered, using the first data to make the model and the second data for testing, the accuracy, however, is only 90.74 %. This indicates that the data is time dependent and that websites topics change over time.
|
5 |
Analys av bortfall i en uppföljningsundersökning av hälsa / Analysis of attrition in a longitudinal health studyUdd, Mattias, Pettersson, Niklas January 2008 (has links)
<p>The LSH-study started in 2003 at the department of Health and Society at the University of Linköping. The purpose of the study was to examine the relationship between life condition, stress and health. A total of 1007 people from ten different health centres in Östergötlands län participated. At the follow up, a couple of years later, 795 of the 1007 participated. 127 of the 212 in the attrition turned down the follow up, twelve people were not invited (for example in case of death) and the rest did not respond at all. The purpose of this paper is to find out in what degree the attrition in the follow up can be predicted using the information from the first survey and which variables are important. The differences between different types of attrition have also been examined. Simple and multiple bi- and multinomial logistic regression have been used in the analysis.</p><p>In total 34 variables were examined and in the final model six variables remained with a significant relation to the attrition. High BMI, regular smoking, high pulse and lack of daily exercise at the first survey were connected to a higher risk for an individual to not participate at the follow up. It is interesting that these factors are considered as risk factors for unhealthy living. Other factors related to a higher attrition were unemployment in the last year before the first survey and if the individual had parents born in another country than Sweden. The risk for attrition increased gradually when more risk factors were shown by the individual. The factors contributing an individual to turn down the follow up instead of not responding at all was if he or she were in the older age segments in the survey or if they were not active in any type of association.</p>
|
6 |
Eine empirische Analyse des individuellen Verkehrsmittelwahlverhaltens am Beispiel der Stadt DresdenSchletze, Matthias 15 December 2015 (has links) (PDF)
Das Verkehrsmittelwahlverhalten von Menschen ist komplex. So spielen soziodemografische, sozioökonomische sowie raum- und siedlungsstrukturelle Merkmale eine Rolle. In dieser Arbeit wird dieses Verhalten untersucht. Dabei wird eine homogene Grundgesamtheit geschaffen, welche alle Personen beinhaltet, die sowohl über eine Dauerkarte des öffentlichen Personenverkehrs als auch einen Personenkraftwagen verfügen. Anhand derer soll eine deskriptive Analyse und eine multinomiale logistische Regression Aufschluss geben, ob es Unterschiede zwischen den jeweiligen Nutzergruppen gibt.
So lässt sich die Gruppe der ÖV-Nutzer durch folgende Charakteristiken beschreiben: der Großteil sind Frauen, sowie Personen, die eine hohe schulische und berufliche Bildung besitzen. Des Weiteren werden eher weniger Wege mit dem ÖV als mit dem PKW zurückgelegt. Erwerbstätige hingegen entscheiden sich eher für den PKW. / Human behavior towards the choice of transportation varies in very complex ways such as sociodemographics, socioeconomics as well as settlement structures. For this paper a homogenous population is created from season ticket holders for public transportation and car owners. Based on this population a descriptive analysis followed by a multinomial logistic regression is supposed to generate the differences between the user groups.
The group of users of the public transportation system can be characterized as followed: the majority of users are women as well as highly educated people. Within this specific group distances are more likely to be covered by public transportation rather than by car. However the working population prefers to go by passenger car.
|
7 |
Predicting essay grades for the Swedish national writing test based on the new grading scale A-FLöfving, Jimmy January 2019 (has links)
Based on the curriculum of 2011 a new grading scale ranging from A-F was introduced in the Swedish upper secondary school system. Previous research on similar data have focused on the earlier grading scale, and its crucial that the new circumstances are addressed to understand the impact on grading. Using 348 essays from the national writing test this study investigates the use of automated essay scoring as a way of grading in this new setting. Using various classication methods the models for younger students outperform the corresponding models for older students. This implies that it is harder to predict grades on essays written by older students. Based on the current data the result shows that with the new grading scale the use of automated essay scoring should be used with caution.
|
8 |
Incorporating survey weights into logistic regression modelsWang, Jie 24 April 2013 (has links)
Incorporating survey weights into likelihood-based analysis is a controversial issue because the sampling weights are not simply equal to the reciprocal of selection probabilities but they are adjusted for various characteristics such as age, race, etc. Some adjustments are based on nonresponses as well. This adjustment is accomplished using a combination of probability calculations. When we build a logistic regression model to predict categorical outcomes with survey data, the sampling weights should be considered if the sampling design does not give each individual an equal chance of being selected in the sample. We rescale these weights to sum to an equivalent sample size because the variance is too small with the original weights. These new weights are called the adjusted weights. The old method is to apply quasi-likelihood maximization to make estimation with the adjusted weights. We develop a new method based on the correct likelihood for logistic regression to include the adjusted weights. In the new method, the adjusted weights are further used to adjust for both covariates and intercepts. We explore the differences and similarities between the quasi-likelihood and the correct likelihood methods. We use both binary logistic regression model and multinomial logistic regression model to estimate parameters and apply the methods to body mass index data from the Third National Health and Nutrition Examination Survey. The results show some similarities and differences between the old and new methods in parameter estimates, standard errors and statistical p-values.
|
9 |
Využití logistické regrese ve výzkumu trhu / The use of logistic regression in the market researchBrabcová, Hana January 2009 (has links)
The aim of this work is to decide the real usage of logistic regression in the market research tasks respecting the needs of final users of research results. The main argument for the final decision is the comparison of its output to the output of an alternative classification method used in practice -- a classification tree method. The topic is divided into three parts. The first part describes the theoretical framework and approaches linked to logistic regression (chapter 2 and 3). The second part analyses the experience with the usage of logistic regression in Czech market research companies (chapter 4) and the topic is closed by applying the method on real data and comparing the output to the classification tree output (chapter 5 and 6).
|
10 |
Application of factor analysis to the 2009 general household survey in South AfricaMonyai, Simon Malesela January 2015 (has links)
Thesis (M.Sc. (Statistics)) -- University of Limpopo, 2015 / Introduction: The high number of variables from the 2009 General Household Survey is prohibitive to do holistic analysis of data due to high correlations that exist among many variables, making it virtually impractical to apply traditional methods such as multinomial logistic regression. The purpose of this study to identify observed variables that can be explained by a few unobservable quantities called factors, using factor analysis. Methods: Factor analysis is used to describe covariance relationships among 162 variables of interest in the 2009 General Household Survey (GHS) and 2009 Quarterly Labour Force Survey of South Africa (QLFS). Data for the respondents aged 15 years and above was analysed by first applying factor analysis to the 162 variables to produce factor scores and develop models for five core areas: education, health, housing, labour force and social development. Multinomial logistic regression was then used to model educational levels and service satisfaction using identified factor sores. Results: The variability among the 162 variables of interest was described by only 29 factors identified using factor analysis, even though these factors are not measured directly. Multinomial logistic regression (MLR) analysis showed negative and significant impact of education factors (fees too high, violence and absence of parental care) on levels of educational attainment. “Historically advantaged” factor is the only factor significant and positively affects educational levels. Housing and social development factors were regressed against service satisfaction. Housing factors such as the home owners, age of a house and male household heads were found to be significant. Social development factors such as “no problem with health”, sufficient water, high income, household size and telephone access were found to be significant. Labour force factors such as employment, industrial business and occupation, employment history and long-term unemployment have positive and significant impact on levels of education. Conclusion: It can be concluded that factor analysis as a data reduction technique has managed to describe the variability among the 162 variables in terms of just 29 unobservable variables. Using MLR in subsequent analysis, this study has managed to identify factors positively or negatively associated with educational levels and service satisfaction. The study suggests that educational, housing, social development and labour force facilities should be improved and education should be used to improve life circumstances. Keywords: factor analysis, factors, multinomial logistic regression, logits, educational levels of attainment, service satisfaction, quality of service delivery. / DST-NRF, Centre of Excellence in Mathematical and Statistical Sciences (MaSS)
|
Page generated in 0.1215 seconds