This thesis is based on longitudinal data of the Swedish population provided byStatistics Sweden and is conducted on behalf of the Institute for Analytical Sociology.The focus is on investigating the effectiveness of a frequent sequence miningmethod called constrained Sequential PAttern Discovery using Equivalence classes(cSPADE). The method is applied to data on segregation within workplaces, specificallyreasons for Swedish employees moving to more segregated workplaces. Thethesis found that no unique pattern of age, gender, education, unemployment, income,workplace size or foreignness index explain why a Swedish employee movesto a more segregated workplace. Evaluating the algorithm, it was found that thenumber of observations need to be smaller or an alteration of the algorithm needsto be done to reduce the process time for this specific data set.
Identifer | oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:liu-119395 |
Date | January 2015 |
Creators | Hietala, Isak |
Publisher | Linköpings universitet, Statistik, Linköpings universitet, Tekniska fakulteten |
Source Sets | DiVA Archive at Upsalla University |
Language | English |
Detected Language | English |
Type | Student thesis, info:eu-repo/semantics/bachelorThesis, text |
Format | application/pdf |
Rights | info:eu-repo/semantics/openAccess |
Page generated in 0.0019 seconds