Besides the social survey data, texts have been an important source of sociological data since the beginning of the development of sociological methodology. Text analysis methods contain two main branches of development: Bernard Berelson's content analysis and Hans-Georg Gadamer's hermeneutic analysis. Both these methodological branches have been influenced by the development of information technologies in the last twenty years. The thesis presented here deals with one of the methods of computer text analysis (CATA), which stands on the border between these two methodological streams, a method of analyzing words' collocations in texts. The thesis presents the method in the context of other methods of text analysis, and mentions sources of inspiration for further development of these methods - corpus linguistics and text mining. The second part discusses the different steps of words' collocation analysis: building a text corpus, dictionary compilation, calculation of data matrix and visualisation of words' distances using multidimensional scaling (MDS). The method is also applied to a specific data, two text corpora compiled from transcripts of biographical interviews with actors of Czechoslovak normalization - with dissidents and Communist functionaries. Quality of the models is assessed, depending...
Identifer | oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:307460 |
Date | January 2012 |
Creators | Čepelák, Václav |
Contributors | Hájek, Martin, Soukup, Petr |
Source Sets | Czech ETDs |
Language | Czech |
Detected Language | English |
Type | info:eu-repo/semantics/masterThesis |
Rights | info:eu-repo/semantics/restrictedAccess |
Page generated in 0.0023 seconds