This thesis focuses on cluster analysis in text mining and its application to real data. Its aim is to find suitable categories (clusters) in transcribed calls recorded in the contact center of Česká pojišťovna a.s. by mapping these text documents into a vector space using basic text mining methods and then applying the implemented clustering algorithms. Formally, the thesis describes the preprocessing and representation of textual data, several common clustering methods, cluster validation, and the application itself.
Identifier | oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:382412 |
Date | January 2018 |
Creators | Zubková, Kateřina |
Contributors | Karpíšek, Zdeněk, Žák, Libor |
Publisher | Vysoké učení technické v Brně. Fakulta strojního inženýrství |
Source Sets | Czech ETDs |
Language | Czech |
Detected Language | English |
Type | info:eu-repo/semantics/masterThesis |
Rights | info:eu-repo/semantics/restrictedAccess |