Global ETD Search

Return to search

Metody shlukování textových dat / Textual Data Clustering Methods

Clustering of text data is one of tasks of text mining. It divides documents into the different categories that are based on their similarities. These categories help to easily search in the documents. This thesis describes the current methods that are used for the text document clustering. From these methods we chose Simultaneous keyword identification and clustering of text documents (SKWIC). It should achieve better results than the standard clustering algorithms such as k-means. There is designed and implemented an application for this algorithm. In the end, we compare SKWIC with a k-means algorithm.

http://www.nusl.cz/ntk/nusl-237060

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:237060
Date	January 2011
Creators	Miloš, Roman
Contributors	Burgetová, Ivana, Bartík, Vladimír
Publisher	Vysoké učení technické v Brně. Fakulta informačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0016 seconds

Metody shlukování textových dat / Textual Data Clustering Methods

Description

Links & Downloads

Tags

Additional Fields