Global ETD Search

Return to search

Reprezentace textu a její vliv na kategorizaci / Representation of Text and Its Influence on Categorization

The thesis deals with machine processing of textual data. In the theoretical part, issues related to natural language processing are described and different ways of pre-processing and representation of text are also introduced. The thesis also focuses on the usage of N-grams as features for document representation and describes some algorithms used for their extraction. The next part includes an outline of classification methods used. In the practical part, an application for pre-processing and creation of different textual data representations is suggested and implemented. Within the experiments made, the influence of these representations on accuracy of classification algorithms is analysed.

http://www.nusl.cz/ntk/nusl-237263

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:237263
Date	January 2010
Creators	Šabatka, Ondřej
Contributors	Chmelař, Petr, Bartík, Vladimír
Publisher	Vysoké učení technické v Brně. Fakulta informačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0025 seconds

Reprezentace textu a její vliv na kategorizaci / Representation of Text and Its Influence on Categorization

Description

Links & Downloads

Tags

Additional Fields