Global ETD Search

Return to search

Statistické metody ve stylometrii / Statistical methods in stylometry

The aim of this thesis is to provide an overview of some of the commonly used methods in the area of authorship attribution (stylometry). The text begins with a recap of history from the end of the 19th century to present time and the required terminology from the field of text mining is presented and explained. What follows is a list of selected methods from the field of multidimensional statistics (principal components analysis, cluster analysis) and machine learning (Support Vector Machines, Naive Bayes) and their application as pertains to stylometrical problems, including several methods created specifically for use in this field (bootstrap consensus tree, contrast analysis). Finally these same methods are applied to a practical problem of authorship verification based on a corpus bulit from the works of four internet writers.

http://www.nusl.cz/ntk/nusl-359246

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:359246
Date	January 2017
Creators	Dupal, Pavel
Contributors	Kaspříková, Nikola, Šulc, Zdeněk
Publisher	Vysoká škola ekonomická v Praze
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0022 seconds

Statistické metody ve stylometrii / Statistical methods in stylometry

Description

Links & Downloads

Tags

Additional Fields