Global ETD Search

Return to search

Dolování z dat v jazyce Python / Data Mining with Python

The main goal of this thesis was to get acquainted with the phases of data mining, with the support of the programming languages Python and R in the field of data mining and demonstration of their use in two case studies. The comparison of these languages in the field of data mining is also included. The data preprocessing phase and the mining algorithms for classification, prediction and clustering are described here. There are illustrated the most significant libraries for Python and R. In the first case study, work with time series was demonstrated using the ARIMA model and Neural Networks with precision verification using a Mean Square Error. In the second case study, the results of football matches are classificated using the K - Nearest Neighbors, Bayes Classifier, Random Forest and Logical Regression. The precision of the classification is displayed using Accuracy Score and Confusion Matrix. The work is concluded with the evaluation of the achived results and suggestions for the future improvement of the individual models.

http://www.nusl.cz/ntk/nusl-363895

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:363895
Date	January 2017
Creators	Šenovský, Jakub
Contributors	Bartík, Vladimír, Zendulka, Jaroslav
Publisher	Vysoké učení technické v Brně. Fakulta informačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0015 seconds

Dolování z dat v jazyce Python / Data Mining with Python

Description

Links & Downloads

Tags

Additional Fields