Return to search

Velká data - extrakce klíčových informací pomocí metod matematické statistiky a strojového učení / Big data - extraction of key information combining methods of mathematical statistics and machine learning

This thesis is concerned with data analysis, especially with principal component analysis and its sparse modi cation (SPCA), which is NP-hard-to- solve. SPCA problem can be recast into the regression framework in which spar- sity is usually induced with ℓ1-penalty. In the thesis, we propose to use iteratively reweighted ℓ2-penalty instead of the aforementioned ℓ1-approach. We compare the resulting algorithm with several well-known approaches to SPCA using both simulation study and interesting practical example in which we analyze voting re- cords of the Parliament of the Czech Republic. We show experimentally that the proposed algorithm outperforms the other considered algorithms. We also prove convergence of both the proposed algorithm and the original regression-based approach to PCA. vi

Identiferoai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:357228
Date January 2017
CreatorsMasák, Tomáš
ContributorsAntoch, Jaromír, Maciak, Matúš
Source SetsCzech ETDs
LanguageCzech
Detected LanguageEnglish
Typeinfo:eu-repo/semantics/masterThesis
Rightsinfo:eu-repo/semantics/restrictedAccess

Page generated in 0.0022 seconds