Spelling suggestions: "subject:"guía"" "subject:"guia""
1 |
Aplikace systému LISp-Miner na rozsáhlá reálná data / Using system LISp-Miner for large real dataHrnčíř, Jan January 2017 (has links)
This dissertation thesis describes an advanced method of knowledge discovery in databases (KDD), implemented in system LISp-Miner. The goal is to show the possibilities of coordinated use of analytical tools and complex procedures GUHA in this system. The thesis uses methodology CRISP-DM, which is firstly described and work is proceeded using this methodology in the following sections. The author firstly introduces readers domain area and then the data itself, which are processed to the analysis needs. Analytical questions that are answered at, are drawn from the literature, which is focused on domain area. The work should be used as a guide to LISp-Miner users, using analytical tools and procedures GUHA is therefore described the easiest way to understand.
|
2 |
Clickstream AnalysisKliegr, Tomáš January 2007 (has links)
Thesis introduces current research trends in clickstream analysis and proposes a new heuristic that could be used for dimensionality reduction of semantically enriched data in Web Usage Mining (WUM). Click-fraud and conversion fraud are identified as key prospective application areas for WUM. Thesis documents a conversion fraud vulnerability of Google Analytics and proposes defense - a new clickstream acquisition software, which collects data in sufficient granularity and structure to allow for data mining approaches to fraud detection. Three variants of K-means clustering algorithms and three association rule data mining systems are evaluated and compared on real-world web usage data.
|
3 |
Fuzzy GUHA / Fuzzy GUHARalbovský, Martin January 2006 (has links)
The GUHA method is one of the oldest methods of exploratory data analysis, which is regarded as part of the data mining or knowledge discovery in databases (KDD) scienti_c area. Unlike many other methods of data mining, the GUHA method has firm theoretical foundations in logic and statistics. In scope of the method, finding interesting knowledge corresponds to finding special formulas in satisfactory rich logical calculus, which is called observational calculus. The main topic of the thesis is application of the "fuzzy paradigm" to the GUHA method By the term "fuzzy paradigm" we mean approaches that use many-valued membership degrees or truth values, namely fuzzy set theory and fuzzy logic. The thesis does not aim to cover all the aspects of this application, it emphasises mainly on: - Association rules as the most prevalent type of formulas mined by the GUHA method - Usage of fuzzy data - Logical aspects of fuzzy association rules mining - Comparison of the GUHA theory to the mainstream fuzzy association rules - Implementation of the theory using the bit string approach The thesis throughoutly elaborates the theory of fuzzy association rules, both using the theoretical apparatus of fuzzy set theory and fuzzy logic. Fuzzy set theory is used mainly to compare the GUHA method to existing mainstream approaches to formalize fuzzy association rules, which were studied in detail. Fuzzy logic is used to define novel class of logical calculi called logical calculi of fuzzy association rules (LCFAR) for logical representation of fuzzy association rules. The problem of existence of deduction rules in LCFAR is dealt in depth. Suitable part of the proposed theory is implemented in the Ferda system using the bit string approach. In the approach, characteristics of examined objects are represented as strings of bits, which in the crisp case enables efficient computation. In order to maintain this feature also in the fuzzy case, a profound low level testing of data structures and algoritms for fuzzy bit strings have been carried out as a part of the thesis.
|
4 |
Vytěžování databáze Poradny pro poruchy metabolismu / Data mining of the database of Consulting centre for metabolism disordersSenft, Martin January 2014 (has links)
This thesis applies the data mining method of decision rules on data from Consulting centre for Metabolism disorders from University hospital Pilsen. As a tool is used the system LISp-Miner, developed at University of Economics, Prague. Decision rules found are evaluated by a specialist. The main parts of this thesis are followings: an overview on main data mining methods and results evalutation methods, description of the data mining method application on data and description and evaluation of results.
|
5 |
Využití data miningových metod při zpracování dat z demografických šetření / Using data mining methods for demographic survey data processingFišer, David January 2015 (has links)
USING DATA MINING METHODS FOR DEMOGRAPHIC SURVEY DATA PROCESSING Abstract The goal of the thesis was to describe and demonstrate principles of the process of knowledge discovery in databases - data mining (DM). In the theoretical part of the thesis, selected methods for data mining processes are described as well as basic principles of those DM techniques. In the second part of the thesis a DM task is realized in accordance to CRISP-DM methodology. Practical part of the thesis is divided into two parts and data from the survey of American Community Survey served as the basic data for the practical part of the thesis. First part contains a classification task which goal was to determinate whether the selected DM techniques can be used to solve missing data in the surveys. The success rate of classifications and following data value prediction in selected attributes was in 55-80 % range. The second part of the practical part of the thesis was then focused of determining knowledge of interest using associating rules and the GUHA method. Keywords: data mining, knowledge discovery in databases, statistic surveys, missing values, classification, association rules, GUHA method, ACS
|
6 |
Doménové znalosti, analytické otázky, systém LISp-Miner a data ADAMEK / Knowledge base, analytical questions, LISp-Mner system and ADAMEK dataKubín, Richard January 2009 (has links)
The steps associated with the analytical question solving in terms of LISp-Miner system in ADAMEK medical data are the theme of this thesis. The operating sequence of using 4ft-Miner and SD4ft-Miner procedures in ADAMEK data together with the possibility of further use of formalized background knowledge and preparing routing for automatization of the downrighted steps are the objectiv of this thesis. The summary of the basic concepts and axioms of association rules and GUHA method is the content of the theoretical part of the thesis. Operativ part starts from CRISP-DM methodology. The operating sequence enabling searching for interesting association rules in different data, that is applied on STULONG medical data afterwards in order to get instigations for it's revision, is the produce of this thesis. Used data that come from EuroMISE are concern with cardiological patients.
|
7 |
Association rule mining as a support for OLAP / Dolování asociačních pravidel jako podpora pro OLAPChudán, David January 2010 (has links)
The aim of this work is to identify the possibilities of the complementary usage of two analytical methods of data analysis, OLAP analysis and data mining represented by GUHA association rule mining. The usage of these two methods in the context of proposed scenarios on one dataset presumes a synergistic effect, surpassing the knowledge acquired by these two methods independently. This is the main contribution of the work. Another contribution is the original use of GUHA association rules where the mining is performed on aggregated data. In their abilities, GUHA association rules outperform classic association rules referred to the literature. The experiments on real data demonstrate the finding of unusual trends in data that would be very difficult to acquire using standard methods of OLAP analysis, the time consuming manual browsing of an OLAP cube. On the other hand, the actual use of association rules loses a general overview of data. It is possible to declare that these two methods complement each other very well. The part of the solution is also usage of LMCL scripting language that automates selected parts of the data mining process. The proposed recommender system would shield the user from association rules, thereby enabling common analysts ignorant of the association rules to use their possibilities. The thesis combines quantitative and qualitative research. Quantitative research is represented by experiments on a real dataset, proposal of a recommender system and implementation of the selected parts of the association rules mining process by LISp-Miner Control Language. Qualitative research is represented by structured interviews with selected experts from the fields of data mining and business intelligence who confirm the meaningfulness of the proposed methods.
|
8 |
Vytváření webové analytické zprávy z metabáze systému LISp-Miner / Creation of web-based analytics report from LISp-Miner metabase analyticsNepomucký, Pavel January 2017 (has links)
This diploma thesis deals with ways how to represent results of LISp-Miner application on the world wide web. This thesis has three main sections. The first section is dedicated to description of data analysis process including description of newly established study of infography and its application in publishing results found du-ring the DZD process. The second part describes exporting of LISp-Miner as well as exporting formats of each module and its combining with other technologies, afterwards follows summarization of all kind of exports of lispminer and its im-provements or create a whole new templates. Third part is dedicated to develop-ment of a web-based application as a tool of repsentation results generated by lispminer. The very last part is contained of future improvements of this application.
|
9 |
Empirické porovnání systémů dobývání znalostí z databází / Empirical comparison of systems for knowledge discovery in databasesBenešová, Kristýna January 2008 (has links)
S rostoucím množstvím shromažďovaných a ukládaných dat roste také potřeba a zájem majitelů těchto dat o využití jejich potenciálu k dalšímu rozhodování. Proto se vyvíjí nové přístupy a způsoby vycházející z informatiky, statistiky a oblasti strojového učení, které se této potřebě snaží vyhovět. Cílem této diplomové práce je uvést proces dobývání znalostí dat z databází na medicínských datech Tinnitus a představit systémy LISp-Miner a Weka, které daný proces podporují. Obsahem teoretické části diplomové práce je shrnutí základních charakteristik a přístupů procesu dobývání znalostí. Praktická část diplomové práce je věnována realizaci celého procesu v jednotlivých krocích. V samotném kroku modelování jsou využity již zmíněné systémy akademické LISp-Miner a Weka. Poslední část praktické části práce patří prezentaci dosažených výsledků a vlastnímu zhodnocení systémů.
|
10 |
Aplikace metod DZD na otevřená data / Use of data mining techniques for open dataProkůpek, Miroslav January 2015 (has links)
This diploma thesis examines applications of datamining methods to open data. It is realized by solving analytical questions using the LISp-Miner system. Analytical questions are examined in data from The Czech Trade Inspection Authority from the perspective of the data owner. Procedure used to solve analytical questions is 4ft-Miner. There are presented and resolved four analytical questions, which are the results of the work. Work includes a detailed description of the transformation of the relational database into a format suitable for data mining. A detailed description of the data is also included. The theoretical part deals with the GUHA method and CRISP-DM methodology.
|
Page generated in 0.0492 seconds