The diploma thesis deals with the problem of automated searching for reviews on web pages and also the saving of found reviews. In this work are described in detail possibilities of storing unstructured data and subsequent selection of the most suitable storage. The main part of the work deals with the analysis of HTML structure, so that it is possible to find the required information on the website. This work also deals with ways to determine the similarity of text strings in order to determine what product the review found belongs to. The Python programming language was used for implementation.
Identifer | oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:429000 |
Date | January 2019 |
Creators | Voráč, Tomáš |
Source Sets | Czech ETDs |
Language | Czech |
Detected Language | English |
Type | info:eu-repo/semantics/masterThesis |
Rights | info:eu-repo/semantics/restrictedAccess |
Page generated in 0.0179 seconds