This work contains a brief overview of technologies for representation and obtaining data on WWW and describes selected web data extraction tools. The work designs a new tool for obtaining pages generated by filling in web forms, which allows its user to define data on such web pages and which can extract those data and offer it in a XML format suitable for future machine processing.
Identifer | oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:236686 |
Date | January 2009 |
Creators | Puna, Petr |
Contributors | Kunc, Michael, Burget, Radek |
Publisher | Vysoké učení technické v Brně. Fakulta informačních technologií |
Source Sets | Czech ETDs |
Language | Czech |
Detected Language | English |
Type | info:eu-repo/semantics/masterThesis |
Rights | info:eu-repo/semantics/restrictedAccess |
Page generated in 0.0077 seconds