The aim of this diploma thesis is a detail description of current possibilities and ways of storing open data. It focuses on tools and database systems used for storing linked open data as well as on the selection of such systems for subsequent analysis and comparison. The practical part of the thesis then focuses on the comparison of selected systems based on a selected use case. This thesis introduces the fundamental terms and concepts concerning linked open data. Besides that, various approaches and formats for storing linked open data (namely file ori-ented approaches and database approaches) are analyzed. . The thesis also focuses on the RDF format and database systems. Ten triplestore database solutions (solutions for storing data in the RDF format) are introduced and described briefly. Out of these, three are cho-sen for a detailed analysis by which they are compared with one another and with a rela-tional database system. The core of the detail analysis lies in performance benchmarks. Ex-isting performance oriented benchmarks of triplestore systems are described and analyzed. In addition to that, the thesis introduces a newly developed benchmark as a collection of database queries. The benchmark is then used for the performance testing. The following systems have been tested: Apache Jena TDB/Fuseki, OpenLink Virtuoso, Oracle Spatial and Graph a Microsoft SQL Server. The main contribution of this thesis consists in a comprehensive presentation of current possibilities of storing linked open data.
Identifer | oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:262273 |
Date | January 2015 |
Creators | Hanuš, Jiří |
Contributors | Chlapek, Dušan, Kučera, Jan |
Publisher | Vysoká škola ekonomická v Praze |
Source Sets | Czech ETDs |
Language | Czech |
Detected Language | English |
Type | info:eu-repo/semantics/masterThesis |
Rights | info:eu-repo/semantics/restrictedAccess |
Page generated in 0.0023 seconds