Global ETD Search

Return to search

Optimizing Sample Design for Approximate Query Processing

The rapid increase of data volumes makes sampling a crucial component of modern data management systems. Although there is a large body of work on database sampling, the problem of automatically determine the optimal sample for a given query remained (almost) unaddressed. To tackle this problem the authors propose a sample advisor based on a novel cost model. Primarily designed for advising samples of a few queries specified by an expert, the authors additionally propose two extensions of the sample advisor. The first extension enhances the applicability by utilizing recorded workload information and taking memory bounds into account. The second extension increases the effectiveness by merging samples in case of overlapping pieces of sample advice. For both extensions, the authors present exact and heuristic solutions. Within their evaluation, the authors analyze the properties of the cost model and demonstrate the effectiveness and the efficiency of the heuristic solutions with a variety of experiments.

info:eu-repo/classification/ddc/650

ddc:650

Identifer	oai:union.ndltd.org:DRESDEN/oai:qucosa:de:qucosa:72930
Date	30 November 2020
Creators	Rösch, Philipp, Lehner, Wolfgang
Publisher	IGI Global
Source Sets	Hochschulschriftenserver (HSSS) der SLUB Dresden
Language	English
Detected Language	English
Type	info:eu-repo/semantics/publishedVersion, doc-type:article, info:eu-repo/semantics/article, doc-type:Text
Rights	info:eu-repo/semantics/openAccess
Relation	2155-6407, 10.4018/ijkbo.2013100101

Page generated in 0.0023 seconds

Optimizing Sample Design for Approximate Query Processing

Description

Links & Downloads

Tags

Additional Fields