Global ETD Search

Return to search

Privacy preservation for training datasets in database: application to decision tree learning

Privacy preservation is important for machine learning and datamining, but measures
designed to protect private information sometimes result in a trade off: reduced utility of the training samples. This thesis introduces a privacy preserving approach that can be applied to decision-tree learning, without concomitant loss of accuracy. It describes an approach to the preservation of privacy of collected data samples in cases when information of the sample database has been partially lost. This approach converts the original sample datasets into a group of unreal datasets, where an original sample cannot be reconstructed without the entire group of unreal datasets. This approach does not
perform well for sample datasets with low frequency, or when there is low variance in the distribution of all samples. However, this problem can be solved through a modified
implementation of the approach introduced later in this thesis, by using some extra storage.

http://hdl.handle.net/1828/1291

Identifer	oai:union.ndltd.org:uvic.ca/oai:dspace.library.uvic.ca:1828/1291
Date	15 December 2008
Creators	Fong, Pui Kuen
Contributors	Weber, Jens H.
Source Sets	University of Victoria
Language	English, English
Detected Language	English
Type	Thesis
Rights	Available to the World Wide Web

Page generated in 0.0018 seconds

Privacy preservation for training datasets in database: application to decision tree learning

Description

Links & Downloads

Tags

Additional Fields