Return to search

Mining frequent itemsets from uncertain data: extensions to constrained mining and stream mining

Most studies on frequent itemset mining focus on mining precise data. However, there are situations in which the data are uncertain. This leads to the mining of uncertain data. There are also situations in which users are only interested in frequent itemsets that satisfy user-specified aggregate constraints. This leads to constrained mining of uncertain data. Moreover, floods of uncertain data can be produced in many other situations. This leads to stream mining of uncertain data. In this M.Sc. thesis, we propose algorithms to deal with all these situations. We first design a tree-based mining algorithm to find all frequent itemsets from databases of uncertain data. We then extend it to mine databases of uncertain data for only those frequent itemsets that satisfy user-specified aggregate constraints and to mine streams of uncertain data for all frequent itemsets. Experimental results show the effectiveness of all these algorithms.

Identiferoai:union.ndltd.org:LACETR/oai:collectionscanada.gc.ca:MWU.1993/4034
Date19 July 2010
CreatorsHao, Boyu
ContributorsLeung, Carson K. (Computer Science), Scuse, David H. (Computer Science) Wang, Xikui (Statistics)
Source SetsLibrary and Archives Canada ETDs Repository / Centre d'archives des thèses électroniques de Bibliothèque et Archives Canada
Languageen_US
Detected LanguageEnglish

Page generated in 0.0021 seconds