Mining frequent itemsets from uncertain data: extensions to constrained mining and stream mining

dc.contributor.authorHao, Boyu
dc.contributor.examiningcommitteeScuse, David H. (Computer Science) Wang, Xikui (Statistics)en
dc.contributor.supervisorLeung, Carson K. (Computer Science)en
dc.date.accessioned2010-07-19T15:26:03Z
dc.date.available2010-07-19T15:26:03Z
dc.date.issued2010-07-19T15:26:03Z
dc.degree.disciplineComputer Scienceen_US
dc.degree.levelMaster of Science (M.Sc.)en_US
dc.description.abstractMost studies on frequent itemset mining focus on mining precise data. However, there are situations in which the data are uncertain. This leads to the mining of uncertain data. There are also situations in which users are only interested in frequent itemsets that satisfy user-specified aggregate constraints. This leads to constrained mining of uncertain data. Moreover, floods of uncertain data can be produced in many other situations. This leads to stream mining of uncertain data. In this M.Sc. thesis, we propose algorithms to deal with all these situations. We first design a tree-based mining algorithm to find all frequent itemsets from databases of uncertain data. We then extend it to mine databases of uncertain data for only those frequent itemsets that satisfy user-specified aggregate constraints and to mine streams of uncertain data for all frequent itemsets. Experimental results show the effectiveness of all these algorithms.en
dc.description.noteOctober 2010en
dc.format.extent8243386 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.citationLeung, C.K.-S., Carmichael, C.L., Hao, B. (2007). Efficient mining of frequent patterns from uncertain data. In Proc. IEEE ICDM Workshops 2007: 489-494.en
dc.identifier.citationLeung, C.K.-S., Hao, B. (2009). Mining of frequent itemsets from streams of uncertain data. In Proc. IEEE ICDE 2009: 1663-1670.en
dc.identifier.citationLeung, C.K.-S., Hao, B., Jiang, F. (2010). Constrained frequent itemset mining from uncertain data streams. In Proc. IEEE ICDE Workshops 2010: 120-127.en
dc.identifier.citationLeung, C.K.-S., Hao, B., Brajczuk, D.A. (2010). Mining uncertain data for frequent itemsets that satisfy aggregate constraints. In Proc. ACM SAC 2010: 1034-1038.en
dc.identifier.urihttp://hdl.handle.net/1993/4034
dc.language.isoengen_US
dc.rightsopen accessen_US
dc.subjectData Miningen
dc.subjectDatabasesen
dc.titleMining frequent itemsets from uncertain data: extensions to constrained mining and stream miningen
dc.typemaster thesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Hao_B_MSc-Mining_frequent_itemsets.pdf
Size:
7.87 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.33 KB
Format:
Item-specific license agreed to upon submission
Description: