Mining frequent patterns from uncertain data with MapReduce

dc.contributor.authorHayduk, Yaroslav
dc.contributor.examiningcommitteeFung, Wai-Keung (Electrical and Computer Engineering) Graham, Peter C.J. (Computer Science)en_US
dc.contributor.supervisorLeung, Carson K. (Computer Science)en_US
dc.date.accessioned2012-04-04T16:52:58Z
dc.date.available2012-04-04T16:52:58Z
dc.date.issued2012-04-04
dc.degree.disciplineComputer Scienceen_US
dc.degree.levelMaster of Science (M.Sc.)en_US
dc.description.abstractFrequent pattern mining from uncertain data allows data analysts to mine frequent patterns from probabilistic databases, within which each item is associated with an existential probability representing the likelihood of the presence of the item in the transaction. When compared with precise data, the solution space for mining uncertain data is often much larger due to the probabilistic nature of uncertain databases. Thus, uncertain data mining algorithms usually take substantially more time to execute. Recent studies show that the MapReduce programming model yields significant performance gains for data mining algorithms, which can be mapped to the map and reduce execution phases of MapReduce. An attractive feature of MapReduce is fault-tolerance, which permits detecting and restarting failed jobs on working machines. In this M.Sc. thesis, I explore the feasibility of applying MapReduce to frequent pattern mining of uncertain data. Specifically, I propose two algorithms for mining frequent patterns from uncertain data with MapReduce.en_US
dc.description.noteMay 2012en_US
dc.identifier.urihttp://hdl.handle.net/1993/5250
dc.language.isoengen_US
dc.rightsopen accessen_US
dc.subjectData miningen_US
dc.subjectDatabasesen_US
dc.titleMining frequent patterns from uncertain data with MapReduceen_US
dc.typemaster thesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
hayduk_yaroslav.pdf
Size:
2.22 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.25 KB
Format:
Item-specific license agreed to upon submission
Description: