Show simple item record

dc.contributor.supervisor Leung, Carson K. (Computer Science) en_US
dc.contributor.author Hayduk, Yaroslav
dc.date.accessioned 2012-04-04T16:52:58Z
dc.date.available 2012-04-04T16:52:58Z
dc.date.issued 2012-04-04
dc.identifier.uri http://hdl.handle.net/1993/5250
dc.description.abstract Frequent pattern mining from uncertain data allows data analysts to mine frequent patterns from probabilistic databases, within which each item is associated with an existential probability representing the likelihood of the presence of the item in the transaction. When compared with precise data, the solution space for mining uncertain data is often much larger due to the probabilistic nature of uncertain databases. Thus, uncertain data mining algorithms usually take substantially more time to execute. Recent studies show that the MapReduce programming model yields significant performance gains for data mining algorithms, which can be mapped to the map and reduce execution phases of MapReduce. An attractive feature of MapReduce is fault-tolerance, which permits detecting and restarting failed jobs on working machines. In this M.Sc. thesis, I explore the feasibility of applying MapReduce to frequent pattern mining of uncertain data. Specifically, I propose two algorithms for mining frequent patterns from uncertain data with MapReduce. en_US
dc.subject Data mining en_US
dc.subject Databases en_US
dc.title Mining frequent patterns from uncertain data with MapReduce en_US
dc.degree.discipline Computer Science en_US
dc.contributor.examiningcommittee Fung, Wai-Keung (Electrical and Computer Engineering) Graham, Peter C.J. (Computer Science) en_US
dc.degree.level Master of Science (M.Sc.) en_US
dc.description.note May 2012 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

View Statistics