Scalable vertical mining for big data analytics

dc.contributor.authorZhang, Hao
dc.contributor.examiningcommitteeWang, Yang (Computer Science) Ho, Carl N.M. (Electrical and Computer Engineering)en_US
dc.contributor.supervisorLeung, Carson K. (Computer Science)en_US
dc.date.accessioned2016-12-12T21:44:58Z
dc.date.available2016-12-12T21:44:58Z
dc.date.issued2016
dc.degree.disciplineComputer Scienceen_US
dc.degree.levelMaster of Science (M.Sc.)en_US
dc.description.abstractThe increasing size of modern applications produces huge amounts of data, which in turn leads to a new challenge to data mining or big data analytics. Researchers often use the five V’s (Volume, Velocity, Variety, Veracity, and Value) to describe the features of big data. The interest of discovering patterns from a large collection of data has risen in both academic and industrial areas. Examples of rich sources of big data are on-line social networks like Facebook or Twitter. Embedded in these user online social activities are useful information and knowledge. Recently, although some algorithms have been proposed to mine a large scale of data, they mostly focused on the volume aspect. Unfortunately, not that many approaches have been focused on data variety which is also a critical criterion for mining process. The composition of a dataset could either be sparse or dense, or not evenly uniformly distributed. For example, a list of common friends in an on-line social network can be dense if two people share a lot of common friends; it could be sparse otherwise. For my MSc thesis, I design and implement a big data analytic algorithm that tackles both volume and variety aspects of big data.en_US
dc.description.noteFebruary 2017en_US
dc.identifier.urihttp://hdl.handle.net/1993/31951
dc.language.isoengen_US
dc.rightsopen accessen_US
dc.subjectData miningen_US
dc.subjectFrequent pattern miningen_US
dc.subjectBig dataen_US
dc.subjectData analyticsen_US
dc.titleScalable vertical mining for big data analyticsen_US
dc.typemaster thesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
zhang_hao.pdf
Size:
644.29 KB
Format:
Adobe Portable Document Format
Description:
main thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.2 KB
Format:
Item-specific license agreed to upon submission
Description: