Scalable high-utility pattern mining from data streams

dc.contributor.authorMai, Jiaxing
dc.contributor.examiningcommitteeMuthukumarana, Saman (Statistics)en_US
dc.contributor.examiningcommitteeWang, Shaowei (Computer Science)en_US
dc.contributor.supervisorLeung, Carson K.
dc.date.accessioned2022-09-22T21:27:15Z
dc.date.available2022-09-22T21:27:15Z
dc.date.copyright2022-08-29
dc.date.issued2022-08-29
dc.date.submitted2022-08-30T03:51:56Zen_US
dc.degree.disciplineComputer Scienceen_US
dc.degree.levelMaster of Science (M.Sc.)en_US
dc.description.abstractTraditional high-utility mining mainly focuses on improving the efficiency of discovering high utility patterns from static databases based on a simplified assumption that the unit utility for a given item is a constant. However, not much research effort has been put into mining dynamic profit from data stream yet. The emergence of big data has led to some performance challenges such that a proper big data management technique is required to discover useful knowledge from the dynamic data streams. Traditional static data mining algorithms cannot directly apply to dynamic data. Furthermore, as information in the data stream might not be uniformly distributed, it introduces extra challenges to process the data. To mine real-world data streams, it is logical to use big data stream processing frameworks. Leveraging these big data processing frameworks requires having scalable algorithms. Hence, for my MSc thesis, I design and develop a high utility data stream framework to speed up the execution time and be flexible to adapt to mining requirement after data are dynamically modified. Utilizing our proposed algorithm, the data stream mining performance is expected to be further enhanced against both synthetic and real-world datasets.en_US
dc.description.noteFebruary 2023en_US
dc.identifier.urihttp://hdl.handle.net/1993/36919
dc.language.isoengen_US
dc.rightsopen accessen_US
dc.subjectData miningen_US
dc.subjectPattern miningen_US
dc.subjectData streamsen_US
dc.subjectHigh utility miningen_US
dc.titleScalable high-utility pattern mining from data streamsen_US
dc.typemaster thesisen_US
local.subject.manitobanoen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mai_Jiaxing.pdf
Size:
3.05 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.2 KB
Format:
Item-specific license agreed to upon submission
Description: