Scalable high-utility pattern mining from data streams

Mai, Jiaxing

dc.contributor.author	Mai, Jiaxing
dc.contributor.examiningcommittee	Muthukumarana, Saman (Statistics)	en_US
dc.contributor.examiningcommittee	Wang, Shaowei (Computer Science)	en_US
dc.contributor.supervisor	Leung, Carson K.
dc.date.accessioned	2022-09-22T21:27:15Z
dc.date.available	2022-09-22T21:27:15Z
dc.date.copyright	2022-08-29
dc.date.issued	2022-08-29
dc.date.submitted	2022-08-30T03:51:56Z	en_US
dc.degree.discipline	Computer Science	en_US
dc.degree.level	Master of Science (M.Sc.)	en_US
dc.description.abstract	Traditional high-utility pattern mining focuses mainly on efficiently discovering high-utility patterns from static databases, under the simplifying assumption that the unit utility of a given item is constant. However, little research has addressed mining with dynamic profits from data streams. The emergence of big data has brought performance challenges, so a proper big data management technique is required to discover useful knowledge from dynamic data streams. Traditional static data mining algorithms cannot be applied directly to dynamic data. Furthermore, information in a data stream might not be uniformly distributed, which introduces extra challenges in processing the data. To mine real-world data streams, it is logical to use big data stream processing frameworks, and leveraging these frameworks requires scalable algorithms. Hence, for my MSc thesis, I design and develop a high-utility data stream mining framework that reduces execution time and adapts flexibly to mining requirements after data are dynamically modified. With the proposed algorithm, data stream mining performance is expected to improve further on both synthetic and real-world datasets.	en_US
dc.description.note	February 2023	en_US
dc.identifier.uri	http://hdl.handle.net/1993/36919
dc.language.iso	eng	en_US
dc.rights	open access	en_US
dc.subject	Data mining	en_US
dc.subject	Pattern mining	en_US
dc.subject	Data streams	en_US
dc.subject	High utility mining	en_US
dc.title	Scalable high-utility pattern mining from data streams	en_US
dc.type	master thesis	en_US
local.subject.manitoba	no	en_US

Files

Original bundle

Name: Mai_Jiaxing.pdf
Size: 3.05 MB
Format: Adobe Portable Document Format
Description: Thesis

License bundle

Name: license.txt
Size: 2.2 KB
Format: Item-specific license agreed to upon submission

Collections

FGS - Electronic Theses and Practica