Scalable high-utility pattern mining from data streams
dc.contributor.author | Mai, Jiaxing | |
dc.contributor.examiningcommittee | Muthukumarana, Saman (Statistics) | en_US |
dc.contributor.examiningcommittee | Wang, Shaowei (Computer Science) | en_US |
dc.contributor.supervisor | Leung, Carson K. | |
dc.date.accessioned | 2022-09-22T21:27:15Z | |
dc.date.available | 2022-09-22T21:27:15Z | |
dc.date.copyright | 2022-08-29 | |
dc.date.issued | 2022-08-29 | |
dc.date.submitted | 2022-08-30T03:51:56Z | en_US |
dc.degree.discipline | Computer Science | en_US |
dc.degree.level | Master of Science (M.Sc.) | en_US |
dc.description.abstract | Traditional high-utility mining mainly focuses on improving the efficiency of discovering high utility patterns from static databases based on a simplified assumption that the unit utility for a given item is a constant. However, not much research effort has been put into mining dynamic profit from data stream yet. The emergence of big data has led to some performance challenges such that a proper big data management technique is required to discover useful knowledge from the dynamic data streams. Traditional static data mining algorithms cannot directly apply to dynamic data. Furthermore, as information in the data stream might not be uniformly distributed, it introduces extra challenges to process the data. To mine real-world data streams, it is logical to use big data stream processing frameworks. Leveraging these big data processing frameworks requires having scalable algorithms. Hence, for my MSc thesis, I design and develop a high utility data stream framework to speed up the execution time and be flexible to adapt to mining requirement after data are dynamically modified. Utilizing our proposed algorithm, the data stream mining performance is expected to be further enhanced against both synthetic and real-world datasets. | en_US |
dc.description.note | February 2023 | en_US |
dc.identifier.uri | http://hdl.handle.net/1993/36919 | |
dc.language.iso | eng | en_US |
dc.rights | open access | en_US |
dc.subject | Data mining | en_US |
dc.subject | Pattern mining | en_US |
dc.subject | Data streams | en_US |
dc.subject | High utility mining | en_US |
dc.title | Scalable high-utility pattern mining from data streams | en_US |
dc.type | master thesis | en_US |
local.subject.manitoba | no | en_US |