Tightening upper bounds to the expected support for uncertain frequent pattern mining

Loading...
Thumbnail Image
Date
2014
Authors
Leung, Carson K.
MacKinnon, Richard Kyle
Tanbeer, Syed K.
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Abstract
Due to advances in technology, high volumes of valuable data can be collected and transmitted at high velocity in various scientific and engineering applications. Consequently, efficient data mining algorithms are in demand for analyzing these data. For instance, frequent pattern mining discovers implicit, previously unknown, and potentially useful knowledge about relationships among frequently co-occurring items, objects and/or events. While many frequent pattern mining algorithms handle precise data, there are situations in which data are uncertain. In recent years, tree-based algorithms for mining uncertain data have been developed. However, tree structures corresponding to these algorithms can be large. Other tree structures for handling uncertain data may achieve compactness at the expense of loose upper bounds on expected supports. In this paper, we propose (i) a compact tree structure for capturing uncertain data, (ii) a technique for using our tree structure to tighten upper bounds to expected support, and (iii) an algorithm for mining frequent patterns based on our tightened bounds. Experimental results show the benefits of our tightened upper bounds to expected supports in uncertain frequent pattern mining.
Description
C.K. Leung, R.K. MacKinnon, S.K. Tanbeer. Tightening upper bounds to the expected support for uncertain frequent pattern mining. Procedia Computer Science, 35 (2014), pp. 328-337. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Keywords
data mining, data structure, expected support, frequent pattern mining, knowledge discovery, tree-based mining, uncertain data
Citation
C.K. Leung, R.K. MacKinnon, S.K. Tanbeer. Tightening upper bounds to the expected support for uncertain frequent pattern mining. Procedia Computer Science, 35 (2014), pp. 328-337.