A data analytic algorithm for managing, querying, and processing uncertain big data in cloud environments

Date
2015-12
Authors
Jiang, Fan
Leung, Carson K.
Publisher
MDPI
Abstract
Big data are everywhere as high volumes of varieties of valuable precise and uncertain data can be easily collected or generated at high velocity in various real-life applications. Embedded in these big data are rich sets of useful information and knowledge. To mine these big data and to discover useful information and knowledge, we present a data analytic algorithm in this article. Our algorithm manages, queries, and processes uncertain big data in cloud environments. More specifically, it manages transactions of uncertain big data, allows users to query these big data by specifying constraints expressing their interests, and processes the user-specified constraints to discover useful information and knowledge from the uncertain big data. As each item in every transaction in these uncertain big data is associated with an existential probability value expressing the likelihood of that item to be present in a particular transaction, computation could be intensive. Our algorithm uses the MapReduce model on a cloud environment for effective data analytics on these uncertain big data. Experimental results show the effectiveness of our data analytic algorithm for managing, querying, and processing uncertain big data in cloud environments.
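The abstract's core idea can be illustrated with a small sketch. It is not the authors' implementation. It assumes the common expected-support definition for uncertain data (the sum, over transactions, of the product of the existential probabilities of an itemset's items, which treats items as independent) and simulates the map and reduce phases in a single process. The toy transactions, the constraint, and all function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations
from math import prod

# Hypothetical toy data: each transaction maps an item to its
# existential probability of being present in that transaction.
transactions = [
    {"a": 0.9, "b": 0.5, "c": 0.4},
    {"a": 0.8, "c": 0.5},
    {"b": 1.0, "c": 0.5},
]


def satisfies_constraint(itemset):
    """Hypothetical user constraint: keep only itemsets drawn from {a, c}."""
    return set(itemset) <= {"a", "c"}


def map_phase(transaction, max_size=2):
    """Emit (itemset, probability) pairs for one transaction.

    Assumes items are independent, so the probability that an itemset
    occurs in the transaction is the product of its items' probabilities.
    """
    items = sorted(transaction)
    for size in range(1, max_size + 1):
        for itemset in combinations(items, size):
            if satisfies_constraint(itemset):
                yield itemset, prod(transaction[i] for i in itemset)


def reduce_phase(pairs):
    """Sum the emitted probabilities per itemset to get expected support."""
    totals = defaultdict(float)
    for itemset, probability in pairs:
        totals[itemset] += probability
    return dict(totals)


def mine(transactions, min_expected_support, max_size=2):
    """Return constraint-satisfying itemsets meeting the support threshold."""
    pairs = (pair for t in transactions for pair in map_phase(t, max_size))
    totals = reduce_phase(pairs)
    return {x: s for x, s in totals.items() if s >= min_expected_support}


if __name__ == "__main__":
    for itemset, support in sorted(mine(transactions, 1.0).items()):
        print(itemset, round(support, 2))
```

In a real MapReduce deployment the map calls would run in parallel over partitions of the data and the framework would group the emitted pairs by key before reduction. Checking the constraint in the map phase, as done here, keeps uninteresting itemsets from being emitted at all.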
Description
Jiang, F.; Leung, C.K. A data analytic algorithm for managing, querying, and processing uncertain big data in cloud environments. Algorithms 2015, 8, 1175-1194. This article belongs to the special issue on algorithms for managing, querying and processing big data in cloud environments. This is an open access article distributed under the Creative Commons Attribution License (CC BY) (http://creativecommons.org/licenses/by/4.0/).
Keywords
big data, cloud computing, constraints, data analytics, MapReduce, uncertain data
Citation
Jiang, F.; Leung, C.K. A data analytic algorithm for managing, querying, and processing uncertain big data in cloud environments. Algorithms 2015, 8, 1175-1194.