Predictive analytics on open big data for supporting smart transportation services

Loading...
Thumbnail Image
Date
2020
Authors
Balbin, Paul Patrick F.
Barker, Jackson C.R.
Leung, Carson K.
Tran, Marvin
Wall, Riley P.
Cuzzocrea, Alfredo
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Abstract
In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people's lives. As time is a precious resource, bus delays could negatively affect commuters' plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services.
Description
Keywords
Predictive analytics, Open data, Winnipeg open data, Big data, Transportation data, On-time performance, Frequent patterns, Software engineering, Large-scale systems
Citation
P.P.F. Balbin, J.C.R. Barker, C.K. Leung, M. Tran, R.P. Wall, A. Cuzzocrea. Predictive analytics on open big data for supporting smart transportation services. Procedia Computer Science, 176 (2020), pp. 3009-3018.