Return to search

Investigation of Recurrent Neural Network architectures based Deep Learning for Short-Term Traffic Speed Forecasting

The availability of large tranches of data and its influence on traffic flow, make the problem of short-term traffic speed prediction very complex in nature. For more than 40 years, various statistical time series forecasting methods have been applied for traffic speed prediction, and in the last 20 years, machine learning-based methods have gained prevalence. However, more recently, recurrent neural network (RNN) based methods have emerged to show better results for traffic speed prediction\cite{tian2015pred, zhao2017lstm,fu2017usin,chen2016long,dai2017deep,dai2019deep, kanestrom2017traf,shao2016traf,jia2017traf}. As the interest in applying RNN models to the traffic speed predictions started to grow, we found some critical and important unanswered questions with respect to the application of such methods. From these open questions, as part of this research study, we studied the following three questions for multi-step-ahead short-term traffic speed predictions: - What is the impact of using the direct and recursive strategies on the accuracy of RNN models as compared to using the multi-input-multi-output (MIMO) strategy? - What is the impact of different aggregation intervals for the input data? - What is the impact of including additional variables such as volume, occupancy, time of day, day of week, and spatial location as represented by station or sensor id? Our study resulted in the following observations, conclusions, and recommendations: - We observed that GRU architecture based RNN models had better accuracy as compared to other architectures. Thus we recommend that modeling efforts start with GRU architecture. - We observed that combining direct strategy with RNN gives the same accuracy as MIMO (i.e. many-to-many RNN architecture), however, the time taken for building many-to-many RNN architecture is much less as compared to direct strategy. Thus we recommend avoiding the use of recursive or direct strategies and advise the use of many-to-many RNN architecture without combining any other strategies. - We observed that there was no significant difference in the accuracy of 5,10,15 minute aggregations and that adding additional variables does not necessarily always result in higher accuracy. In both cases, we observed that using autotuning with a Bayesian algorithm was able to reduce the error metrics to a smaller range for almost all the combinations of aggregations and multivariate features. Among different data aggregations and feature sets, we suggest converting these choices as hyper-parameters and let the Bayesian algorithm based hyper-parameter tuner select the best combinations for your dataset. With the above contributions, this dissertation seeks to give traffic engineers a better understanding of how to apply modern methods for multi-step-ahead short-term traffic speed predictions.

Identiferoai:union.ndltd.org:ucf.edu/oai:stars.library.ucf.edu:etd2020-1351
Date01 January 2020
CreatorsFandango, Armando
PublisherSTARS
Source SetsUniversity of Central Florida
LanguageEnglish
Detected LanguageEnglish
Typetext
Formatapplication/pdf
SourceElectronic Theses and Dissertations, 2020-

Page generated in 0.0018 seconds