Journal Article, DOI: 10.1109/TITS.2014.2311123
Deep Architecture for Traffic Flow Prediction: Deep Belief Networks With Multitask Learning
TL;DR: The paper shows that MTL can improve the generalization performance of shared tasks and proposes a grouping method, based on the weights in the top layer, that makes MTL more effective by taking full advantage of weight sharing in the deep architecture.
Abstract: Traffic flow prediction is a fundamental problem in transportation modeling and management. Many existing approaches fail to provide favorable results because they are: 1) shallow in architecture; 2) hand engineered in features; and 3) separate in learning. In this paper, we propose a deep architecture that consists of two parts: a deep belief network (DBN) at the bottom and a multitask regression layer at the top. The DBN is used for unsupervised feature learning: it learns effective features for traffic flow prediction without labels, an approach that has proved effective in areas such as image and audio classification. To the best of our knowledge, this is the first paper that applies the deep learning approach to transportation research. To incorporate multitask learning (MTL) in our deep architecture, a multitask regression layer is placed above the DBN for supervised prediction. We further investigate homogeneous MTL and heterogeneous MTL for traffic flow prediction. To take full advantage of weight sharing in our deep architecture, we propose a grouping method based on the weights in the top layer to make MTL more effective. Experiments on transportation data sets show that our deep architecture performs well, with improvements of close to 5% over the state of the art. We also show that MTL can improve the generalization performance of shared tasks. These positive results demonstrate that deep learning and MTL are promising in transportation research.
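The two-part architecture described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the layer sizes, the random weights standing in for DBN-pretrained layers, the function names, and the greedy cosine-similarity rule used to group tasks are all assumptions made for illustration. The abstract states only that grouping is based on the weights in the top layer, not how those weights are compared.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def init_params(layer_sizes, n_tasks, rng):
    """Random weights standing in for DBN-pretrained layers (assumption),
    plus a linear multitask regression layer on top."""
    hidden = [
        (rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out))
        for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])
    ]
    # One weight column per task; every task reads the same shared features.
    head = (rng.normal(0.0, 0.1, size=(layer_sizes[-1], n_tasks)),
            np.zeros(n_tasks))
    return hidden, head


def forward(x, hidden, head):
    """Shared sigmoid feature layers followed by per-task linear outputs."""
    h = x
    for W, b in hidden:
        h = sigmoid(h @ W + b)
    W_top, b_top = head
    return h @ W_top + b_top


def group_tasks_by_top_weights(W_top, threshold=0.5):
    """Hypothetical grouping rule: a task joins the first group whose seed
    task has cosine similarity >= threshold with its top-layer weights."""
    norms = np.maximum(np.linalg.norm(W_top, axis=0, keepdims=True), 1e-12)
    cols = W_top / norms
    sim = cols.T @ cols
    groups = []
    for t in range(W_top.shape[1]):
        for g in groups:
            if sim[t, g[0]] >= threshold:
                g.append(t)
                break
        else:
            groups.append([t])
    return groups


# Example: 12 past flow readings in, 4 prediction tasks (e.g. 4 roads) out.
rng = np.random.default_rng(0)
hidden, head = init_params([12, 16, 8], n_tasks=4, rng=rng)
x = rng.random((5, 12))              # 5 samples of normalized flow history
y = forward(x, hidden, head)         # shape (5, 4): one column per task
groups = group_tasks_by_top_weights(head[0])
```

In this sketch, tasks whose top-layer weight vectors point in similar directions rely on the shared features in similar ways, so training them jointly within a group is one plausible way to exploit weight sharing. A real system would pretrain the hidden layers as stacked RBMs and fine-tune the whole network before grouping.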
Citations
Short-Term Traffic Forecasting Using Deep Learning
Iren Valova,Natacha Gueorguieva,Sandeep Smudidonga +2 more
- 01 Aug 2021
TL;DR: This research proposes a Convolutional Long Short-Term Memory (CLSTM) model that incorporates spatial and temporal information into the forecasting process, and compares it with the deep learning architectures Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM) and with baseline methods such as Vector Autoregression (VAR) and historical average.
Combined Electricity and Traffic Short-Term Load Forecasting Using Bundled Causality Engine
Jose Cordova,Lalitha Madhavi Konila Sriram,Ayberk Kocatepe,Yuxun Zhou,Eren Erman Ozguven,Reza Arghandeh +5 more
TL;DR: The bundled causality engine is proposed as a novel information theory-based approach to characterize the causal dependency between flows of electricity and transportation networks and shows a considerable improvement in the short-term load forecasting accuracy at the household level.
Wavelet neural network with improved genetic algorithm for traffic flow time series prediction
Hong-jun Yang,Xu Hu +1 more
TL;DR: A prediction model based on a WNN and a genetic algorithm improved with a clustering search strategy (IGA-WNN) is proposed, which has higher prediction accuracy and better nonlinear fitting ability than the traditional WNN and GA-WNN prediction models.
A Data Grouping CNN Algorithm for Short-Term Traffic Flow Forecasting
Donghai Yu,Yang Liu,Xiaohui Yu +3 more
- 23 Sep 2016
TL;DR: A data grouping approach based on a convolutional neural network (DGCNN) is proposed for forecasting urban short-term traffic flow; it considers the spatial relations between traffic locations and uses that information to train a convolutional neural network for forecasting.
Traffic flow prediction in inland waterways of Assam region using uncertain spatiotemporal correlative features
TL;DR: A deep learning-based model, the Convolutional Neural Network-Long Short-Term Memory network (CNN-LSTM), is proposed as a predictor that uses uncertain spatiotemporal information to reduce navigation mishaps and traffic flow prediction failures during transportation.