Journal Article10.1016/j.inffus.2023.101819
Long sequence time-series forecasting with deep learning: A survey
Minbo Ma,Tianrui Li,Hongjun Wang,Chongshou Li +3 more
124
TL;DR: In this article , the authors provide a comprehensive survey of LSTF studies with deep learning technology and summarize the evolution in terms of a proposed taxonomy based on network structure, and discuss three key problems and corresponding solutions from long dependency modeling, computation cost, and evaluation metrics.
read more
About: This article is published in Information Fusion. The article was published on 01 Apr 2023. The article focuses on the topics: Computer science & Computer science.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Kolmogorov-Arnold Networks (KANs) for Time Series Analysis
Cristian J. Vaca-Rubio,Luis Daniel Blanco,Roberto Pereira,Màrius Caus +3 more
- 14 May 2024
TL;DR: Kolmogorov-Arnold Networks (KANs) are effective for time series forecasting, leveraging their adaptive activation functions to learn complex patterns from data. KANs outperform Multi-Layer Perceptrons (MLPs) with fewer parameters and achieve more accurate forecasting results.
Privacy and Security Concerns in Generative AI: A Comprehensive Survey
Abenezer Golda,Kidus Mekonen,Amit Pandey,Anushka Singh,Vikas Hassija,Vinay Chamola,Biplab Sikdar +6 more
TL;DR: This comprehensive survey offers a meticulous examination of the privacy and security challenges inherent to GAI, and provides five pivotal perspectives essential for a comprehensive understanding of these intricacies.
32
Exploring Progress in Multivariate Time Series Forecasting: Comprehensive Benchmarking and Heterogeneity Analysis
Zezhi Shao,Fei Wang,Yongjun Xu,Wei Wei,Chengqing Yu,Zhao-yang Zhang,Di Yao,Guangyin Jin,Xin Cao,Gao Cong,Christian S. Jensen,Xueqi Cheng +11 more
TL;DR: It is proved that neglecting heterogeneity is the primary reason for generating controversies in technical approaches and an exhaustive and reproducible performance and efficiency comparison of popular models is conducted, providing insights for researchers in selecting and designing MTS forecasting models.
STGAFormer: Spatial–temporal Gated Attention Transformer based Graph Neural Network for traffic flow forecasting
Zili Geng,Jie Xu,Rongsen Wu,Changming Zhao,Jin Wang,Yunji Li,Chenlin Zhang +6 more
TL;DR: This paper proposes STGAFormer, a Graph Neural Network model that leverages transformer encoder architecture and novel gated attention modules to predict traffic flow, achieving state-of-the-art performance on four real datasets with improved MAE value of 3.82% on PeMS08 dataset.
22
Exploring Progress in Multivariate Time Series Forecasting: Comprehensive Benchmarking and Heterogeneity Analysis
Zezhi Shao,Fei Wang,Yongjun Xu,Wei Wei,Chengqing Yu,Zhao Zhang,Di Yao,Tao Sun,Guangyin Jin,Xin Cao,Gao Cong,Christian S. Jensen,Xueqi Cheng +12 more
16
References
•Proceedings Article
Attention is All you Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Lukasz Kaiser,Illia Polosukhin +7 more
- 12 Jun 2017
TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
Attention Is All You Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Łukasz Kaiser,Illia Polosukhin +7 more
- 01 Jan 2017
Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
51.8K
Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation
Kyunghyun Cho,Bart van Merriënboer,Caglar Gulcehre,Dzmitry Bahdanau,Fethi Bougares,Holger Schwenk,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio +8 more
- 01 Jan 2014
TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.
Finding Structure in Time
TL;DR: A proposal along these lines first described by Jordan (1986) which involves the use of recurrent links in order to provide networks with a dynamic memory and suggests a method for representing lexical categories and the type/token distinction is developed.
11.8K
•Posted Content
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord,Sander Dieleman,Heiga Zen,Karen Simonyan,Oriol Vinyals,Alex Graves,Nal Kalchbrenner,Andrew W. Senior,Koray Kavukcuoglu +8 more
TL;DR: This paper proposed WaveNet, a deep neural network for generating audio waveforms, which is fully probabilistic and autoregressive, with the predictive distribution for each audio sample conditioned on all previous ones.
6K