An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

Open AccessPosted Content

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

- 04 Mar 2018

5.4K

TL;DR: A systematic evaluation of generic convolutional and recurrent architectures for sequence modeling concludes that the common association between sequence modeling and recurrent networks should be reconsidered, and convolutionals should be regarded as a natural starting point for sequence modeled tasks.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1007/s11356-023-30774-4

Forecasting water quality variable using deep learning and weighted averaging ensemble models.

Mohammad Ghadir Zamani, +4 more

- 24 Nov 2023

- Environmental Science and Pollution Rese...

TL;DR: The study’s findings demonstrated that the EM-NSGA-II stands out with exceptional effectiveness compared to DL and EM-GA models, showcasing improvements of 14% (RNN), 8% (LSTM), 6% (GRU), 8% (TCN), and 3% (EM-GA) during the testing phase during the testing phase.

...read moreread less

23

Book Chapter•10.1007/978-3-030-36808-1_14

Multi-task Temporal Convolutional Network for Predicting Water Quality Sensor Data

Yi-Fan Zhang, +2 more

- 12 Dec 2019

TL;DR: The proposed multi-task temporal convolution network (MTCN) is an encouraging approach for water quality management by processing a large amount of sensor data and achieves the best RMSE scores in predicting both temperature and DO in the following 48 time steps.

...read moreread less

23

•Journal Article•10.3389/FEART.2021.659611

Centimeter-Scale Lithology and Facies Prediction in Cored Wells Using Machine Learning

Thomas P. Martin, +2 more

- 24 Jun 2021

- Frontiers in Earth Science

TL;DR: In this paper, an open-source, python-based machine learning workflow was developed to analyze core image data in a scalable, reproducible way, which can unlock warehouses full of high-resolution data for a multitude of geological settings.

...read moreread less

23

•Journal Article•10.3390/E22111203

TNT: An Interpretable Tree-Network-Tree Learning Framework using Knowledge Distillation.

Jiawei Li, +5 more

- 24 Oct 2020

- Entropy

TL;DR: A Tree-Network-Tree (TNT) learning framework for explainable decision-making, where the knowledge is alternately transferred between the tree model and DNNs is proposed, and extensive experiments demonstrated the effectiveness of the proposed method.

...read moreread less

23

Journal Article•10.1109/TGRS.2022.3160617

Large-Area Land-Cover Changes Monitoring With Time-Series Remote Sensing Images Using Transferable Deep Models

Jining Yan, +5 more

- IEEE Transactions on Geoscience and Remo...

TL;DR: Wang et al. as mentioned in this paper proposed the similarity-measurement-based deep transfer learning for time-series adaptive change detection (SDTL-TSACD) model, which used a standard dynamic time warping (SDTW) distance to cluster large-scale time series into multiple subcategories with high time series similarity.

...read moreread less

23

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

138.5K

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

...read moreread less

117.9K

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

99K

Journal Article•10.1109/5.726791

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

- 01 Jan 1998

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

53.5K

...

Expand

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

Chat with Paper

AI Agents for this Paper

Citations

Forecasting water quality variable using deep learning and weighted averaging ensemble models.

Multi-task Temporal Convolutional Network for Predicting Water Quality Sensor Data

Centimeter-Scale Lithology and Facies Prediction in Cored Wells Using Machine Learning

TNT: An Interpretable Tree-Network-Tree Learning Framework using Knowledge Distillation.

Large-Area Land-Cover Changes Monitoring With Time-Series Remote Sensing Images Using Transferable Deep Models

References

Deep Residual Learning for Image Recognition

Adam: A Method for Stochastic Optimization

Deep Residual Learning for Image Recognition

Long short-term memory

Gradient-based learning applied to document recognition

Related Papers (5)

Long short-term memory

Deep Residual Learning for Image Recognition

Adam: A Method for Stochastic Optimization

Attention is All you Need

Dropout: a simple way to prevent neural networks from overfitting