Deep Speech: Scaling up end-to-end speech recognition

Open Access•Posted Content

- 17 Dec 2014

2.2K

TL;DR: Deep Speech, a state-of-the-art speech recognition system developed using end-to-end deep learning, outperforms previously published results on the widely studied Switchboard Hub5'00, achieving 16.0% error on the full test set.

Citations

•Journal Article

Regression Prior Networks

Andrey Malinin, +3 more

- 04 May 2021

- arXiv: Learning

TL;DR: This work extends Prior Networks and EnD$^2$ to regression tasks by considering the Normal-Wishart distribution and demonstrates the properties of Regression Prior Networks, where they yield performance competitive with ensemble approaches.

35

•Proceedings Article•10.21437/INTERSPEECH.2018-1301

Multi-channel attention for end-to-end speech recognition

Stefan Braun, +4 more

- 02 Sep 2018

TL;DR: This work proposes a sensory attention mechanism that is invariant to the channel ordering and only increases the overall parameter count by 0.09%, and demonstrates that even without re-training, this attention-equipped end-to-end model is able to deal with arbitrary numbers of input channels during inference.

35

•Proceedings Article•10.1145/3240765.3243494

Searching toward pareto-optimal device-aware neural architectures

An-Chieh Cheng, +9 more

- 05 Nov 2018

TL;DR: Experimental results are poised to show that architectures found by MONAS and DPP-Net achieve Pareto optimality w.r.t. the given objectives for various devices.

35

•Posted Content

Universal adversarial examples in speech command classification

Jon Vadillo, +1 more

- 22 Nov 2019

- arXiv: Learning

TL;DR: Evidence is provided that universal attacks can be generated for speech command classification tasks and that they generalize across different models to a significant extent. A novel analytical framework is also proposed for evaluating universal perturbations under different levels of universality.

34

•Proceedings Article•10.1109/ASAP52443.2021.00025

Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

Zhiqiang Que, +12 more

- 26 Jun 2021

- arXiv: Learning

TL;DR: This paper presents a reconfigurable architecture for reducing the latency of recurrent neural networks (RNNs) used for detecting gravitational waves. The approach optimizes the initiation intervals (II) in a multi-layer LSTM (Long Short-Term Memory) network by identifying appropriate reuse factors for each layer.

34

References

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: A deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax achieved state-of-the-art performance on ImageNet classification.

88.4K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception is a deep convolutional neural network architecture that achieved a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

56.6K

•Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 08 Dec 2014

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

20.1K

•Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, +1 more

- 21 Jun 2010

TL;DR: Replacing the binary stochastic hidden units of restricted Boltzmann machines with noisy rectified linear units yields features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.

18.4K

•Journal Article•10.1162/NECO.1989.1.4.541

Backpropagation applied to handwritten zip code recognition

Yann LeCun, +6 more

- 01 Dec 1989

- Neural Computation

TL;DR: This paper demonstrates how constraints from the task domain can be integrated into a backpropagation network through the architecture of the network, and applies the approach successfully to the recognition of handwritten zip code digits provided by the U.S. Postal Service.

12.5K

Related Papers (5)

ImageNet Classification with Deep Convolutional Neural Networks

Deep Residual Learning for Image Recognition

Long short-term memory

Very Deep Convolutional Networks for Large-Scale Image Recognition

Gradient-based learning applied to document recognition