Deep Speech: Scaling up end-to-end speech recognition

Open Access • Posted Content

- 17 Dec 2014

2.2K

TL;DR: Deep Speech, a state-of-the-art speech recognition system developed using end-to-end deep learning, outperforms previously published results on the widely studied Switchboard Hub5'00, achieving 16.0% error on the full test set.

Citations

•Posted Content

CAVBench: A Benchmark Suite for Connected and Autonomous Vehicles

Yifan Wang, +3 more

- 15 Oct 2018

- arXiv: Distributed, Parallel, and Cluste...

TL;DR: CAVBench is a benchmark suite for edge computing systems in the connected and autonomous vehicle (CAV) scenario. It consists of six typical applications covering four dominant CAV scenarios and takes four datasets as standard input.

50

•Posted Content

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection.

Christopher T. Lengerich, +1 more

- 28 Nov 2016

- arXiv: Computation and Language

TL;DR: Novel inference algorithms for an end-to-end Recurrent Neural Network trained with the Connectionist Temporal Classification loss function are developed which allow the model to achieve high accuracy on both keyword spotting and voice activity detection without retraining.

49

•Journal Article • DOI: 10.3109/10409238.2015.1135868

Computer vision for high content screening.

Oren Kraus, +1 more

- 24 Jan 2016

- Critical Reviews in Biochemistry and Mol...

TL;DR: The steps involved in quantifying microscopy images and different approaches for each step are described.

49

•Proceedings Article • DOI: 10.1109/PACT.2019.00009

MASR: A Modular Accelerator for Sparse RNNs

Udit Gupta, +7 more

- 01 Sep 2019

TL;DR: MASR accelerates bidirectional RNNs for on-chip ASR by exploiting sparsity in both dynamic activations and static weights, enabling designs that scale efficiently from resource-constrained, low-power IoT applications to large-scale, highly parallel datacenter deployments.

49

•Proceedings Article

Metamers of neural networks reveal divergence from human perceptual systems

Jenelle Feather, +3 more

- 01 Jan 2019

TL;DR: The results reveal discrepancies between model and human representations, but also show how metamers can help guide model refinement and elucidate model representations.

49

References

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: This paper trains a deep convolutional neural network with five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax, achieving state-of-the-art ImageNet classification results at the time.

88.4K

•Proceedings Article • DOI: 10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception is a deep convolutional neural network architecture that set a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

56.6K

•Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 08 Dec 2014

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

20.1K

•Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, +1 more

- 21 Jun 2010

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units. Replacing these with rectified linear units yields features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.

18.4K

•Journal Article • DOI: 10.1162/NECO.1989.1.4.541

Backpropagation applied to handwritten zip code recognition

Yann LeCun, +6 more

- 01 Dec 1989

- Neural Computation

TL;DR: This paper demonstrates how constraints from the task domain can be integrated into a backpropagation network through the network's architecture, and applies the approach successfully to the recognition of handwritten zip code digits provided by the U.S. Postal Service.

12.5K

Related Papers (5)

ImageNet Classification with Deep Convolutional Neural Networks

Deep Residual Learning for Image Recognition

Long short-term memory

Very Deep Convolutional Networks for Large-Scale Image Recognition

Gradient-based learning applied to document recognition