Deep Speech: Scaling up end-to-end speech recognition

Open AccessPosted Content

Deep Speech: Scaling up end-to-end speech recognition

- 17 Dec 2014

2.2K

TL;DR: Deep Speech, a state-of-the-art speech recognition system developed using end-to-end deep learning, outperforms previously published results on the widely studied Switchboard Hub5'00, achieving 16.0% error on the full test set.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Multi-Modal Data Augmentation for End-to-end ASR

Adithya Renduchintala, +3 more

- 27 Mar 2018

- arXiv: Computation and Language

TL;DR: In this paper, a multi-modal data augmentation network (MMDA) is proposed to combine acoustic and symbolic input for ASR, which enables seamless mixing of large text datasets with significantly smaller transcribed speech corpora during training.

...read moreread less

33

•Journal Article•10.3390/APP11125541

From Classical Machine Learning to Deep Neural Networks: A Simplified Scientometric Review

Ravil I. Mukhamediev, +4 more

- 15 Jun 2021

- Applied Sciences

TL;DR: The results show that, despite the limitations of the method, it is possible to identify fast-growing areas of research regardless of the number of articles, and predict publication activity in the short term with satisfactory accuracy for practice.

...read moreread less

33

Proceedings Article•10.1109/FCCM48280.2020.00011

Optimizing Reconfigurable Recurrent Neural Networks

Zhiqiang Que, +7 more

- 03 May 2020

TL;DR: A novel latency-hiding hardware architecture based on column-wise matrix-vector multiplication to eliminate data dependency is proposed, improving the throughput of systems of RNN models and a flexible checkerboard tiling strategy is introduced to allow large weight matrices.

...read moreread less

33

•Journal Article•10.1109/access.2022.3208131

Adversarial Deep Learning: A Survey on Adversarial Attacks and Defense Mechanisms on Image Classification

01 Jan 2022

- IEEE Access

TL;DR: A comprehensive review of the most recent and state-of-the-art adversarial attack methods by providing an in-depth analysis and explanation of the working process of these attacks is provided in this article .

...read moreread less

33

Journal Article•10.1177/0846537120947148

Applications of Artificial Intelligence in Musculoskeletal Imaging: From the Request to the Report.

Natalia Gorelik, +1 more

- 01 Feb 2021

- Canadian Association of Radiologists jou...

TL;DR: The impact of AI through the entire imaging cycle of musculoskeletal radiology, from the placement of the requisition to the generation of the report, is explored, with an added Canadian perspective.

...read moreread less

33

...

Expand

References

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 08 Dec 2014

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

...read moreread less

20.1K

•Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, +1 more

- 21 Jun 2010

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.

...read moreread less

18.4K

Journal Article•10.1162/NECO.1989.1.4.541

Backpropagation applied to handwritten zip code recognition

Yann LeCun, +6 more

- 01 Dec 1989

- Neural Computation

TL;DR: This paper demonstrates how constraints from the task domain can be integrated into a backpropagation network through the architecture of the network, successfully applied to the recognition of handwritten zip code digits provided by the U.S. Postal Service.

...read moreread less

12.5K

...

Expand

Deep Speech: Scaling up end-to-end speech recognition

Chat with Paper

AI Agents for this Paper

Citations

Multi-Modal Data Augmentation for End-to-end ASR

From Classical Machine Learning to Deep Neural Networks: A Simplified Scientometric Review

Optimizing Reconfigurable Recurrent Neural Networks

Adversarial Deep Learning: A Survey on Adversarial Attacks and Defense Mechanisms on Image Classification

Applications of Artificial Intelligence in Musculoskeletal Imaging: From the Request to the Report.

References

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

Sequence to Sequence Learning with Neural Networks

Rectified Linear Units Improve Restricted Boltzmann Machines

Backpropagation applied to handwritten zip code recognition

Related Papers (5)

ImageNet Classification with Deep Convolutional Neural Networks

Deep Residual Learning for Image Recognition

Long short-term memory

Very Deep Convolutional Networks for Large-Scale Image Recognition

Gradient-based learning applied to document recognition