Deep Speech: Scaling up end-to-end speech recognition

Open AccessPosted Content

Deep Speech: Scaling up end-to-end speech recognition

- 17 Dec 2014

2.2K

TL;DR: Deep Speech, a state-of-the-art speech recognition system developed using end-to-end deep learning, outperforms previously published results on the widely studied Switchboard Hub5'00, achieving 16.0% error on the full test set.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Meta-Learning Symmetries by Reparameterization

Allan Zhou, +2 more

- 06 Jul 2020

- arXiv: Learning

TL;DR: This work presents a method for learning and encoding equivariances into networks by learning corresponding parameter sharing patterns from data that can provably encode equivariance-inducing parameter sharing for any finite group of symmetry transformations.

...read moreread less

70

Proceedings Article•10.1109/MICRO.2018.00020

Diffy: a déjà vu-free differential deep neural network accelerator

Mostafa Mahmoud, +2 more

- 20 Oct 2018

TL;DR: Diffy, a hardware accelerator that performs Differential Convolution, provides the performance necessary to achieve real-time processing of HD resolution images with practical configurations and can serve as a general CNN accelerator as it improves performance even for image classification models.

...read moreread less

68

•Book Chapter•10.1007/978-3-030-70604-3_8

An Overview of Federated Deep Learning Privacy Attacks and Defensive Strategies

David Enthoven, +1 more

- 01 Jan 2021

- arXiv: Cryptography and Security

TL;DR: The application of a single defensive strategy is not enough to provide adequate protection to all available attack methods, so a literature review of the possible attack methods targetingFL privacy protection capabilities is performed.

...read moreread less

67

Journal Article•10.48550/arXiv.2301.08727

Neural Architecture Search: Insights from 1000 Papers

Colin White, +7 more

- 20 Jan 2023

- arXiv.org

TL;DR: Neural architecture search (NAS) as mentioned in this paper is the process of automating the design of neural architectures for a given task, and has already outpaced the best human-designed architectures on many tasks.

...read moreread less

67

Proceedings Article•10.1109/HSI.2018.8431232

A Vision and Speech Enabled, Customizable, Virtual Assistant for Smart Environments

Giancarlo Iannizzotto, +3 more

- 04 Jul 2018

TL;DR: Some of the most advanced techniques in computer vision, deep learning, speech generation and recognition, and artificial intelligence are combined into a virtual assistant architecture for smart home automation systems, which is effective and resource-efficient, interactive and customizable.

...read moreread less

66

...

Expand

References

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 08 Dec 2014

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

...read moreread less

20.1K

•Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, +1 more

- 21 Jun 2010

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.

...read moreread less

18.4K

Journal Article•10.1162/NECO.1989.1.4.541

Backpropagation applied to handwritten zip code recognition

Yann LeCun, +6 more

- 01 Dec 1989

- Neural Computation

TL;DR: This paper demonstrates how constraints from the task domain can be integrated into a backpropagation network through the architecture of the network, successfully applied to the recognition of handwritten zip code digits provided by the U.S. Postal Service.

...read moreread less

12.5K

...

Expand

Deep Speech: Scaling up end-to-end speech recognition

Chat with Paper

AI Agents for this Paper

Citations

Meta-Learning Symmetries by Reparameterization

Diffy: a déjà vu-free differential deep neural network accelerator

An Overview of Federated Deep Learning Privacy Attacks and Defensive Strategies

Neural Architecture Search: Insights from 1000 Papers

A Vision and Speech Enabled, Customizable, Virtual Assistant for Smart Environments

References

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

Sequence to Sequence Learning with Neural Networks

Rectified Linear Units Improve Restricted Boltzmann Machines

Backpropagation applied to handwritten zip code recognition

Related Papers (5)

ImageNet Classification with Deep Convolutional Neural Networks

Deep Residual Learning for Image Recognition

Long short-term memory

Very Deep Convolutional Networks for Large-Scale Image Recognition

Gradient-based learning applied to document recognition