Deep Speech: Scaling up end-to-end speech recognition

Open AccessPosted Content

Deep Speech: Scaling up end-to-end speech recognition

- 17 Dec 2014

2.2K

TL;DR: Deep Speech, a state-of-the-art speech recognition system developed using end-to-end deep learning, outperforms previously published results on the widely studied Switchboard Hub5'00, achieving 16.0% error on the full test set.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book Chapter•10.5772/INTECHOPEN.80026

Convolutional Neural Networks for Raw Speech Recognition

Vishal Passricha, +1 more

- 12 Dec 2018

TL;DR: CNN-based acoustic model for raw speech signal is discussed, which establishes the relation between rawspeech signal and phones in a data-driven manner and performs better than traditional cepstral fea- ture-based systems.

...read moreread less

32

Journal Article•10.1016/j.jfineco.2021.12.004

Real-time price discovery via verbal communication: Method and application to Fedspeak

Marco Grotteria

- 01 Mar 2022

- Journal of Financial Economics

TL;DR: This article studied the price discovery process on FOMC days and found that price movements around the post-meeting statement release are strong predictors of price movement around the subsequent press conference.

...read moreread less

32

•Journal Article•10.1109/TASLP.2018.2821899

Speech Dereverberation With Context-Aware Recurrent Neural Networks

João Felipe Santos, +1 more

- 01 Jul 2018

- IEEE Transactions on Audio, Speech, and ...

TL;DR: The proposed model to perform speech dereverberation by estimating its spectral magnitude from the reverberant counterpart outperforms a recently proposed model that uses different context information depending on the reverberation time, without requiring any sort of additional input.

...read moreread less

32

Journal Article•10.2196/59505

Multimodal Large Language Models in Healthcare: Applications, Challenges, and Future Outlook (Preprint)

Rawan AlSaad, +6 more

- 20 Aug 2024

- Journal of Medical Internet Research

TL;DR: This preprint explores the applications, challenges, and future outlook of multimodal large language models in healthcare, highlighting the need for integrating diverse data modalities to inform clinical decisions and drive a paradigm shift toward multimodal data-driven medical practice.

...read moreread less

32

•Proceedings Article

Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees

Zhen Qin, +7 more

- 03 May 2021

TL;DR: This paper showed that neural LTR models are inferior to the best publicly available Gradient Boosted Decision Trees (GBDT) in terms of their reported ranking accuracy on benchmark datasets and proposed a unified framework comprising of counter strategies to ameliorate the existing weaknesses of neural models.

...read moreread less

31

...

Expand

References

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 08 Dec 2014

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

...read moreread less

20.1K

•Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, +1 more

- 21 Jun 2010

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.

...read moreread less

18.4K

Journal Article•10.1162/NECO.1989.1.4.541

Backpropagation applied to handwritten zip code recognition

Yann LeCun, +6 more

- 01 Dec 1989

- Neural Computation

TL;DR: This paper demonstrates how constraints from the task domain can be integrated into a backpropagation network through the architecture of the network, successfully applied to the recognition of handwritten zip code digits provided by the U.S. Postal Service.

...read moreread less

12.5K

...

Expand

Deep Speech: Scaling up end-to-end speech recognition

Chat with Paper

AI Agents for this Paper

Citations

Convolutional Neural Networks for Raw Speech Recognition

Real-time price discovery via verbal communication: Method and application to Fedspeak

Speech Dereverberation With Context-Aware Recurrent Neural Networks

Multimodal Large Language Models in Healthcare: Applications, Challenges, and Future Outlook (Preprint)

Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees

References

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

Sequence to Sequence Learning with Neural Networks

Rectified Linear Units Improve Restricted Boltzmann Machines

Backpropagation applied to handwritten zip code recognition

Related Papers (5)

ImageNet Classification with Deep Convolutional Neural Networks

Deep Residual Learning for Image Recognition

Long short-term memory

Very Deep Convolutional Networks for Large-Scale Image Recognition

Gradient-based learning applied to document recognition