Journal Article10.1016/J.CSL.2008.06.002
Continuous speech recognition with sparse coding
W. J. Smit,Etienne Barnard +1 more
30
TL;DR: This article shows how sparse codes can be used to do continuous speech recognition by using an iterative subset selection algorithm with quadratic programming to find a sparse code for a spectrogram.
read more
About: This article is published in Computer Speech & Language. The article was published on 01 Apr 2009. The article focuses on the topics: Sparse approximation & Spike train.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
On the Recognition of Cochlear Implant-Like Spectrally Reduced Speech With MFCC and HMM-Based ASR
TL;DR: It was shown that changing the bandwidth of the subband temporal envelopes had no significant effect on the ASR word accuracy, and increasing the number of frequency subbands of the SRS from 4 to 16 improved significantly the system performance.
33
Online Non-Negative Convolutive Pattern Learning for Speech Signals
TL;DR: This paper proposes a new online implementation of CNMF and CNSC which processes input data piece-by-piece and updates learned patterns gradually with accumulated statistics and shows that, with unlimited data and computing resources, the new online learning algorithm almost surely converges to a local minimum of the objective cost function.
MINQ8: general definite and bound constrained indefinite quadratic programming
Waltraud Huyer,Arnold Neumaier +1 more
TL;DR: In this article, the authors propose new algorithms for the local optimization of bound constrained quadratic programs, the solution of general definite quadrastic programs, and finding either a point satisfying given linear equations and inequalities or a certificate of infeasibility.
•Proceedings Article
Online Pattern Learning for Non-Negative Convolutive Sparse Coding.
Dong Wang,Ravichander Vipperla,Nicholas Evans +2 more
- 01 Jan 2011
TL;DR: An online algorithm is proposed for CNMF and CNSC, which processes input data piece-by-piece and updates the learned patterns after the processing of each piece by using accumulated sufficient statistics, thereby enabling its application to large scale tasks.
Spectrum enhancement with sparse coding for robust speech recognition
TL;DR: A novel method first finds out the atoms which represent the noise sparsely, and then selectively ignores them in the reconstruction of speech to reduce the residual noise, and speech features are then extracted from the enhanced spectrum for speech recognition.
14
References
Introduction to algorithms: 4. Turtle graphics
TL;DR: In this article, a language similar to logo is used to draw geometric pictures using this language and programs are developed to draw geometrical pictures using it, which is similar to the one we use in this paper.
15.4K
Matching pursuits with time-frequency dictionaries
Stéphane Mallat,Zhifeng Zhang +1 more
TL;DR: The authors introduce an algorithm, called matching pursuit, that decomposes any signal into a linear expansion of waveforms that are selected from a redundant dictionary of functions, chosen in order to best match the signal structures.
10.2K
•Book
Fundamentals of speech recognition
Lawrence R. Rabiner,Biing-Hwang Juang +1 more
- 01 Jan 1993
TL;DR: This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.
9.4K
•Book
The Handbook of Brain Theory and Neural Networks
Michael A. Arbib
- 01 Jan 2007
TL;DR: A circular cribbage board having a circular base plate on which a circular counter disc, bearing a circular scale having 122 divisions numbered consecutively from 0, is mounted for rotation.
4.2K
Sparse Coding with an Overcomplete Basis Set: A Strategy Employed by V1 ?
TL;DR: These deviations from linearity provide a potential explanation for the weak forms of non-linearity observed in the response properties of cortical simple cells, and they further make predictions about the expected interactions among units in response to naturalistic stimuli.
4.2K