Proceedings Article10.1109/ICASSP.1989.266505
Continuous hidden Markov modeling for speaker-independent word spotting
J.R. Rohlicek,W. Russell,Salim Roukos,H. Gish +3 more
- 23 May 1989
- pp 627-630
313
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Speech spotter: On-demand speech recognition in human-human conversation on the telephone or in face-to-face situations
Masataka Goto,Koji Kitayama,Katsunobu Itou,Tetsunori Kobayashi +3 more
- 01 Jan 2004
TL;DR: A novel speech-interface function, called “speech spotter”, is described, which converts voice commands into a speech recognizer in the midst of natural human-human conversation and is found to be robust and convenient enough to be used in face-to-face or cellular-phone conversations.
Robust mapping of noisy speech parameters for HMM word spotting
Kenney Ng,H. Gish,J.R. Rohlicek +2 more
- 23 Mar 1992
TL;DR: It is demonstrated that using the proposed probabilistic vector mapping algorithm as a feature preprocessor results in robust performance levels across a wide range of signal-to-noise (SNR) levels.
14
Patent
Wordspotting using two hidden Markov models (HMM)
Lynn D. Wilcox,Marcia A. Bush +1 more
- 18 Sep 1992
TL;DR: In this paper, a technique for speaker-dependent wordspotting based on hidden Markov models (HMM's) is proposed. But the technique requires a speaker to specify keywords dynamically and to train the associated HMM's via a single repetition of a keyword.
14
Keyword Spotting Based On CTC and RNN For Mandarin Chinese Speech
Yiyan Wang,Yanhua Long +1 more
- 01 Nov 2018
TL;DR: This work proposes a Mandarin KWS system using the end-to-end method, which directly predict the posterior of phonetic units, based on Connectionist Temporal Classifier and Recurrent Neural Network and adopts Mandarin syllables as the output labels.
14
Optimize What Matters: Training DNN-Hmm Keyword Spotting Model Using End Metric
Ashish Shrivastava,Arnav Kundu,Chandra Shekhar Dhir,Devang Naik,Oncel Tuzel +4 more
- 06 Jun 2021
TL;DR: In this paper, an end-to-end training strategy that learns the hidden Markov model parameters by optimizing for the detection score was proposed to solve the mismatch between the cross-entropy loss between the predicted and the ground-truth state probabilities.
14
References
A Maximum Likelihood Approach to Continuous Speech Recognition
TL;DR: This paper describes a number of statistical models for use in speech recognition, with special attention to determining the parameters for such models from sparse data, and describes two decoding methods appropriate for constrained artificial languages and one appropriate for more realistic decoding tasks.
1.7K
On the use of instantaneous and transitional spectral information in speaker recognition
F.K. Soong,Aaron E. Rosenberg +1 more
- 01 Apr 1986
TL;DR: The experimental results show that the instantaneous and transitional representations are relatively uncorrelated thus providing complementary information for speaker recognition, and simple transmission channel variations are shown to affect the instantaneous spectral representations and the corresponding recognition performance significantly, while the transitional representations and performance are relatively resistant.
Related Papers (5)
Richard Rose,D.B. Paul +1 more
- 03 Apr 1990
Guoguo Chen,Carolina Parada,Georg Heigold +2 more
- 04 May 2014
Jonathan Mamou,Bhuvana Ramabhadran,Olivier Siohan +2 more
- 23 Jul 2007