Continuous hidden Markov modeling for speaker-independent word spotting

doi:10.1109/ICASSP.1989.266505

Proceedings Article

J.R. Rohlicek, +3 more

- 23 May 1989

- pp 627-630

TL;DR: A word-spotting system using Gaussian hidden Markov models is presented and it is observed that performance can be greatly affected by the choice of features used, the covariance structure of the Gaussian models, and transformations based on energy and feature distributions.

Citations

Proceedings Article•10.21437/INTERSPEECH.2004-575

Speech spotter: On-demand speech recognition in human-human conversation on the telephone or in face-to-face situations

Masataka Goto, +3 more

- 01 Jan 2004

TL;DR: A novel speech-interface function, called “speech spotter”, is described, which lets a user enter voice commands into a speech recognizer in the midst of natural human-human conversation and is found to be robust and convenient enough to be used in face-to-face or cellular-phone conversations.

Proceedings Article•10.1109/ICASSP.1992.226108

Robust mapping of noisy speech parameters for HMM word spotting

Kenney Ng, +2 more

- 23 Mar 1992

TL;DR: It is demonstrated that using the proposed probabilistic vector mapping algorithm as a feature preprocessor results in robust performance across a wide range of signal-to-noise ratio (SNR) levels.

Patent

Wordspotting using two hidden Markov models (HMM)

Lynn D. Wilcox, +1 more

- 18 Sep 1992

TL;DR: A technique for speaker-dependent wordspotting based on hidden Markov models (HMMs) is proposed, in which a speaker specifies keywords dynamically and trains the associated HMMs via a single repetition of a keyword.

Proceedings Article•10.1109/ISCSLP.2018.8706631

Keyword Spotting Based On CTC and RNN For Mandarin Chinese Speech

Yiyan Wang, +1 more

- 01 Nov 2018

TL;DR: This work proposes a Mandarin KWS system using an end-to-end method that directly predicts the posterior of phonetic units, based on Connectionist Temporal Classification (CTC) and a Recurrent Neural Network (RNN), and adopts Mandarin syllables as the output labels.

Proceedings Article•10.1109/ICASSP39728.2021.9414797

Optimize What Matters: Training DNN-HMM Keyword Spotting Model Using End Metric

Ashish Shrivastava, +4 more

- 06 Jun 2021

TL;DR: In this paper, an end-to-end training strategy that learns the model parameters by optimizing for the detection score is proposed to address the mismatch between the training loss (the cross-entropy between the predicted and the ground-truth state probabilities) and the end metric.

References

Journal Article•10.1109/TPAMI.1983.4767370

A Maximum Likelihood Approach to Continuous Speech Recognition

Lalit R. Bahl, +2 more

- 01 Feb 1983

- IEEE Transactions on Pattern Analysis an...

TL;DR: This paper describes a number of statistical models for use in speech recognition, with special attention to determining the parameters for such models from sparse data, and describes two decoding methods appropriate for constrained artificial languages and one appropriate for more realistic decoding tasks.

Proceedings Article•10.1109/ICASSP.1986.1168882

On the use of instantaneous and transitional spectral information in speaker recognition

F.K. Soong, +1 more

- 01 Apr 1986

TL;DR: The experimental results show that the instantaneous and transitional representations are relatively uncorrelated, thus providing complementary information for speaker recognition. Simple transmission channel variations are shown to significantly affect the instantaneous spectral representations and the corresponding recognition performance, while the transitional representations and their performance are relatively resistant.

Related Papers (5)

A hidden Markov model based keyword recognition system

Small-footprint keyword spotting using deep neural networks

An application of recurrent neural networks to discriminative keyword spotting

Vocabulary independent spoken term detection

Rapid and accurate spoken term detection.