Automatic recognition of keywords in unconstrained speech using hidden Markov models

doi:10.1109/29.103088

Journal Article10.1109/29.103088

Automatic recognition of keywords in unconstrained speech using hidden Markov models

Jay G. Wilpon, +3 more

- 01 Nov 1990

- IEEE Transactions on Acoustics, Speech, ...

- Vol. 38, Iss: 11, pp 1870-1878

498

TL;DR: The modifications made to a connected word speech recognition algorithm based on hidden Markov models which allow it to recognize words from a predefined vocabulary list spoken in an unconstrained fashion are described.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/S0167-6393(99)00063-1

Is ASR ready for wireless primetime: measuring the core technology for selected applications

Harry M. Chang

- 01 Aug 2000

- Speech Communication

TL;DR: A set of benchmark tasks designed to evaluate the state-of-the-art ASR technologies from a wireless perspective are described and the results of these benchmark tests on two commercially available software-based ASR systems that represent the best core ASR technology on the market are presented.

...read moreread less

6

Journal Article•10.1023/B:IJST.0000017016.91296.88

Unsupervised Learning of a Chinese Spontaneous and Colloquial Speech Lexicon with Content and Filler Phrase Classification

Cheung Chi-Shun, +1 more

- 01 Apr 2004

- International Journal of Speech Technolo...

TL;DR: This work proposes an unsupervised learning method to find colloquial terms and classify filler and content phrases in spontaneous andColloquial Chinese, including Cantonese, and adapts a language model trained from written texts with the Hong Kong Newsgroup corpus that outperforms both the standard Chinese language model and also the Cantonesese language model.

...read moreread less

6

Proceedings Article•10.1109/slt54892.2023.10023079

Towards Visually Prompted Keyword Localisation for Zero-Resource Spoken Languages

09 Jan 2023

TL;DR: This article proposed a speech-vision model with a novel localising attention mechanism which they train with a new keyword sampling scheme, and showed that these innovations give improvements in VPKL over an existing speech-viz model.

...read moreread less

5

•Posted Content

Learning acoustic word embeddings with phonetically associated triplet network

Hyungjun Lim, +4 more

- 07 Nov 2018

- arXiv: Audio and Speech Processing

TL;DR: A novel architecture, phonetically associated triplet network (PATN), which aims at increasing discriminative power of acoustic word embeddings by utilizing phonetic information as well as word identity.

...read moreread less

5

Reconocimiento automático de voz en condiciones de ruido

Angel de la Torre Vega, +2 more

- 01 Jan 2001

5

...

Expand

References

Journal Article•10.1109/TASSP.1975.1162641

Minimum prediction residual principle applied to speech recognition

F. Itakura

- 01 Feb 1975

- IEEE Transactions on Acoustics, Speech, ...

TL;DR: A computer system is described in which isolated words, spoken by a designated talker, are recognized through calculation of a minimum prediction residual through optimally registering the reference LPC onto the input autocorrelation coefficients using the dynamic programming algorithm.

...read moreread less

1.7K

Journal Article•10.1109/PROC.1976.10159

Continuous speech recognition by statistical methods

Frederick Jelinek

- 01 Apr 1976

TL;DR: Experimental results are presented that indicate the power of the methods and concern modeling of a speaker and of an acoustic processor, extraction of the models' statistical parameters and hypothesis search procedures and likelihood computations of linguistic decoding.

...read moreread less

1.1K

Large-vocabulary speaker-independent continuous speech recognition: the sphinx system

Raj Reddy, +1 more

- 01 Jan 1988

436

Journal Article•10.1002/J.1538-7305.1985.TB00272.X

Recognition of isolated digits using hidden Markov models with continuous mixture densities

Lawrence R. Rabiner, +3 more

- 08 Jul 1985

- AT&T technical journal

TL;DR: This paper extends previous work on isolated-word recognition based on hidden Markov models by replacing the discrete symbol representation of the speech signal with a continuous Gaussian mixture density, thereby eliminating the inherent quantization error introduced by the discrete representation.

...read moreread less

296

Journal Article•10.1002/J.1538-7305.1986.TB00368.X

A segmental k-means training procedure for connected word recognition

Lawrence R. Rabiner, +2 more

- 06 May 1986

- AT&T technical journal

TL;DR: In this paper, a segmental k-means training procedure was used to extract whole-word patterns from naturally spoken word strings, which were then used to create a set of word reference patterns for recognition.

...read moreread less

257