Perceptual linear predictive (PLP) analysis of speech

doi:10.1121/1.399423

Journal Article10.1121/1.399423

Perceptual linear predictive (PLP) analysis of speech

Hynek Hermansky

- 01 Apr 1990

- Journal of the Acoustical Society of Ame...

- Vol. 87, Iss: 4, pp 1738-1752

3.1K

TL;DR: A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, which uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum, and yields a low-dimensional representation of speech.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Dissertation

Model-based techniques for noise robust speech recognition

M. J. F. Gales

- 16 Sep 1995

TL;DR: The development of a model-based noise compensation technique, Parallel Model Combination, to alter the parameters of a set of Hidden Markov Model (HMM) based acoustic models, so that they reeect speech spoken in a new acoustic environment is detailed.

...read moreread less

339

Patent

Crowd sourcing information to fulfill user requests

Thomas R. Gruber, +2 more

- 15 Mar 2013

TL;DR: In this article, a failure to provide a satisfactory response to a user request is detected and information relevant to the user request was crowd-sourced by querying one or more crowd sourcing information sources.

...read moreread less

323

•Journal Article•10.1016/J.CSL.2010.06.003

The subspace Gaussian mixture model-A structured model for speech recognition

Daniel Povey, +12 more

- 01 Apr 2011

- Computer Speech & Language

TL;DR: A new approach to speech recognition, in which all Hidden Markov Model states share the same Gaussian Mixture Model (GMM) structure with the same number of Gaussians in each state, appears to give better results than a conventional model.

...read moreread less

323

•Proceedings Article

Mel-generalized cepstral analysis - a unified approach to speech spectral estimation.

Keiichi Tokuda, +3 more

- 01 Jan 1994

TL;DR: This paper proposes a spectral estimation method which uses the spectral model represented by mel-generalized cepstral coefficients.

...read moreread less

318

Journal Article•10.1016/S0167-6393(98)00032-6

Robust speech recognition using the modulation spectrogram

Brian Kingsbury, +5 more

- 01 Aug 1998

- Speech Communication

TL;DR: Using the modulation spectrogram as a front end for ASR provides a significant improvement in performance on highly reverberant speech and when it is used in combination with log-RASTA-PLP performance over a range of noisy and reverberant conditions is significantly improved, suggesting that the use of multiple representations is another promising method for improving the robustness of ASR systems.

...read moreread less

307

...

Expand

References

•Book

Acoustic theory of speech production

Gunnar Fant

- 01 Jan 1960

3.6K

•Journal Article

Distance measures for speech recognition, psychological and instrumental

P. Mermelstein

- 01 Jan 1976

- Pattern Recognition and Artificial Intel...

464

Journal Article•10.1121/1.1912389

Effect of glottal pulse shape on the quality of natural vowels.

A. E. Rosenberg

- 01 Jul 1969

- Journal of the Acoustical Society of Ame...

TL;DR: In this article, a male speaker recorded monosyllabic words and a continuous sentence and a pitch-synchronous analysis was carried out by a digital computer on the vowel portions of these samples, for every pitch period, the analysis provided: formant frequencies, waveform of the glottal excitation function, and an accurate pitch-period measurement.

...read moreread less

434

•Book

The vowel, its nature and structure

Tsutomu Chiba

- 01 Jan 1958

363

Proceedings Article•10.1109/ICASSP.1982.1171512

Prediction of perceived phonetic distance from critical-band spectra: A first step

Dennis H. Klatt

- 03 May 1982

TL;DR: Judgements of phonetic distance between pairs of static synthetic vowels and fricatives have been collected in which the stimulus ensemble included formant frequency changes and a number of acoustic changes that turn out to have little phonetic relevance.

...read moreread less

349