Perceptual linear predictive (PLP) analysis of speech

doi:10.1121/1.399423

Journal Article10.1121/1.399423

Perceptual linear predictive (PLP) analysis of speech

Hynek Hermansky

- 01 Apr 1990

- Journal of the Acoustical Society of Ame...

- Vol. 87, Iss: 4, pp 1738-1752

3.1K

TL;DR: A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, which uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum, and yields a low-dimensional representation of speech.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1109/ICASSP.2006.1660022

Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons

Andreas Stolcke, +5 more

- 14 May 2006

TL;DR: It is shown that even without retraining, English-trained MLP features can provide a significant boost to recognition accuracy in new domains within the same language, as well as in entirely different languages such as Mandarin and Arabic.

...read moreread less

134

Patent

Multi-tiered voice feedback in an electronic device

James Eric Mason, +1 more

- 01 Sep 2009

TL;DR: In this paper, the authors proposed a voice feedback system that provides voice feedback for displayed speakable elements based on the associated tier of the display of each speakable element and the audio files for each speaker.

...read moreread less

133

•Dissertation•10.3990/1.9789036527125

Segmentation, Diarization and Speech Transcription: Surprise Data Unraveled

Marijn Huijbregts

- 21 Nov 2008

TL;DR: In this thesis methods are presented for which no external training data is required for training models, and these novel methods have been implemented in a large vocabulary continuous speech recognition system called SHoUT.

...read moreread less

133

Proceedings Article•10.1109/ICASSP.2014.6854054

Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions

Samuel Thomas, +3 more

- 04 May 2014

TL;DR: CNNs are used as acoustic models for speech activity detection (SAD) on data collected over noisy radio communication channels to illustrate that CNNs have a considerable advantage in fast adaptation for acoustic modeling in these settings.

...read moreread less

132

Patent

Digital assistant providing whispered speech

Tuomo Raitio, +3 more

- 15 Sep 2016

TL;DR: In this article, a system and processes for detecting and/or providing a whispered speech response are provided, where speech is received from a user, and based on the speech input, determined that a whispering speech response is to be provided.

...read moreread less

132

...

Expand

References

•Book

Acoustic theory of speech production

Gunnar Fant

- 01 Jan 1960

3.6K

•Journal Article

Distance measures for speech recognition, psychological and instrumental

P. Mermelstein

- 01 Jan 1976

- Pattern Recognition and Artificial Intel...

464

Journal Article•10.1121/1.1912389

Effect of glottal pulse shape on the quality of natural vowels.

A. E. Rosenberg

- 01 Jul 1969

- Journal of the Acoustical Society of Ame...

TL;DR: In this article, a male speaker recorded monosyllabic words and a continuous sentence and a pitch-synchronous analysis was carried out by a digital computer on the vowel portions of these samples, for every pitch period, the analysis provided: formant frequencies, waveform of the glottal excitation function, and an accurate pitch-period measurement.

...read moreread less

434

•Book

The vowel, its nature and structure

Tsutomu Chiba

- 01 Jan 1958

363

Proceedings Article•10.1109/ICASSP.1982.1171512

Prediction of perceived phonetic distance from critical-band spectra: A first step

Dennis H. Klatt

- 03 May 1982

TL;DR: Judgements of phonetic distance between pairs of static synthetic vowels and fricatives have been collected in which the stimulus ensemble included formant frequency changes and a number of acoustic changes that turn out to have little phonetic relevance.

...read moreread less

349