Perceptual linear predictive (PLP) analysis of speech

doi:10.1121/1.399423

Journal Article10.1121/1.399423

Perceptual linear predictive (PLP) analysis of speech

Hynek Hermansky

- 01 Apr 1990

- Journal of the Acoustical Society of Ame...

- Vol. 87, Iss: 4, pp 1738-1752

3.1K

TL;DR: A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, which uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum, and yields a low-dimensional representation of speech.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.21437/INTERSPEECH.2015-704

Autoencoder based multi-stream combination for noise robust speech recognition

Sri Harish Mallidi, +4 more

- 06 Sep 2015

TL;DR: This work proposes to use autoencoders which are multi-layer feed forward neural networks, for estimating conﬁdence measure, and shows that the reconstruction error of the autoencoder is correlated to the robustness of the corresponding stream.

...read moreread less

21

Patent

Voice Activity Detection Using A Soft Decision Mechanism

Ron Wein

- 01 Aug 2014

TL;DR: In this article, a robust VAD algorithm that is also language independent is presented, where instead of classifying short segments of the audio as either speech or silence, the VAD as disclosed herein employees a soft-decision mechanism.

...read moreread less

21

•Proceedings Article•10.30019/IJCLCLP.200509.0004

Detecting Emotions in Mandarin Speech

Tsang-Long Pao, +3 more

- 01 Sep 2004

TL;DR: A Mandarin speech based emotion classification method based on three classification techniques: LDA, K-NN and HMMs, which shows that the selected features are robust and effective for the emotion recognition in the valence and arousal dimensions of the two corpora.

...read moreread less

21

Patent•10.1121/1.3455415

Computer-implemented methods and systems for modeling and recognition of speech

Marios Athineos, +1 more

- 25 Mar 2005

- Journal of the Acoustical Society of Ame...

TL;DR: In this article, a time-to-frequency domain transformation is performed on at least a portion of the received signal to generate a frequency domain representation, which is then converted from a time domain representation to the frequency domain.

...read moreread less

21

•Book Chapter•10.5772/52023

Speaker Recognition: Advancements and Challenges

Homayoon Beigi

- 28 Nov 2012

TL;DR: A review of the most recent literature is presented and the latest techniques which are being deployed in the various branches of this technology are briefly visited.

...read moreread less

21

...

Expand

References

•Book

Acoustic theory of speech production

Gunnar Fant

- 01 Jan 1960

3.6K

•Journal Article

Distance measures for speech recognition, psychological and instrumental

P. Mermelstein

- 01 Jan 1976

- Pattern Recognition and Artificial Intel...

464

Journal Article•10.1121/1.1912389

Effect of glottal pulse shape on the quality of natural vowels.

A. E. Rosenberg

- 01 Jul 1969

- Journal of the Acoustical Society of Ame...

TL;DR: In this article, a male speaker recorded monosyllabic words and a continuous sentence and a pitch-synchronous analysis was carried out by a digital computer on the vowel portions of these samples, for every pitch period, the analysis provided: formant frequencies, waveform of the glottal excitation function, and an accurate pitch-period measurement.

...read moreread less

434

•Book

The vowel, its nature and structure

Tsutomu Chiba

- 01 Jan 1958

363

Proceedings Article•10.1109/ICASSP.1982.1171512

Prediction of perceived phonetic distance from critical-band spectra: A first step

Dennis H. Klatt

- 03 May 1982

TL;DR: Judgements of phonetic distance between pairs of static synthetic vowels and fricatives have been collected in which the stimulus ensemble included formant frequency changes and a number of acoustic changes that turn out to have little phonetic relevance.

...read moreread less

349