Perceptual linear predictive (PLP) analysis of speech

doi:10.1121/1.399423

Journal Article10.1121/1.399423

Perceptual linear predictive (PLP) analysis of speech

Hynek Hermansky

- 01 Apr 1990

- Journal of the Acoustical Society of Ame...

- Vol. 87, Iss: 4, pp 1738-1752

3.1K

TL;DR: A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, which uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum, and yields a low-dimensional representation of speech.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book

Perceptually inspired signal processing strategies for robust speech recognition in reverberant environments

Brian Kingsbury, +1 more

- 01 Jan 1998

TL;DR: This work presentsceptually Inspired Signal-processing Strategies for Robust Speech Recognition in Reverberant Environments, a novel approach to signal-processing that automates the very labor-intensive and therefore time-heavy and expensive process of recognizing speech.

...read moreread less

75

Patent

Systems and methods for structured stem and suffix language models

Jerome R. Bellegarda, +1 more

- 31 Aug 2015

TL;DR: The authors used a structured stem and suffix n-gram language model to predict words using a pre-existing word in the received input, and provided an output of the predicted word as an output to the user.

...read moreread less

75

Journal Article•10.1016/0743-7315(92)90067-W

The Ring Array Processor: a multiprocessing peripheral for connectionist applications

Nelson Morgan, +5 more

- 01 Mar 1992

- Journal of Parallel and Distributed Comp...

TL;DR: The motivation for the RAP is described and how the architecture matches the target algorithm is shown, which is to reduce peak performance on the error back-propagation algorithm to about 50% of a linear speedup.

...read moreread less

75

Journal Article•10.1109/TCSI.2020.2997913

A 22nm, 10.8 μ W/15.1 μ W Dual Computing Modes High Power-Performance-Area Efficiency Domained Background Noise Aware Keyword- Spotting Processor

Bo Liu, +10 more

- 02 Jun 2020

- IEEE Transactions on Circuits and System...

TL;DR: This paper proposes a high power-performance-area efficient background noise aware keyword-spotting (KWS) processor based on an optimized binarized weight network (BWN) processor with adaptively configured to use dual computing modes for both high recognition accuracy under high background noise and ultra-low power consumption under low background noise.

...read moreread less

74

•Proceedings Article

Cross-lingual and multi-stream posterior features for low resource LVCSR systems.

Samuel Thomas, +2 more

- 01 Jan 2010

TL;DR: This work proposes to train low resource LVCSR system with additional sources of information like annotated data from other languages (German and Spanish) and various acoustic feature streams (short-term and modulation features) and multilayer perceptrons (MLPs) on these sources of Information.

...read moreread less

74

...

Expand

References

•Book

Acoustic theory of speech production

Gunnar Fant

- 01 Jan 1960

3.6K

•Journal Article

Distance measures for speech recognition, psychological and instrumental

P. Mermelstein

- 01 Jan 1976

- Pattern Recognition and Artificial Intel...

464

Journal Article•10.1121/1.1912389

Effect of glottal pulse shape on the quality of natural vowels.

A. E. Rosenberg

- 01 Jul 1969

- Journal of the Acoustical Society of Ame...

TL;DR: In this article, a male speaker recorded monosyllabic words and a continuous sentence and a pitch-synchronous analysis was carried out by a digital computer on the vowel portions of these samples, for every pitch period, the analysis provided: formant frequencies, waveform of the glottal excitation function, and an accurate pitch-period measurement.

...read moreread less

434

•Book

The vowel, its nature and structure

Tsutomu Chiba

- 01 Jan 1958

363

Proceedings Article•10.1109/ICASSP.1982.1171512

Prediction of perceived phonetic distance from critical-band spectra: A first step

Dennis H. Klatt

- 03 May 1982

TL;DR: Judgements of phonetic distance between pairs of static synthetic vowels and fricatives have been collected in which the stimulus ensemble included formant frequency changes and a number of acoustic changes that turn out to have little phonetic relevance.

...read moreread less

349