Journal Article, DOI: 10.1109/89.848220
Vector quantization based on Gaussian mixture models
Per Hedelin, Jan Skoglund
176
TL;DR: It is found that an optimal single-stage VQ can operate at approximately 3 bits less than a state-of-the-art LSF-based 2-split VQ.
Abstract: We model the underlying probability density function of vectors in a database as a Gaussian mixture (GM) model. The model is employed for high rate vector quantization analysis and for design of vector quantizers. It is shown that the high rate formulas accurately predict the performance of model-based quantizers. We propose a novel method for optimizing GM model parameters for high rate performance, and an extension to the EM algorithm for densities having bounded support is also presented. The methods are applied to quantization of LPC parameters in speech coding and we present new high rate analysis results for band-limited spectral distortion and outlier statistics. In practical terms, we find that an optimal single-stage VQ can operate at approximately 3 bits less than a state-of-the-art LSF-based 2-split VQ.
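As a rough illustration of the first step in the abstract, modeling the density of database vectors as a Gaussian mixture, the sketch below fits a one-dimensional mixture with plain EM. It is a minimal sketch under stated assumptions, not the authors' method: it covers neither the proposed high-rate optimization of the GM parameters nor the EM extension for bounded support, and the function names (`gaussian_pdf`, `fit_gmm_1d`), the quantile initialization, and the variance floor are all hypothetical choices made for this example.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(data, n_components=2, n_iter=50, var_floor=1e-6):
    """Fit a 1-D Gaussian mixture with standard EM (illustrative only).

    Returns (weights, means, variances), one entry per component.
    """
    n = len(data)
    ordered = sorted(data)
    # Assumption: initialise the means at evenly spaced sample quantiles.
    means = [ordered[int((k + 0.5) * n / n_components)] for k in range(n_components)]
    overall_mean = sum(data) / n
    overall_var = sum((x - overall_mean) ** 2 for x in data) / n
    variances = [max(overall_var, var_floor)] * n_components
    weights = [1.0 / n_components] * n_components

    for _ in range(n_iter):
        # E-step: posterior probability of each component for each sample.
        resp = []
        for x in data:
            p = [w * gaussian_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            total = sum(p) or 1e-300  # guard against numerical underflow
            resp.append([pk / total for pk in p])

        # M-step: re-estimate weight, mean, and variance of each component.
        for k in range(n_components):
            nk = sum(r[k] for r in resp)
            if nk < 1e-12:
                continue  # leave an empty component unchanged
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(var_k, var_floor)

    return weights, means, variances


if __name__ == "__main__":
    # Synthetic data standing in for a database of (scalar) parameters.
    random.seed(0)
    samples = [random.gauss(-3.0, 1.0) for _ in range(500)]
    samples += [random.gauss(3.0, 1.0) for _ in range(500)]
    print(fit_gmm_1d(samples))
```

In the setting described by the abstract the data are multi-dimensional LPC parameter vectors, so a practical implementation would use full or diagonal covariance matrices rather than scalar variances.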
Citations
HCP: A Flexible CNN Framework for Multi-Label Image Classification
TL;DR: Experimental results on the Pascal VOC 2007 and VOC 2012 multi-label image datasets demonstrate the superiority of the proposed HCP infrastructure, which takes an arbitrary number of object segment hypotheses as inputs, over other state-of-the-art methods.
Bayesian Estimation of Beta Mixture Models with Variational Inference
Zhanyu Ma, Arne Leijon
TL;DR: An approximation to the prior/posterior distribution of the parameters in the beta distribution is introduced and an analytically tractable (closed form) Bayesian approach to the parameter estimation is proposed.
244
PDF optimized parametric vector quantization of speech line spectral frequencies
A.D. Subramaniam, Bhaskar D. Rao
TL;DR: A low complexity quantization scheme using transform coding and bit allocation techniques which allows for easy mapping from observation to quantized value is developed for both fixed rate and variable rate systems.
151
Patent
Near-transparent or transparent multi-channel encoder/decoder scheme
Lindblom Jonas
- 04 Oct 2005
TL;DR: In this paper, a near-transparent or transparent multi-channel encoder/decoder scheme is proposed to generate a waveform-type residual signal with one or more multichannel parameters.
130
Bounded generalized Gaussian mixture model
TL;DR: This paper proposes an extension of the generalized Gaussian distribution that has the flexibility to fit different shapes of observed data, such as non-Gaussian and bounded support data, and proposes an alternate approach to minimize the higher bound on the data negative log-likelihood function.
93
References
•Book
Elements of information theory
Thomas M. Cover, Joy A. Thomas
- 01 Jan 1991
TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.
•Book
Vector Quantization and Signal Compression
Allen Gersho, Robert M. Gray
- 01 Jan 1991
TL;DR: The author explains the design and implementation of the Levinson-Durbin Algorithm, which automates the very labor-intensive and therefore time-heavy and expensive process of designing and implementing a Quantizer.
8K
Robust text-independent speaker identification using Gaussian mixture speaker models
Douglas A. Reynolds, Richard Rose
TL;DR: The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity, and the GMM is shown to outperform the other speaker modeling techniques on an identical 16-speaker telephone speech task.
3.3K