Proceedings Article10.1109/ICASSP.2015.7179111
MDCT audio coding with pulse vector quantizers
Jonas Svedberg,Volodya Grancharov,Sigurdur Sverrisson,Erik Norvell,Tomas Jansson Toftgård,Harald Pobloth,Stefan Bruhn +6 more
- 19 Apr 2015
- pp 5937-5941
4
TL;DR: A complexity analysis in terms of WMOPS is presented to illustrate that the proposed Split-PVQ concept and dynamic range optimized MPVQ-indexing are suitable for real-time audio coding.
read more
Abstract: This paper describes a novel audio coding algorithm that is a building block in the recently standardized 3GPP EVS codec [1]. The presented scheme operates in the Modified Discrete Cosine Transform (MDCT) domain and deploys a Split-PVQ pulse coding quantizer, a noise-fill, and a gain control optimized for the quantizer's properties. A complexity analysis in terms of WMOPS is presented to illustrate that the proposed Split-PVQ concept and dynamic range optimized MPVQ-indexing are suitable for real-time audio coding. Test results from formal MOS subjective evaluations and objective performance figures are presented to illustrate the competitiveness of the proposed algorithm.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Overview of the EVS codec architecture
Martin Dietz,Markus Multrus,Vaclav Eksler,Vladimir Malenovsky,Erik Norvell,Harald Pobloth,Lei Miao,Zhe Wang,Lasse Juhani Laaksonen,Adriana Vasilache,Yutaka Kamamoto,Kei Kikuiri,Stéphane Ragot,Julien Faure,Hiroyuki Ehara,Vivek Rajendran,Atti Venkatraman S,Ho-Sang Sung,Eunmi Oh,Hao Yuan,Changbao Zhu +20 more
- 19 Apr 2015
TL;DR: An overview of the underlying architecture as well as the novel technologies in the EVS codec are given and listening test results showing the performance of the new codec in terms of compression and speech/audio quality are presented.
140
Advances in low bitrate time-frequency coding
Tommy Vaillancourt,Vladimir Malenovsky,Redwan Salami,Zexin Liu,Lei Miao,Jon Gibbs,Milan Jelinek +6 more
- 19 Apr 2015
TL;DR: A novel technique is presented to efficiently mix traditional ACELP time domain coding with a frequency domain coding model to improve the quality of generic audio signals coded at low bitrates without additional delay.
6
Patent
Pyramid vector quantizer shape search
Jonas Svedberg
- 25 Jun 2015
TL;DR: In this article, an encoder and a method for pyramid vector quantizer (PVQ) shape search is described. And the encoder determines, based on the maximum pulse amplitude, maxamp y, of a current vector y, whether more than a current bit word length is needed to represent enloop y, in a lossless manner in the upcoming inner dimension loop.
6
Frequency Domain Coding
Tom Bäckström
- 01 Jan 2017
TL;DR: This chapter gives an overview of the main components of frequency domain coding methods, which include windowing, a time-frequency transform, perceptual modelling and entropy coding of the spectral components.
References
•Book
Vector Quantization and Signal Compression
Allen Gersho,Robert M. Gray +1 more
- 01 Jan 1991
TL;DR: The author explains the design and implementation of the Levinson-Durbin Algorithm, which automates the very labor-intensive and therefore time-heavy and expensive process of designing and implementing a Quantizer.
8K
A pyramid vector quantizer
TL;DR: Although suboptimum in a rate-distortion sense, because the PVQ can encode large-dimensional vectors, it offers significant reduction in rose distortion compared with the optimum Lloyd-Max scalar quantizer, and provides an attractive alternative to currently available vector quantizers.
359
Definition of the Opus Audio Codec
Jean-Marc Valin,Koen Vos +1 more
- 01 Sep 2012
TL;DR: This document describes the Opus codec, designed for interactive speech and audio transmission over the Internet.
262
AMR-WB+: a new audio coding standard for 3rd generation mobile audio services
J. Makinen,B. Bessette,S. Bruhn,Pasi Ojala,R. Salami,A. Taleb +5 more
- 18 Mar 2005
TL;DR: The requirements imposed by mobile audio services are discussed and a technology overview of AMR-WB+ as a codec matching these requirements while providing outstanding audio quality is given.
141
Error-resilient pyramid vector quantization for image compression
TL;DR: This paper proposes a new method of deriving the indices of the lattice points of the multidimensional pyramid and describes how these techniques can also improve the channel noise immunity of general symmetric lattice quantizers.
72