Journal Article, DOI: 10.1109/TASL.2007.905144
On Integer MDCT for Perceptual Audio Coding
12
TL;DR: Noise introduced by the IntMDCT does not affect the perceptual quality of the coded audio under standard playback circumstances, which justifies using only the IntMDCT filterbank in scalable audio coding.
Abstract: In MPEG-4 scalable lossless coding (SLS), which was published as an ISO standard in June 2006, the integer modified discrete cosine transform (IntMDCT) was adopted to enable efficient lossless reconstruction. In addition, an MDCT filterbank is inherent to the advanced audio coding (AAC) core that is present in the SLS codec. The presence of two filterbanks increases the complexity of the implementation, and for this reason the MDCT is disabled and the IntMDCT is the only filterbank employed in SLS for both lossy and lossless operation. Because of the rounding operations in the IntMDCT, there is a concern that using the IntMDCT for perceptual audio coding may degrade the fidelity of the audio codec. This paper addresses this concern by analyzing the performance of the IntMDCT in a lossy coding scenario. It is found that noise introduced by the IntMDCT does not affect the perceptual quality of the coded audio under standard playback circumstances. The paper therefore concludes that the MDCT and IntMDCT filterbanks are interchangeable at lossy bitrates, and that using only the IntMDCT filterbank in scalable audio coding is justified.
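The rounding operations mentioned in the abstract can be illustrated with the basic building block that integer transforms of this kind are commonly built from: a plane rotation factored into three lifting steps, each followed by rounding. The sketch below is only an illustration of that principle, not the IntMDCT defined in MPEG-4 SLS; the function names `lift_rotate` and `lift_rotate_inverse` are hypothetical, and the angle is assumed not to be a multiple of pi.

```python
import math


def lift_rotate(x, y, angle):
    """Integer approximation of a rotation by `angle` using three lifting steps.

    Approximates (x*cos - y*sin, x*sin + y*cos) for integer x and y.
    Assumes sin(angle) != 0.
    """
    s = math.sin(angle)
    p = (math.cos(angle) - 1.0) / s  # lifting coefficient, equals -tan(angle / 2)
    x += round(p * y)  # first lifting step, rounded to an integer
    y += round(s * x)  # second lifting step
    x += round(p * y)  # third lifting step
    return x, y


def lift_rotate_inverse(x, y, angle):
    """Exact inverse of lift_rotate: undo the steps in reverse order."""
    s = math.sin(angle)
    p = (math.cos(angle) - 1.0) / s
    x -= round(p * y)  # undo third step (y is unchanged by that step)
    y -= round(s * x)  # undo second step
    x -= round(p * y)  # undo first step
    return x, y


if __name__ == "__main__":
    angle = math.pi / 4
    x, y = 1000, -337
    u, v = lift_rotate(x, y, angle)
    exact_u = x * math.cos(angle) - y * math.sin(angle)
    exact_v = x * math.sin(angle) + y * math.cos(angle)
    print("integer rotation:", (u, v))
    print("rounding noise:  ", (u - exact_u, v - exact_v))
    print("reconstructed:   ", lift_rotate_inverse(u, v, angle))
```

Each lifting step changes one value by a rounded function of the other, so it can be undone exactly by subtracting the same rounded quantity. That is why the transform is losslessly invertible even though its output differs from the floating-point rotation by a small rounding noise, which is the noise whose perceptual impact the paper studies.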
Citations
New universal rotation-based fast computational structures for an efficient implementation of the DCT-IV/DST-IV and analysis/synthesis MDCT/MDST filter banks
TL;DR: Since a Givens-Jacobi rotation can be factored into a product of Gauss elementary matrices (unit lower and unit upper triangular matrices), the new fast rotation-based computational structures are suitable for an integer approximation of the DCT-IV/DST-IV and MDCT/MDST, which are current transform technologies for lossless audio coding.
21
The CORDIC-inside-lifting architecture for constant-coefficient hardware quaternion multipliers
Nicolai A. Petrovsky, Marek Parfieniuk +1 more
- 24 Dec 2012
TL;DR: This paper presents an architecture for constant-coefficient hardware quaternion multipliers that combines the CORDIC algorithm with a lifting scheme, ensuring that the obtainable approximations of hypercomplex multiplications are perfectly invertible.
10
Fixed Quality Layered Audio Based on Scalable Lossless Coding
TL;DR: The paper addresses a bitstream scalable coder based on the MPEG-4 scalable lossless (SLS) coding system where, in contrast to SLS, the bitrate of the enhancement layer is not fixed but instead an attempt is made to create a quality-fixed enhancement layer.
6
Analysis of audio signal using integer MDCT with Kaiser Bessel Derived window
M. Davidson Kamala Dhas, P. Maria Sheeba +1 more
- 01 Jan 2017
TL;DR: The integer approximation of the lapped transform, called the Integer MDCT, is derived from the MDCT by means of rounding operations or the lifting scheme; it inherits most of the properties of the MDCT and provides good spectral representation and perfect invertibility (reconstruction).
3
Integer Approximate Cosine/Sine-Modulated Filter Banks
Vladimir Britanak, K. R. Rao
- 01 Jan 2018
TL;DR: The local and global methods for integer approximation of perfect reconstruction cosine/sine-modulated filter banks and cosine-modulated QMF banks are discussed in detail. They are based on computational methods of linear algebra, matrix theory and matrix computations, and in particular on matrix decompositions.
3
References
Factoring wavelet transforms into lifting steps
Ingrid Daubechies, Wim Sweldens
TL;DR: This paper presents a self-contained derivation, from basic principles such as the Euclidean algorithm, of the factorization of wavelet transforms into lifting steps, with a focus on applying it to wavelet filtering. The factorization asymptotically reduces the computational complexity of the transform by a factor of two.
A generalized Gaussian image model for edge-preserving MAP estimation
TL;DR: In this article, a generalized Gaussian Markov random field (GGMRF) is proposed for image reconstruction in low-dosage transmission tomography, which satisfies several desirable analytical and computational properties for MAP estimation, including continuous dependence of the estimate on the data and invariance of the character of solutions to scaling of data.
Perceptual coding of digital audio
T. Painter, Andreas Spanias
- 01 Apr 2000
TL;DR: This paper reviews methodologies that achieve perceptually transparent coding of FM- and CD-quality audio signals, including algorithms that manipulate transform components, subband signal decompositions, sinusoidal signal components, and linear prediction parameters, as well as hybrid algorithms that make use of more than one signal model.
Subband/Transform coding using filter bank designs based on time domain aliasing cancellation
John Princen, Andrew Johnson, A. Bradley
- 06 Apr 1987
TL;DR: A new, oddly stacked, critically sampled, single side-band (SSB) analysis/synthesis system based on Time Domain Aliasing Cancellation (TDAC) is described in this paper.
460
Fast multiplierless approximations of the DCT with the lifting scheme
Jie Liang, Trac D. Tran
TL;DR: The binDCT can be tuned to cover the gap between the Walsh-Hadamard transform and the DCT; it allows a 16-bit implementation, enables lossless compression, and maintains satisfactory compatibility with the floating-point DCT.