Patent
Multi-channel audio decoder
Smyth Stephen M,Smyth Michael H,Smith William Paul +2 more
- 16 Dec 1997
183
TL;DR: A subband audio coder employs perfect/nonperfect reconstruction filters, predictive/non-predictive subband encoding, transient analysis, and psycho-acoustic/minimum mean square error (mmse) bit allocation over time, frequency and the multiple audio channels to encode/decode a data stream to generate high fidelity reconstructed audio as mentioned in this paper.
read more
Abstract: A subband audio coder employs perfect/non-perfect reconstruction filters, predictive/non-predictive subband encoding, transient analysis, and psycho-acoustic/minimum mean-square-error (mmse) bit allocation over time, frequency and the multiple audio channels to encode/decode a data stream to generate high fidelity reconstructed audio. The audio coder windows the multi-channel audio signal such that the frame size, i.e. number of bytes, is constrained to lie in a desired range, and formats the encoded data so that the individual subframes can be played back as they are received thereby reducing latency. Furthermore, the audio coder processes the baseband portion (0-24 kHz) of the audio bandwidth for sampling frequencies of 48 kHz and higher with the same encoding/decoding algorithm so that audio coder architecture is future compatible.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Sound enhancement for mobile phones and other products producing personalized audio for users
TL;DR: In this paper, the authors describe a personal communication device consisting of a transmitter/receiver coupled to a communication medium for transmitted receiving audio signals, control circuitry that controls transmission, reception and processing of call and audio signals.
212
Patent
Enhancing Perceptual Performance of SBR and Related HFR Coding Methods by Adaptive Noise-Floor Addition and Noise Substitution Limiting
Liljeryd Lars Gustaf,Kristofer Kjoerling,Per Ekstrand,Fredrik Henn +3 more
- 24 Jun 2009
TL;DR: In this article, the problem of insufficient noise contents is addressed in a reconstructed highband, by using Adaptive Noise-floor Addition, which is applicable to both speech coding and natural audio coding systems.
160
Patent
System and method for providing single microphone noise suppression fallback
Mark Every,Carlos Avendano,Ludger Solbach,Carlo Murgia +3 more
- 29 Feb 2008
TL;DR: In this paper, a single microphone noise estimate is derived from the primary and secondary acoustic signals, and a combined noise estimate based on the single and dual microphone noise estimates is then determined.
145
Patent
Multi-channel audio encoding and decoding
Naveen Thumpudi,Wei-ge Chen +1 more
- 04 Sep 2003
TL;DR: In this article, the authors describe architectures and techniques that improve the efficiency of multi-channel audio coding and decoding, which can be used in combination or independently, and describe various techniques and tools.
144
Patent
System and method for utilizing omni-directional microphones for speech enhancement
Carlos Avendano
- 29 Jan 2007
TL;DR: In this article, an inter-microphone level difference (ILD) was used to attenuate noise and enhance speech. But the ILD was not used to enhance the speech of the primary acoustic signal.
144
References
Transform coding of audio signals using perceptual noise criteria
TL;DR: A 4-b/sample transform coder is designed using a psychoacoustically derived noise-making threshold that is based on the short-term spectrum of the signal, and tested in a formal subjective test involving a wide selection of monophonic audio inputs.
Variable rate vocoder
Jacobs Paul E,Gardner William R,Lee Chong U,Gilhousen Klein S,Lam S Katherine,Ming-Chang Tsai +5 more
TL;DR: In this paper, a variable rate coding of frames of digitized speech samples is proposed, comprising the steps of determining a level of speech activity for a frame of digitised speech samples, selecting an encoding rate from a set of rates based upon the determined level of activity within said frame, and coding said frame according to a predetermined coding format for said selected rate wherein each rate has a corresponding different coding format.
552
Mode-specific method and apparatus for encoding signals containing speech
TL;DR: A method for encoding a signal that includes a speech component that is classified in one of at least two modes, based, for example, on pitch stationarity, short-term level gradient or zero crossing rate, is described.
282
Adaptive-block-length, adaptive-transform, and adaptive-window transform coder, decoder, and encoder/decoder for high-quality audio
TL;DR: In this article, the tradeoff between time resolution and frequency resolution is optimized by adaptively selecting the transform block length for each sampled audio segment, and/or can optimize coding gain by adapting the transform and analysis window or the analysis/synthesis window pair.
275
Method and apparatus for coding audio signals based on perceptual model
TL;DR: In this paper, a perceptual filterbank coder is used to decode high quality stereophonic audio signals, which exploits the interchannel redundancies and psychoacoustic properties of stereo audio signals.
238