Patent10.1121/1.417887
Voice encoding method and voice decoding method
80
TL;DR: A compressed digital speech signal is encoded to provide a transmission error-resistant transmission signal to support voiced/unvoiced sound discrimination and CRC codes are presented.
read more
Abstract: A compressed digital speech signal is encoded to provide a transmission error-resistant transmission signal. The compressed speech signal is derived from a digital speech signal by performing a pitch search on a block obtained by dividing the speech signal in time to provide pitch information for the block. The block of the speech signal is orthogonally transformed to provide spectral data, which is divided by frequency into plural bands in response to the pitch information. A voiced/unvoiced sound discrimination generates voiced/-unvoiced (V/UV) information indicating whether the spectral data in each of the plural bands represents a voiced or an unvoiced sound. The spectral data in the plural bands are interpolated to provide spectral amplitudes for a predetermined number of bands, independent of the pitch. Hierarchical vector quantizing is applied to the spectral amplitudes to generate upper-layer indices, representing an overview of the spectral amplitudes, and lower-layer indices, representing details of the spectral amplitudes. CRC error detection coding is applied to the upper-layer indices, the pitch information, and the V/UV information to generate CRC codes. Convolution coding for error correction is applied to the upper-layer indices, the higher-order bits of the lower-layer indices, the pitch information, the V/UV information, and the CRC codes. The convolution-coded quantities from two blocks of the speech signal are then interleaved in a frame of the transmission signal, together with the lower-order bits of the respective lower-layer indices.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Patent
Encoding device and decoding device
Mineo Tsushima,Takeshi Norimatsu,Kosuke Nishio,Naoya Tanaka +3 more
- 07 Nov 2002
TL;DR: In this paper, an encoding device (200) includes an MDCT unit (202) that transforms an input signal in a time domain into a frequency spectrum including a lower frequency spectrum, a BWE encoding unit (204) that generates extension data which specifies a higher frequency spectrum at higher frequency than the lower spectrum, and an encoded data stream generating unit (205) that encodes to output the lower-frequency spectrum obtained by the MDCT units and the extension data obtained by BWE units.
232
Patent
Encoding device, decoding device, and method thereof
Tomofumi Yamanashi,Masahiro Oshikiri +1 more
- 29 Feb 2008
TL;DR: The speech encoding apparatus (220) as mentioned in this paper has a first layer encoding section (2201), a first-layer decoding section (2202), a delay section(2203), a subtracting section (104), a frequency domain transforming section (101), a second-layer encoding section(105) and a multiplexing section(106).
186
Patent
Audio coding systems and methods
TL;DR: In this paper, an audio signal is decomposed into lower and upper sub-band and at least the noise component of the upper subband is encoded at the decoder by a decoding means which utilises a synthesised noise excitation signal and a filter to reproduce the noise components in the lower subband.
160
Patent
Quality improvement techniques in an audio encoder
Wei-ge Chen,Naveen Thumpudi,Ming-Chieh Lee +2 more
- 14 Dec 2001
TL;DR: In this paper, an audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon energy separation between the coding channels, and the disparity between excitation patterns of the separate input channels.
150
Patent
Multi-channel audio encoding and decoding
Naveen Thumpudi,Wei-ge Chen +1 more
- 04 Sep 2003
TL;DR: In this article, the authors describe architectures and techniques that improve the efficiency of multi-channel audio coding and decoding, which can be used in combination or independently, and describe various techniques and tools.
144
References
Product code vector quantizers for waveform and voice coding
M. J. Sabin,Robert M. Gray +1 more
TL;DR: Several algorithms are presented for the design of shape-gain vector quantizers based on a traning sequence of data or a probabilistic model, and their performance is compared to that of previously reported vector quantization systems.
316
Method for protecting multi-pulse coders from fading and random pattern bit errors
TL;DR: In this paper, a low-overhead method of protecting multi-pulse speech coders from the effects of severe random or fading pattern bit errors combines a standard error correcting code (convolutional rate 1/2 coding and Viterbi trellis decoding) for protection in random errors with cyclic redundancy code (CRC) error detection for fading errors.
59
Fading bit error protection for digital cellular multi-pulse speech coder
TL;DR: Protection of a digital multi-pulse speech coder from fading pattern bit errors common in a digital mobile radio channel is accomplished with error detection techniques which are simple to implement and require no error correcting codes.
59
Voice signal encoding and decoding apparatus and method
TL;DR: A voice signal encoding and decoding apparatus and method which attenuates rapidly a voice signal when there occurs an error in transmission by giving increased robustness to the transmission error.
16
Multiband excitation vocoder
Daniel W. Griffin,Jae Lim +1 more
TL;DR: A speech model, referred to as the multiband excitation model, is presented where the band around each harmonic of the fundamental frequency is declared voiced or unvoiced and methods to synthesize speech from the model parameters are described.
Related Papers (5)
Harald Gustafsson,Ulf Lindgren,Clas Thurban,Petra Deutgen +3 more
- 05 Jan 2001
Juha Ojanpera
- 21 Mar 2003