Speech coding system and method using voicing probability determination

doi:10.1121/1.427004

Patent•10.1121/1.427004

Suat Yeldener, +1 more

- 13 Sep 1995

- Journal of the Acoustical Society of Ame...

- Vol. 105, Iss. 2, p. 586

151

TL;DR: A modular system and method are provided for encoding and decoding speech signals using voicing probability determination; the system can also be used to generate a variety of voice effects.

Abstract: A modular system and method are provided for encoding and decoding speech signals using voicing probability determination. The continuous input speech is divided into time segments of a predetermined length. For each segment, the encoder computes the signal pitch and a parameter related to the relative content of voiced and unvoiced portions in the signal spectrum; this parameter is expressed as a ratio Pv and defined as a voicing probability. The voiced portion of the signal spectrum, as determined by Pv, is encoded using a set of harmonically related amplitudes corresponding to the estimated pitch. The unvoiced portion of the signal is processed in a separate branch that uses a modified linear predictive coding algorithm. Parameters representing the voiced and unvoiced portions of a speech segment are combined in data packets for transmission. In the decoder, speech is synthesized in reverse order from the transmitted parameters representing the voiced and unvoiced portions. Boundary conditions between voiced and unvoiced segments are established to ensure amplitude and phase continuity for improved output speech quality. A perceptually smooth transition between frames is ensured by using an overlap-and-add method of synthesis. Also disclosed is the use of the system to generate a variety of voice effects.
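The framing and overlap-and-add steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the periodic Hann window, the 50% frame overlap, and the reading of Pv as the fraction of the band below Nyquist that is treated as voiced are all assumptions made here for illustration, and the pitch estimation, harmonic amplitude coding, and modified LPC branch are omitted.

```python
import math


def split_frames(signal, frame_len, hop):
    """Cut a list of samples into fixed-length frames starting every `hop` samples.

    Frames that run past the end of the signal are zero-padded.
    """
    frames = []
    for start in range(0, len(signal), hop):
        frame = signal[start:start + frame_len]
        frames.append(frame + [0.0] * (frame_len - len(frame)))
    return frames


def hann_window(frame_len):
    """Periodic Hann window; copies shifted by frame_len / 2 sum to 1."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / frame_len)
            for n in range(frame_len)]


def overlap_add(frames, frame_len, hop, window):
    """Window each frame and add it into the output at its original offset."""
    out = [0.0] * ((len(frames) - 1) * hop + frame_len)
    for k, frame in enumerate(frames):
        for n in range(frame_len):
            out[k * hop + n] += window[n] * frame[n]
    return out


def voiced_harmonics(pitch_hz, pv, sample_rate):
    """Harmonic frequencies below an assumed cutoff of pv * Nyquist.

    Assumption: Pv marks the share of the band [0, sample_rate / 2] that is
    treated as voiced; the remainder would go to the unvoiced branch.
    """
    cutoff = pv * sample_rate / 2.0
    count = int(cutoff // pitch_hz)
    return [k * pitch_hz for k in range(1, count + 1)]


if __name__ == "__main__":
    signal = [math.sin(0.1 * i) for i in range(64)]
    frames = split_frames(signal, frame_len=16, hop=8)
    rebuilt = overlap_add(frames, 16, 8, hann_window(16))
    # Samples after the first half frame are covered by two windows summing to 1.
    print(max(abs(rebuilt[i] - signal[i]) for i in range(8, 64)))
    print(len(voiced_harmonics(100.0, 0.5, 8000)))
```

In this sketch the frames are passed through unchanged, so the overlap-add output matches the input everywhere except the first half frame, which only one window covers. In a real codec each frame would be re-synthesized from its decoded parameters, and the overlapping windows would cross-fade between consecutive synthesized frames.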

Citations

Patent

Headset terminal with speech functionality

James Wahl, +7 more

- 06 Feb 2006

TL;DR: In this article, a rotatable microphone boom assembly includes a headband assembly, an earcup assembly and a power source assembly, which are mounted on opposite sides of a rotation axis to maintain a consistent orientation on the boom assembly with respect to a user.

331

Patent

Scalable and embedded codec for speech and audio signals

Joseph Gerard Aguilar, +8 more

- 10 Aug 2007

TL;DR: A system and method for processing audio and speech signals are disclosed that provide compatibility across a range of communication devices operating at different sampling frequencies and/or bit rates.

219

Patent•10.1121/1.2409451

Enhancing speech intelligibility using variable-rate time-scale modification

Nicola Chong-White, +1 more

- 09 Jan 2002

- Journal of the Acoustical Society of Ame...

TL;DR: In this paper, the saliency of initial consonants was improved by spectral enhancement and variable-rate time-scaling procedures, and emphasis was transferred from the dominating vowel to the preceding consonant through adaptation of the phoneme timing structure.

189

Patent

Method and apparatus for hybrid coding of speech at 4kbps

Allen Gersho, +3 more

- 28 Aug 1998

TL;DR: A method and apparatus are described for encoding speech for communication to a decoder for reproduction, in which the speech signal is classified into steady-state voiced (harmonic), stationary unvoiced, and "transitory" or "transition" speech.

153

Patent

Digital audio signal coding using a CELP coder and a transform coder

Gilad Cohen, +4 more

- 04 Mar 1998

TL;DR: A method is described for adaptively switching between a transform audio coder and a CELP coder; it exploits the superior performance of CELP coders for speech signals while retaining the benefits of a transform coder for other audio signals.

148

References

Journal Article•10.1109/78.80763

Super resolution pitch determination of speech signals

Y. Medan, +2 more

- 01 Jan 1991

- IEEE Transactions on Signal Processing

TL;DR: Based on a new similarity model for the voice excitation process, a novel pitch determination procedure is derived that has infinite (super) resolution, better accuracy than the difference limen for F0, robustness to noise, reliability, and modest computational complexity.

240

Patent•10.1121/1.411396

Voiced/unvoiced estimation of an acoustic signal

John C. Hardwick, +1 more

- 21 Nov 1991

- Journal of the Acoustical Society of Ame...

TL;DR: In this paper, the pitch estimation method is improved by making the decision dependent on the energy of the current segment relative to the energy of recent prior segments; if the relative energy is low, the current segment favors an unvoiced decision; if high, it favors a voiced decision.

221

Patent•10.1121/1.411887

Speech transformation system

Michael Savic, +2 more

- 31 Aug 1993

- Journal of the Acoustical Society of Ame...

TL;DR: A high-quality voice transformation system and method operates in a training mode to store voice signal characteristics representing target and source voices. In a real-time transformation mode, a signal representing source speech is then segmented into overlapping segments and analyzed to separate the excitation spectrum from the tone quality spectrum.

219

Patent•10.1121/1.410046

Methods for speech transmission

John C. Hardwick, +1 more

- 03 May 1995

- Journal of the Acoustical Society of Ame...

TL;DR: The quantized parameter bits are grouped into several categories according to their sensitivity to bit errors, and the ratio between the actual spectral envelope and the smoothed spectral envelope is used to enhance the spectral envelope.

217

Patent•10.1121/1.400432

Processing of acoustic waveforms

R.J. McAulay, +1 more

- 14 Mar 1986

- Journal of the Acoustical Society of Ame...

TL;DR: In this article, a sinusoidal model for acoustic waveforms is applied to develop a new analysis/synthesis technique which characterizes a waveform by the amplitudes, frequencies, and phases of component sine waves.

154