Patent, DOI 10.1121/1.427004
Speech coding system and method using voicing probability determination
151
TL;DR: A modular system and method are provided for encoding and decoding speech signals using voicing probability determination; the system can also be used to generate a variety of voice effects.
Abstract: A modular system and method is provided for encoding and decoding of speech signals using voicing probability determination. The continuous input speech is divided into time segments of a predetermined length. For each segment the encoder of the system computes the signal pitch and a parameter which is related to the relative content of voiced and unvoiced portions in the spectrum of the signal, which is expressed as a ratio Pv, defined as a voicing probability. The voiced portion of the signal spectrum, as determined by the parameter Pv, is encoded using a set of harmonically related amplitudes corresponding to the estimated pitch. The unvoiced portion of the signal is processed in a separate processing branch which uses a modified linear predictive coding algorithm. Parameters representing both the voiced and the unvoiced portions of a speech segment are combined in data packets for transmission. In the decoder, speech is synthesized from the transmitted parameters representing voiced and unvoiced portions of the speech in a reverse order. Boundary conditions between voiced and unvoiced segments are established to ensure amplitude and phase continuity for improved output speech quality. Perceptually smooth transition between frames is ensured by using an overlap and add method of synthesis. Also disclosed is the use of the system in the generation of a variety of voice effects.
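The abstract's pipeline can be illustrated with a minimal sketch, which is not the patented implementation. It assumes that Pv acts as a fraction of the Nyquist band below which pitch harmonics are treated as voiced (the abstract only calls Pv a ratio), and that frames overlap by 50% under a periodic Hann window. The function names `voiced_harmonics`, `split_frames`, and `overlap_add` are hypothetical.

```python
import math


def voiced_harmonics(pitch_hz, pv, sample_rate):
    """Return the pitch harmonics that fall in the assumed voiced band.

    Assumption: the voiced band is [0, pv * Nyquist]; the abstract does not
    spell out how Pv maps onto the spectrum.
    """
    cutoff_hz = pv * sample_rate / 2.0
    count = int(cutoff_hz // pitch_hz)
    return [pitch_hz * k for k in range(1, count + 1)]


def hann(n):
    """Periodic Hann window: copies shifted by n/2 sum to exactly 1."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def split_frames(signal, frame_len):
    """Divide the input into fixed-length segments with 50% overlap."""
    hop = frame_len // 2
    last_start = len(signal) - frame_len
    return [signal[s:s + frame_len] for s in range(0, last_start + 1, hop)]


def overlap_add(frames, frame_len):
    """Window each frame and sum the overlapping halves into one output."""
    hop = frame_len // 2
    win = hann(frame_len)
    out = [0.0] * (hop * (len(frames) - 1) + frame_len)
    for k, frame in enumerate(frames):
        for i, x in enumerate(frame):
            out[k * hop + i] += win[i] * x
    return out


if __name__ == "__main__":
    # With pv = 0.5 at 8 kHz, harmonics of a 100 Hz pitch up to 2 kHz are voiced.
    print(len(voiced_harmonics(100.0, 0.5, 8000)))
    # Frames that are left unmodified are rebuilt exactly away from the edges.
    sig = [math.sin(2.0 * math.pi * 5 * i / 256) for i in range(256)]
    out = overlap_add(split_frames(sig, 64), 64)
    print(max(abs(out[i] - sig[i]) for i in range(32, 224)))
```

In a real codec each frame would be replaced by speech resynthesized from its decoded parameters before the overlap-add step; because the two overlapping window halves sum to one, the cross-fade between neighboring frames introduces no gain change at frame boundaries.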
Citations
Patent
Headset terminal with speech functionality
James Wahl, Andres Viduya, Ben Kessing, Roger Graham Byford, James Randall Logan, Dominic Tooze, Philip Shade, Graham Lacy
- 06 Feb 2006
TL;DR: In this article, a rotatable microphone boom assembly includes a headband assembly, an earcup assembly and a power source assembly, which are mounted on opposite sides of a rotation axis to maintain a consistent orientation on the boom assembly with respect to a user.
331
Patent
Scalable and embedded codec for speech and audio signals
Joseph Gerard Aguilar, David A. Campana, Juin-Hwey Chen, Robert B. Dunn, Robert J. McAulay, Xiaoquin Sun, Wei Wang, Craig Robert Watkins, Robert W. Zopf
- 10 Aug 2007
TL;DR: In this article, a system and method for processing audio and speech signals are disclosed, which provide compatibility over a range of communication devices operating at different sampling frequencies and/or bit rates.
219
Enhancing speech intelligibility using variable-rate time-scale modification
TL;DR: In this paper, the saliency of initial consonants was improved by spectral enhancements and variable rate time-scaling procedures, and emphasis was transferred from the dominating vowel to the preceding consonant through adaptation of the phoneme timing structure.
189
Patent
Method and apparatus for hybrid coding of speech at 4kbps
Allen Gersho, Eyal Shlomot, Vladimir Cuperman, Chunyan Li
- 28 Aug 1998
TL;DR: In this article, a method and apparatus are described for encoding speech for communication to a decoder for reproduction, where the speech signal is classified into steady-state voiced (harmonic), stationary unvoiced, and "transitory" or "transition" speech.
153
Patent
Digital audio signal coding using a CELP coder and a transform coder
Gilad Cohen, Yossef Cohen, Doron Hoffman, Hagai Krupnik, Aharon Satt
- 04 Mar 1998
TL;DR: In this paper, a method is described for adaptively switching between a transform audio coder and a CELP coder, which makes use of the superior performance of CELP coders for speech signal coding while retaining the benefits of a transform coder for other audio signals.
148
References
Super resolution pitch determination of speech signals
TL;DR: Based on a new similarity model for the voice excitation process, a novel pitch determination procedure is derived that has infinite (super) resolution, better accuracy than the difference limen for F0, robustness to noise, reliability, and modest computational complexity.
240
Voiced/unvoiced estimation of an acoustic signal
John C. Hardwick, J.S. Lim
TL;DR: In this paper, the pitch estimation method is improved by making the voiced/unvoiced decision dependent on the energy of the current segment relative to the energy of recent prior segments; if the relative energy is low, the current segment favors an unvoiced decision; if high, it favors a voiced decision.
221
Speech transformation system
TL;DR: In this paper, a high quality voice transformation system and method operates during a training mode to store voice signal characteristics representing target and source voices; then, during a real-time transformation mode, a signal representing source speech is segmented into overlapping segments and analyzed to separate the excitation spectrum from the tone quality spectrum.
219
Methods for speech transmission
John C. Hardwick, J.S. Lim
TL;DR: The quantized parameter bits are grouped into several categories according to their sensitivity to bit errors, and the ratio between the actual spectral envelope and the smoothed spectral envelope is used to enhance the spectral envelope.
217
Processing of acoustic waveforms
R.J. McAulay, Thomas F. Quatieri
TL;DR: In this article, a sinusoidal model for acoustic waveforms is applied to develop a new analysis/synthesis technique which characterizes a waveform by the amplitudes, frequencies, and phases of component sine waves.
154