Topic

Sinusoidal model

About: Sinusoidal model is a research topic. Over the lifetime, 374 publications have been published within this topic receiving 6489 citations.

...read moreread less

Topic Tools

Find unexplored research gaps

Generate a literature review

Explore related concepts

Papers published on a yearly basis

Papers

Journal Article•10.1109/TASSP.1986.1164910•

Speech analysis/Synthesis based on a sinusoidal representation

[...]

R.J. McAulay¹, Thomas F. Quatieri¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Aug 1986-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: A sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves, which forms the basis for new approaches to the problems of speech transformations including time-scale and pitch-scale modification, and midrate speech coding.

...read moreread less

Abstract: A sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves. These parameters are estimated from the short-time Fourier transform using a simple peak-picking algorithm. Rapid changes in the highly resolved spectral components are tracked using the concept of "birth" and "death" of the underlying sine waves. For a given frequency track a cubic function is used to unwrap and interpolate the phase such that the phase track is maximally smooth. This phase function is applied to a sine-wave generator, which is amplitude modulated and added to the other sine waves to give the final speech output. The resulting synthetic waveform preserves the general waveform shape and is essentially perceptually indistinguishable from the original speech. Furthermore, in the presence of noise the perceptual characteristics of the speech as well as the noise are maintained. In addition, it was found that the representation was sufficiently general that high-quality reproduction was obtained for a larger class of inputs including: two overlapping, superposed speech waveforms; music waveforms; speech in musical backgrounds; and certain marine biologic sounds. Finally, the analysis/synthesis system forms the basis for new approaches to the problems of speech transformations including time-scale and pitch-scale modification, and midrate speech coding [8], [9].

...read moreread less

1,861 citations

Journal Article•10.1109/89.622558•

Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model

[...]

E.B. George, M.J.T. Smith¹•Institutions (1)

Georgia Institute of Technology¹

01 Sep 1997-IEEE Transactions on Speech and Audio Processing

TL;DR: The proposed analysis-by-synthesis/overlap-add (ABS/OLA) system allows for both fixed and time-varying time-, frequency-, and pitch-scale modifications, and computational shortcuts using the FFT algorithm make its implementation feasible using currently available hardware.

...read moreread less

Abstract: Sinusoidal modeling has been successfully applied to a broad range of speech processing problems, and offers advantages over linear predictive modeling and the short-time Fourier transform for speech analysis/synthesis and modification. This paper presents a novel speech analysis/synthesis system based on the combination of an overlap-add sinusoidal model with an analysis-by-synthesis technique to determine the model parameters. It describes this analysis procedure in detail, and introduces an equivalent frequency-domain algorithm that takes advantage of the computational efficiency of the fast Fourier transform (FFT). In addition, a refined overlap-add sinusoidal model capable of shape-invariant speech modification is derived, and a pitch-scale modification algorithm is defined that preserves speech bandwidth and eliminates noise migration effects. Analysis-by-synthesis achieves very high synthetic speech quality by accurately estimating the component frequencies, eliminating sidelobe interference effects, and effectively dealing with nonstationary speech events. The refined overlap-add synthesis model correlates well with analysis-by-synthesis, and modifies speech without objectionable artifacts by explicitly controlling shape invariance and phase coherence. The proposed analysis-by-synthesis/overlap-add (ABS/OLA) system allows for both fixed and time-varying time-, frequency-, and pitch-scale modifications, and computational shortcuts using the FFT algorithm make its implementation feasible using currently available hardware.

...read moreread less

230 citations

Patent•10.1121/1.400432•

Processing of acoustic waveforms

[...]

R.J. McAulay¹, Thomas F. Quatieri¹•Institutions (1)

Massachusetts Institute of Technology¹

14 Mar 1986-Journal of the Acoustical Society of America

TL;DR: In this article, a sinusoidal model for acoustic waveforms is applied to develop a new analysis/synthesis technique which characterizes a waveform by the amplitudes, frequencies, and phases of component sine waves.

...read moreread less

Abstract: A sinusoidal model for acoustic waveforms is applied to develop a new analysis/synthesis technique which characterizes a waveform by the amplitudes, frequencies, and phases of component sine waves. These parameters are estimated from a short-time Fourier transform. Rapid changes in the highly-resolved spectral components are tracked using the concept of "birth" and "death" of the underlying sine waves. The component values are interpolated from one frame to the next to yield a representation that is applied to a sine wave generator. The resulting synthetic waveform preserves the general waveform shape and is perceptually indistinguishable from the original. Furthermore, in the presence of noise the perceptual characteristics of the waveform as well as the noise are maintained. The method and devices disclosed herein are particularly useful in speech coding, time-scale modification, frequency scale modification and pitch modification.

...read moreread less

154 citations

Journal Article•10.1109/89.952491•

Speech enhancement using a constrained iterative sinusoidal model

[...]

Jesper Jensen¹, John H. L. Hansen²•Institutions (2)

Aalborg University¹, University of Colorado Boulder²

01 Oct 2001-IEEE Transactions on Speech and Audio Processing

TL;DR: This paper presents a sinusoidal model based algorithm for enhancement of speech degraded by additive broad-band noise that shows considerable improvement over traditional spectral subtraction and Wiener filtering based schemes.

...read moreread less

Abstract: This paper presents a sinusoidal model based algorithm for enhancement of speech degraded by additive broad-band noise. In order to ensure speech-like characteristics observed in clean speech, smoothness constraints are imposed on the model parameters using a spectral envelope surface (SES) smoothing procedure. Algorithm evaluation is performed using speech signals degraded by additive white Gaussian noise. Distortion as measured by objective speech quality scores showed a 34%-41% reduction over a SNR range of 5-to-20 dB. Objective and subjective evaluations also show considerable improvement over traditional spectral subtraction and Wiener filtering based schemes. Finally, in a subjective AB preference test, where enhanced signals were coded with the G729 codec, the proposed scheme was preferred over the traditional enhancement schemes tested for SNRs in the range of 5 to 20 dB.

...read moreread less

133 citations

Journal Article•10.1109/29.45618•

An approach to co-channel talker interference suppression using a sinusoidal model for speech

[...]

Thomas F. Quatieri¹, R.G. Danisewicz¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 1990-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: Evidence is provided that the sinusoidal analysis/synthesis model with effective parameter estimation techniques offers a promising approach to the problem of cochannel talker-interference suppression over a range of conditions.

...read moreread less

Abstract: The technique fits a sinusoidal model to additive vocal speech segments so that the least-mean-squared error between the model and the summed waveforms is obtained. Enhancement is achieved by synthesizing a waveform from the sine waves attributed to the desired speaker. Least-squares estimation is applied to obtain sine-wave amplitudes and phases of both talkers, based on either a priori sine-wave frequencies or a priori fundamental frequency contours. When the frequencies of the two waveforms are closely spaced, the performance is significantly improved by exploring the time evolution of the sinusoidal parameters across multiple analysis frames. The least-squared-error approach is also extended, under restricted conditions, to estimate fundamental frequency contours of both speakers from the summed waveforms. The results obtained, although limited in their scope, provide evidence that the sinusoidal analysis/synthesis model with effective parameter estimation techniques offers a promising approach to the problem of cochannel talker-interference suppression over a range of conditions. >

...read moreread less

114 citations

...

Expand

Performance Metrics

374

Papers

2,969

Citations

No. of papers in the topic in previous years
Year	Papers
2021	4
2020	7
2019	10
2018	13
2017	7
2016	11

Sinusoidal model

Topic Tools

Papers published on a yearly basis

Papers

Speech analysis/Synthesis based on a sinusoidal representation

Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model

Processing of acoustic waveforms

Speech enhancement using a constrained iterative sinusoidal model

An approach to co-channel talker interference suppression using a sinusoidal model for speech

Related Topics (5)

Performance Metrics