Speaker adaptation using maximum likelihood model interpolation

doi:10.1109/ICASSP.1999.759777

Proceedings Article10.1109/ICASSP.1999.759777

Speaker adaptation using maximum likelihood model interpolation

Zuoying Wang, +1 more

- 15 Mar 1999

- Vol. 2, pp 753-756

9

TL;DR: Experiments show that 3 adaptation sentences can give a significant performance improvement and as the number of SD models increases, further improvement can be obtained.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1006/CSLA.2001.0168

Maximum likelihood stochastic transformation adaptation for medium and small data sets

Constantinos Boulis, +2 more

- 01 Jul 2001

- Computer Speech & Language

TL;DR: In this article, the authors proposed the maximum likelihood stochastic transformation (MLST) for speaker adaptation, which estimates multiple linear transforms per class of models and a transform weights vector specific to each component (Gaussians in our case).

...read moreread less

8

Proceedings Article•10.1109/ICASSP.2006.1660170

Multigrained Model Adaptation With Map and Reference Speaker Weighting For Text Independent Speaker Verification

Xianyu Zhao, +4 more

- 14 May 2006

TL;DR: A new speaker adaptation method which combines MAP and reference speaker weighting (RSW) adaptation in a hierarchical, multigrained mode is presented, which enables all model components to be updated in a way that strikes a good balance between model complexity and available data.

...read moreread less

4

•Proceedings Article

Using spatial correlation information in speech recognition.

Peng Yu, +1 more

- 01 Jan 2001

TL;DR: A new method of using spatial information in speech recognition is proposed by using linear equation to subscribe spatial correlation, calculating equation coefficients by K-L transformation, and developing a new training algorithm with the linear constraints.

...read moreread less

2

Patent

Method of creating an acoustic model for a speech recognition system

Bartosik Heinrich

- 01 Jul 2004

TL;DR: In this article, a weighted linear combination of the initial models (Gi) is created using previously determined weight factors (gi) that are specific to the acoustic model (U) for a speaker as the user of a voice recognition system.

...read moreread less

2

•Proceedings Article

Linguistic tree based maximum likelihood model interpolation.

Liu Feng, +3 more

- 01 Jan 1999

TL;DR: A speaker adaptation method is presented which computes the speaker adapted model by a weighted sum of a set of speaker dependent models which shows that with as little as 1~3 sentences a significant performance improvement is obtained.

...read moreread less

References

Journal Article•10.1006/CSLA.1995.0010

Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models

C. J. Leggetter, +1 more

- 01 Apr 1995

- Computer Speech & Language

TL;DR: An important feature of the method is that arbitrary adaptation data can be used—no special enrolment sentences are needed and that as more data is used the adaptation performance improves.

...read moreread less

2.5K

Journal Article•10.1109/89.279278

Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains

Jean-Luc Gauvain, +1 more

- 01 Apr 1994

- IEEE Transactions on Speech and Audio Pr...

TL;DR: A framework for maximum a posteriori (MAP) estimation of hidden Markov models (HMM) is presented, and Bayesian learning is shown to serve as a unified approach for a wide range of speech recognition applications.

...read moreread less

2.5K

Journal Article•10.1109/79.595570

Voice dictation of Mandarin Chinese

Lin-Shan Lee

- 01 Jul 1997

TL;DR: The characteristic structure of Mandarin Chinese is analyzed and the primary focus is on the key technology regarding the problem, including the basic architecture for Mandarin dictation, acoustic modeling/ processing, and linguistic modeling/processing.

...read moreread less

79

•Proceedings Article