Model complexity control and compression using discriminative growth functions

doi:10.1109/ICASSP.2004.1326106

Open Access • Proceedings Article • 10.1109/ICASSP.2004.1326106

Xunying Liu, +1 more

- 17 May 2004

- Vol. 1, pp 797-800

15

TL;DR: In this paper, further experiments are carried out using a recently proposed criterion based on marginalizing a maximum mutual information (MMI) growth function for model compression, showing a reduction in word error rate.

Citations

•Proceedings Article•10.1109/ICASSP.2006.1660239

Adaptation of Hybrid ANN/HMM Models Using Linear Hidden Transformations and Conservative Training

Roberto Gemello, +4 more

- 14 May 2006

TL;DR: A new solution, called conservative training, is proposed that compensates for the lack of adaptation samples in certain classes; it outperforms the use of transformations in the feature space and yields even better results when combined with linear input transformations.

68

•Proceedings Article•10.1109/IJCNN.2006.246618

Adaptation of Artificial Neural Networks Avoiding Catastrophic Forgetting

D. Albesano, +4 more

- 30 Oct 2006

TL;DR: The results show that the combination of the proposed approaches mitigates the catastrophic forgetting effects, and always outperforms the use of the classical transformations in the feature space.

36

•Journal Article•10.1109/TASL.2006.889804

Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions

Xunying Liu, +1 more

- 01 May 2007

- IEEE Transactions on Audio, Speech, and ...

TL;DR: Experimental results showed that marginalized discriminative growth functions outperform manually tuned systems and conventional complexity control techniques, such as the Bayesian information criterion (BIC), in terms of WER.

19

•Proceedings Article•10.1109/ASRU.2003.1318400

Automatic model complexity control using marginalized discriminative growth functions

Xunying Liu, +1 more

- 16 Sep 2003

TL;DR: Experimental results on a spontaneous speech recognition task show that marginalizing the MMI growth function outperforms data likelihood and standard Bayesian schemes in terms of both recognition performance ranking error and word error.

15

Towards Efficient and Robust Automatic Speech Recognition: Decoding Techniques and Discriminative Training

Janne Pylkkönen

- 01 Jan 2013

TL;DR: This thesis points out theoretical connections of the Baum-Welch algorithm to general constrained optimization and proposes new control methods for the algorithm, which are shown to improve the robustness of the acoustic models in several large vocabulary speech recognition tasks.

12

References

•Journal Article•10.1214/AOS/1176344136

Estimating the Dimension of a Model

Gideon Schwarz

- 01 Mar 1978

- Annals of Statistics

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

45K

Estimating the dimension of a model

Gideon Schwarz

- 01 Jan 2005

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

40.6K

•Journal Article•10.1109/18.720554

The minimum description length principle in coding and modeling

Andrew R. Barron, +2 more

- 01 Oct 1998

- IEEE Transactions on Information Theory

TL;DR: The normalized maximized likelihood, mixture, and predictive codings are each shown to achieve the stochastic complexity to within asymptotically vanishing terms.

1.2K

•Journal Article•10.1006/CSLA.2001.0182

Large scale discriminative training of hidden Markov models for speech recognition

Philip C. Woodland, +1 more

- 01 Jan 2002

- Computer Speech & Language

TL;DR: It is shown that HMMs trained with MMIE benefit as much as MLE-trained HMMs from applying model adaptation using maximum likelihood linear regression (MLLR), which has allowed the straightforward integration of MMIE-trained HMMs into complex multi-pass systems for transcription of conversational telephone speech.

396

•Proceedings Article

A novel loss function for the overall risk criterion based discriminative training of HMM models.

Janez Kaiser, +2 more

- 01 Jan 2000

TL;DR: Using HMMs trained with the proposed method, a decrease in word recognition error rate of up to 17.3% was achieved for the phoneme recognition task on the TIMIT database.

101

Related Papers (5)

Estimating the Dimension of a Model

Automatic complexity control for HLDA systems

Decision tree state tying based on penalized Bayesian information criterion

Maximum likelihood linear transformations for HMM-based speech recognition

Maximum likelihood from incomplete data via the EM algorithm