Multi-Task Minimum Error Rate Training for SMT

doi:10.2478/V10108-011-0015-0

Open AccessJournal Article10.2478/V10108-011-0015-0

Multi-Task Minimum Error Rate Training for SMT

Patrick Simianer, +2 more

- 01 Oct 2011

- The Prague Bulletin of Mathematical Ling...

- Vol. 96, Iss: 2011, pp 99-108

6

TL;DR: The authors' experiments show statistically significant gains over task-specific training by techniques that model commonalities through shared parameters, however, more finegrained combinations of shared parameters with task- specific ones could not be brought to bear on models with a small number of dense features.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Book Chapter•10.1007/978-3-642-31274-8_2

Analyzing parallelism and domain similarities in the MAREC patent corpus

Katharina Wäschle, +1 more

- 02 Jul 2012

TL;DR: A twofold approach for extracting parallel data from all patent document sections from a large multilingual patent corpus and a descriptive analysis of its subdomains to enable its use in domain-oriented translation, e.g. when applying multi-task learning.

...read moreread less

30

•Proceedings Article

One System, Many Domains: Open-Domain Statistical Machine Translation via Feature Augmentation

Jonathan H. Clark, +2 more

- 01 Jan 2012

TL;DR: A simple technique for incorporating domain information into a statistical machine translation system that significantly improves translation quality when test data comes from multiple domains is introduced.

...read moreread less

24

•Proceedings Article

Structural and Topical Dimensions in Multi-Task Patent Translation

Katharina Waeschle, +1 more

- 23 Apr 2012

TL;DR: This paper analyzes patents along the orthogonal dimensions of topic and textual structure, and views different patent classes and different patent text sections, as separate translation tasks, and investigates the influence of such tasks on machine translation performance.

...read moreread less

21

•Dissertation•10.11588/HEIDOK.00025488

Preference Learning for Machine Translation

Patrick Simianer

- 01 Jan 2018

TL;DR: Algorithms that can learn from very large amounts of data by exploiting pairwise preferences defined over competing translations are developed, which can be used to make a machine translation system robust to arbitrary texts from varied sources, but also enable it to learn effectively to adapt to new domains of data.

...read moreread less

6

•Journal Article•10.2478/V10108-011-0008-Z

An attractive game with the document: (im)possible?

Barbora Hladká, +2 more

- 01 Oct 2011

- The Prague Bulletin of Mathematical Ling...

TL;DR: The notion of crowdsourcing is reviewed, namely it is turned to crowdsourcing projects that manipulate textual data and a game on coreference, PlayCoref, and games with words and white spaces in the sentence are introduced.

...read moreread less

4

References

•Proceedings Article•10.3115/1073083.1073135

Bleu: a Method for Automatic Evaluation of Machine Translation

Kishore Papineni, +3 more

- 06 Jul 2002

TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.

...read moreread less

28.9K

•Proceedings Article•10.3115/1075096.1075117

Minimum Error Rate Training in Statistical Machine Translation

Franz Josef Och

- 07 Jul 2003

TL;DR: It is shown that significantly better results can often be obtained if the final evaluation criterion is taken directly into account as part of the training procedure.

...read moreread less

3.4K

Proceedings Article•10.1145/1014052.1014067

Regularized multi--task learning

Theodoros Evgeniou, +1 more

- 22 Aug 2004

TL;DR: An approach to multi--task learning based on the minimization of regularization functionals similar to existing ones, such as the one for Support Vector Machines, that have been successfully used in the past for single-- task learning is presented.

...read moreread less

1.9K

•Proceedings Article

Frustratingly Easy Domain Adaptation

Hal Daumé

- 01 Jun 2007

TL;DR: This work describes an approach to domain adaptation that is appropriate exactly in the case when one has enough “target” data to do slightly better than just using only “source’ data.

...read moreread less

1.7K

•Proceedings Article

Parallelized Stochastic Gradient Descent

Martin Zinkevich, +3 more

- 06 Dec 2010

TL;DR: This paper presents the first parallel stochastic gradient descent algorithm including a detailed analysis and experimental evidence and introduces a novel proof technique — contractive mappings to quantify the speed of convergence of parameter distributions to their asymptotic limits.

...read moreread less

1.5K