Bitext word alignment

Topic Tools

Papers

Journal Article•

Posterior Regularization for Structured Latent Variable Models

[...]

Kuzman Ganchev¹, João Graça², Jennifer Gillenwater², Ben Taskar¹•Institutions (2)

01 Mar 2010-Journal of Machine Learning Research

TL;DR: This work presents an efficient algorithm for learning with posterior regularization and illustrates its versatility on a diverse set of structural constraints such as bijectivity, symmetry and group sparsity in several large scale experiments, including multi-view learning, cross-lingual dependency grammar induction, unsupervised part-of-speech induction, and bitext word alignment.

...read moreread less

Abstract: We present posterior regularization, a probabilistic framework for structured, weakly supervised learning. Our framework efficiently incorporates indirect supervision via constraints on posterior distributions of probabilistic models with latent variables. Posterior regularization separates model complexity from the complexity of structural constraints it is desired to satisfy. By directly imposing decomposable regularization on the posterior moments of latent variables during learning, we retain the computational efficiency of the unconstrained model while ensuring desired constraints hold in expectation. We present an efficient algorithm for learning with posterior regularization and illustrate its versatility on a diverse set of structural constraints such as bijectivity, symmetry and group sparsity in several large scale experiments, including multi-view learning, cross-lingual dependency grammar induction, unsupervised part-of-speech induction, and bitext word alignment.

...read moreread less

597 citations

Proceedings Article•10.3115/1073445.1073464•

A weighted finite state transducer implementation of the alignment template model for statistical machine translation

[...]

Shankar Kumar¹, William Byrne¹•Institutions (1)

Johns Hopkins University¹

27 May 2003

TL;DR: A derivation of the alignment template model for statistical machine translation and an implementation of the model using weighted finite state transducers are presented, showing that bitext word alignment and translation under the model can be performed with standard FSM operations involving these transducers.

...read moreread less

Abstract: We present a derivation of the alignment template model for statistical machine translation and an implementation of the model using weighted finite state transducers. The approach we describe allows us to implement each constituent distribution of the model as a weighted finite state transducer or acceptor. We show that bitext word alignment and translation under the model can be performed with standard FSM operations involving these transducers. One of the benefits of using this framework is that it obviates the need to develop specialized search procedures, even for the generation of lattices or N-Best lists of bitext word alignments and translation hypotheses. We evaluate the implementation of the model on the French-to-English Hansards task and report alignment and translation performance.

...read moreread less

102 citations

Journal Article•10.1017/S1351324905003815•

A weighted finite state transducer translation template model for statistical machine translation

[...]

Shankar Kumar¹, Yonggang Deng¹, William Byrne¹•Institutions (1)

Johns Hopkins University¹

01 Mar 2006-Natural Language Engineering

TL;DR: It is shown that bitext word alignment and translation under the model can be performed with standard finite state machine operations involving these transducers, and the contribution of each of the model components to different aspects of alignment andtranslation performance is identified.

...read moreread less

Abstract: We present a Weighted Finite State Transducer Translation Template Model for statistical machine translation. This is a source-channel model of translation inspired by the Alignment Template translation model. The model attempts to overcome the deficiencies of word-to-word translation models by considering phrases rather than words as units of translation. The approach we describe allows us to implement each constituent distribution of the model as a weighted finite state transducer or acceptor. We show that bitext word alignment and translation under the model can be performed with standard finite state machine operations involving these transducers. One of the benefits of using this framework is that it avoids the need to develop specialized search procedures, even for the generation of lattices or N-Best lists of bitext word alignments and translation hypotheses. We report and analyze bitext word alignment and translation performance on the Hansards French-English task and the FBIS Chinese-English task under the Alignment Error Rate, BLEU, NIST and Word Error-Rate metrics. These experiments identify the contribution of each of the model components to different aspects of alignment and translation performance. We finally discuss translation performance with large bitext training sets on the NIST 2004 Chinese-English and Arabic-English MT tasks.

...read moreread less

78 citations

Posted Content•

Conditional Random Field Autoencoders for Unsupervised Structured Prediction

[...]

Waleed Ammar¹, Chris Dyer¹, Noah A. Smith¹•Institutions (1)

Carnegie Mellon University¹

05 Nov 2014-arXiv: Learning

TL;DR: Competitive results with instantiations of the framework for unsupervised learning of structured predictors with overlapping, global features are shown, and it is shown that training the proposed model can be substantially more efficient than a comparable feature-rich baseline.

...read moreread less

Abstract: We introduce a framework for unsupervised learning of structured predictors with overlapping, global features. Each input's latent representation is predicted conditional on the observable data using a feature-rich conditional random field. Then a reconstruction of the input is (re)generated, conditional on the latent structure, using models for which maximum likelihood estimation has a closed-form. Our autoencoder formulation enables efficient learning without making unrealistic independence assumptions or restricting the kinds of features that can be used. We illustrate insightful connections to traditional autoencoders, posterior regularization and multi-view learning. We show competitive results with instantiations of the model for two canonical NLP tasks: part-of-speech induction and bitext word alignment, and show that training our model can be substantially more efficient than comparable feature-rich baselines.

...read moreread less

73 citations

Proceedings Article•

Conditional Random Field Autoencoders for Unsupervised Structured Prediction

[...]

Waleed Ammar¹, Chris Dyer¹, Noah A. Smith¹•Institutions (1)

Carnegie Mellon University¹

8 Dec 2014

TL;DR: This article introduced a framework for unsupervised learning of structured predictors with overlapping, global features, which enables efficient exact inference without resorting to unrealistic independence assumptions or restricting the kinds of features that can be used.

...read moreread less

Abstract: We introduce a framework for unsupervised learning of structured predictors with overlapping, global features. Each input's latent representation is predicted conditional on the observed data using a feature-rich conditional random field (CRF). Then a reconstruction of the input is (re)generated, conditional on the latent structure, using a generative model which factorizes similarly to the CRF. The autoencoder formulation enables efficient exact inference without resorting to unrealistic independence assumptions or restricting the kinds of features that can be used. We illustrate connections to traditional autoencoders, posterior regularization, and multi-view learning. We then show competitive results with instantiations of the framework for two canonical tasks in natural language processing: part-of-speech induction and bitext word alignment, and show that training the proposed model can be substantially more efficient than a comparable feature-rich baseline.

...read moreread less

53 citations

Topic Tools

Papers

Posterior Regularization for Structured Latent Variable Models

A weighted finite state transducer implementation of the alignment template model for statistical machine translation

A weighted finite state transducer translation template model for statistical machine translation

Conditional Random Field Autoencoders for Unsupervised Structured Prediction

Conditional Random Field Autoencoders for Unsupervised Structured Prediction

Related Topics (5)

Performance Metrics

No. of papers in the topic in previous years
Year	Papers
2014	2
2010	1
2006	1
2005	3
2003	2