Open AccessPosted Content
Conditional Random Field Autoencoders for Unsupervised Structured Prediction
TL;DR: Competitive results with instantiations of the framework for unsupervised learning of structured predictors with overlapping, global features are shown, and it is shown that training the proposed model can be substantially more efficient than a comparable feature-rich baseline.
read more
Abstract: We introduce a framework for unsupervised learning of structured predictors with overlapping, global features. Each input's latent representation is predicted conditional on the observable data using a feature-rich conditional random field. Then a reconstruction of the input is (re)generated, conditional on the latent structure, using models for which maximum likelihood estimation has a closed-form. Our autoencoder formulation enables efficient learning without making unrealistic independence assumptions or restricting the kinds of features that can be used. We illustrate insightful connections to traditional autoencoders, posterior regularization and multi-view learning. We show competitive results with instantiations of the model for two canonical NLP tasks: part-of-speech induction and bitext word alignment, and show that training our model can be substantially more efficient than comparable feature-rich baselines.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Grounding of Textual Phrases in Images by Reconstruction
Anna Rohrbach,Marcus Rohrbach,Marcus Rohrbach,Ronghang Hu,Trevor Darrell,Bernt Schiele +5 more
- 08 Oct 2016
TL;DR: A novel approach which learns grounding by reconstructing a given phrase using an attention mechanism, which can be either latent or optimized directly, and demonstrates the effectiveness on the Flickr 30k Entities and ReferItGame datasets.
Grounding of Textual Phrases in Images by Reconstruction
TL;DR: In this article, an attention mechanism is used to reconstruct a given phrase by reconstructing the given phrase using an attention loss, which can be either latent or optimized directly for ground-truth spatial localization.
477
Semi-Supervised Learning for Neural Machine Translation
Yong Cheng,Wei Xu,Zhongjun He,Wei He,Hua Wu,Maosong Sun,Yang Liu +6 more
- 15 Jun 2016
TL;DR: This work proposes a semi-supervised approach for training NMT models on the concatenation of labeled and unlabeled monolingual corpora data, in which the source- to-target and target-to-source translation models serve as the encoder and decoder, respectively.
Unsupervised Recurrent Neural Network Grammars
Yoon Kim,Alexander M. Rush,Lei Yu,Adhiguna Kuncoro,Chris Dyer,Gábor Melis +5 more
- 07 Apr 2019
TL;DR: An inference network parameterized as a neural CRF constituency parser is developed to maximize the evidence lower bound and apply amortized variational inference to unsupervised learning of RNNGs.
155
Deep Neural Networks with Massive Learned Knowledge
Zhiting Hu,Zichao Yang,Ruslan Salakhutdinov,Eric P. Xing +3 more
- 01 Nov 2016
TL;DR: A general framework is developed that enables learning knowledge and its confidence jointly with the DNNs, so that the vast amount of fuzzy knowledge can be incorporated and automatically optimized with little manual efforts.
115
References
Bleu: a Method for Automatic Evaluation of Machine Translation
Kishore Papineni,Salim Roukos,Todd Ward,Wei-Jing Zhu +3 more
- 06 Jul 2002
TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.
•Proceedings Article
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
John Lafferty,Andrew McCallum,Fernando Pereira +2 more
- 28 Jun 2001
TL;DR: This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.
Extracting and composing robust features with denoising autoencoders
Pascal Vincent,Hugo Larochelle,Yoshua Bengio,Pierre-Antoine Manzagol +3 more
- 05 Jul 2008
TL;DR: This work introduces and motivate a new training principle for unsupervised learning of a representation based on the idea of making the learned representations robust to partial corruption of the input pattern.