Open AccessPosted Content
Learning Autoencoders with Relational Regularization
TL;DR: This article proposed a relational regularized autoencoder (RAE) for learning autoencoders of data distributions, which penalizes the fused Gromov-Wasserstein distance between the latent prior and its corresponding posterior, allowing one to flexibly learn a structured prior distribution associated with the generative model.
read more
Abstract: A new algorithmic framework is proposed for learning autoencoders of data distributions. We minimize the discrepancy between the model and target distributions, with a \emph{relational regularization} on the learnable latent prior. This regularization penalizes the fused Gromov-Wasserstein (FGW) distance between the latent prior and its corresponding posterior, allowing one to flexibly learn a structured prior distribution associated with the generative model. Moreover, it helps co-training of multiple autoencoders even if they have heterogeneous architectures and incomparable latent spaces. We implement the framework with two scalable algorithms, making it applicable for both probabilistic and deterministic autoencoders. Our relational regularized autoencoder (RAE) outperforms existing methods, $e.g.$, the variational autoencoder, Wasserstein autoencoder, and their variants, on generating images. Additionally, our relational co-training strategy for autoencoders achieves encouraging results in both synthesis and real-world multi-view learning tasks. The code is at this https URL Relational-AutoEncoders.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
The Unbalanced Gromov Wasserstein Distance: Conic Formulation and Relaxation
TL;DR: Two Unbalanced Gromov-Wasserstein formulations are introduced: a distance and a more computationally tractable upper-bounding relaxation that allow the comparison of metric spaces equipped with arbitrary positive measures up to isometries.
51
Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder
Tal Daniel,Aviv Tamar +1 more
- 01 Jun 2021
TL;DR: Soft-IntroVAE as discussed by the authors replaces the hinge-loss terms with a smooth exponential loss on generated samples, which significantly improves training stability and also enables theoretical analysis of the complete algorithm.
•Posted Content
Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety.
Sebastian Houben,Stephanie Abrecht,Maram Akila,Andreas Bär,Felix Brockherde,Patrick Feifel,Tim Fingscheidt,Sujan Sai Gannamaneni,Seyed Eghbal Ghobadi,Ahmed Hammam,Anselm Haselhoff,Felix Hauser,Christian Heinzemann,Marco Hoffmann,Nikhil Kapoor,Falk Kappel,Marvin Klingner,Jan Kronenberger,Fabian Küppers,Jonas Löhdefink,Michael Mlynarski,Michael Mock,Firas Mualla,Svetlana Pavlitskaya,Maximilian Poretschkin,Alexander Pohl,Varun Ravi Kumar,Julia Rosenzweig,Matthias Rottmann,Stefan Rüping,Timo Sämann,Jan David Schneider,Elena Schulz,Gesina Schwalbe,Joachim Sicking,Toshika Srivastava,Serin Varghese,Michael Weber,Sebastian J. Wirkert,Tim Wirtz,Matthias Woehrle +40 more
TL;DR: In this article, the authors provide a structured and broad overview of the state-of-the-art techniques aiming to address the model-inherent shortcomings of deep neural networks (DNNs).
37
ADD: Frequency Attention and Multi-View Based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images
TL;DR: In this paper , the authors apply frequency domain learning and optimal transport theory in knowledge distillation (KD) to specifically improve the detection of low-quality compressed deepfake images, and propose the Attention-based Deepfake detection Distiller (ADD), which consists of two novel distillations: 1) frequency attention distillation that effectively retrieves the removed high-frequency components in the student network, and 2) multi-view attention dis-distillation that creates multiple attention vectors by slicing the teacher's and student's tensors under different views to transfer the teacher tensor distribution to the student more efficiently.
28
EfficientTDNN: Efficient Architecture Search for Speaker Recognition
01 Jan 2022
TL;DR: In this article , the authors proposed an efficient architecture search framework consisting of a time-delay neural network-based supernet and a TDNN-NAS algorithm, where the proposed supernet introduces temporal convolution of different ranges of receptive field and feature aggregation of various resolutions from different layers to TDNN.
References
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
Gradient-based learning applied to document recognition
Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner +6 more
- 01 Jan 1998
TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
53.5K
Deep Learning Face Attributes in the Wild
Ziwei Liu,Ping Luo,Xiaogang Wang,Xiaoou Tang +3 more
- 07 Dec 2015
TL;DR: A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.
•Posted Content
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
TL;DR: This work introduces a class of CNNs called deep convolutional generative adversarial networks (DCGANs), that have certain architectural constraints, and demonstrates that they are a strong candidate for unsupervised learning.
6.8K
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories
Li Fei-Fei,Rob Fergus,Pietro Perona +2 more
- 27 Jun 2004
TL;DR: The incremental algorithm is compared experimentally to an earlier batch Bayesian algorithm, as well as to one based on maximum-likelihood, which have comparable classification performance on small training sets, but incremental learning is significantly faster, making real-time learning feasible.
Related Papers (5)
Hanna Kamyshanska,Roland Memisevic +1 more
- 16 Jun 2013
Yifei Zhang,Hao Zhu +1 more
- 04 May 2020
Alireza Makhzani,Brendan J. Frey +1 more
- 01 Jan 2017