TL;DR: This paper proposes to learn a feature representation which has domain-invariant class conditional distributions P(h(X)|Y), and proposes a conditional invariant representation which can be guaranteed if the class prior P(Y ) does not change across training and test domains.
Abstract: Domain generalization aims to apply knowledge gained from multiple labeled source domains to unseen target domains. The main difficulty comes from the dataset bias: training data and test data have different distributions, and the training set contains heterogeneous samples from different distributions. Let X denote the features, and Y be the class labels. Existing domain generalization methods address the dataset bias problem by learning a domain-invariant representation h(X) that has the same marginal distribution P(h(X)) across multiple source domains. The functional relationship encoded in P(Y|X) is usually assumed to be stable across domains such that P(Y|h(X)) is also invariant. However, it is unclear whether this assumption holds in practical problems. In this paper, we consider the general situation where both P(X) and P(Y|X) can change across all domains. We propose to learn a feature representation which has domain-invariant class conditional distributions P(h(X)|Y). With the conditional invariant representation, the invariance of the joint distribution P(h(X),Y) can be guaranteed if the class prior P(Y) does not change across training and test domains. Extensive experiments on both synthetic and real data demonstrate the effectiveness of the proposed method.
TL;DR: In this paper, a measure of the dispersion of a point from a fitted plane is used to construct a new measure of deviation for regression, and the estimates from minimizing the new deviation can be a convex combination of the k + 1 OLS regression estimates.
TL;DR: Taavitsainen et al. as mentioned in this paper proposed a nonlinear (NL) PLS approach for nonlinear data analysis with latent variables, which is based on linear partial least squares (PLS).