TL;DR: This paper proposes a novel solution to preserve the joint distribution of a high-dimensional dataset using an integer programming relaxation and the constrained concave-convex procedure and proves that selecting the optimal marginals with the goal of minimizing error is NP-hard.
Abstract: Releasing high-dimensional data enables a wide spectrum of data mining tasks. Yet, individual privacy has been a major obstacle to data sharing. In this paper, we consider the problem of releasing high-dimensional data with differential privacy guarantees. We propose a novel solution to preserve the joint distribution of a high-dimensional dataset. We first develop a robust sampling-based framework to systematically explore the dependencies among all attributes and subsequently build a dependency graph. This framework is coupled with a generic threshold mechanism to significantly improve accuracy. We then identify a set of marginal tables from the dependency graph to approximate the joint distribution based on the solid inference foundation of the junction tree algorithm while minimizing the resultant error. We prove that selecting the optimal marginals with the goal of minimizing error is NP-hard and, thus, design an approximation algorithm using an integer programming relaxation and the constrained concave-convex procedure. Extensive experiments on real datasets demonstrate that our solution substantially outperforms the state-of-the-art competitors.
TL;DR: This paper shows that (loopy) belief propagation (BP) can be lifted and establishes a link between lifting and radix sort, which paves the way for the first scalable lifted training approaches based on stochastic gradients, both in an online and a MapReduce fashion.
Abstract: Judging by the increasing impact of machine learning on large-scale data analysis in the last decade, one can anticipate a substantial growth in diversity of the machine learning applications for "big data" over the next decade. This exciting new opportunity, however, also raises many challenges. One of them is scaling inference within and training of graphical models. Typical ways to address this scaling issue are inference by approximate message passing, stochastic gradients, and MapReduce, among others. Often, we encounter inference and training problems with symmetries and redundancies in the graph structure. A prominent example are relational models that capture complexity. Exploiting these symmetries, however, has not been considered for scaling yet. In this paper, we show that inference and training can indeed benefit from exploiting symmetries. Specifically, we show that (loopy) belief propagation (BP) can be lifted. That is, a model is compressed by grouping nodes together that send and receive identical messages so that a modified BP running on the lifted graph yields the same marginals as BP on the original one, but often in a fraction of time. By establishing a link between lifting and radix sort, we show that lifting is MapReduce-able. Still, in many if not most situations training relational models will not benefit from this (scalable) lifting: symmetries within models easily break since variables become correlated by virtue of depending asymmetrically on evidence. An appealing idea for such situations is to train and recombine local models. This breaks long-range dependencies and allows to exploit lifting within and across the local training tasks. Moreover, it naturally paves the way for the first scalable lifted training approaches based on stochastic gradients, both in an online and a MapReduced fashion. On several datasets, the online training, for instance, converges to the same quality solution over an order of magnitude faster, simply because it starts optimizing long before having seen the entire mega-example even once.
TL;DR: This work provides two algorithms that extend the celebrated junction tree algorithm, process soft evidence, and have different efficiency characteristics, and provides methodological guidance to model soft evidence in the form of beliefs on single and multiple variables, propositional logical formulae, and even conditional distributions.
TL;DR: This work extends the standard mean field method by using an approximating distribution that factorises into cluster potentials, which includes undirected graphs, directed acyclic graphs and junction trees, and derives generalized mean field equations to optimize the clusters potentials.
Abstract: Recently, variational approximations such as the mean field approximation have received much interest. We extend the standard mean field method by using an approximating dis tribution that factorises into cluster poten tials. This includes undirected graphs, di rected acyclic graphs and junction trees. We derive generalised mean field equations to op timise the cluster potentials. We show that the method bridges the gap between the stan dard mean field approximation and the exact junction tree algorithm. In addition, we ad dress the problem of how to choose the struc ture and the free parameters of the approx imating distribution. From the generalised mean field equations we derive rules to sim plify the approximation in advance without affecting the potential accuracy of the model class. We also show how the method fits into some other variational approximations that are currently popular.
TL;DR: The proposed method makes use of a dynamic Bayesian network and the junction tree algorithm for probabilistic inference to predict future human driving behavior under the condition that its resultant behavior and past observations are given.
Abstract: This paper presents a method of predicting future human driving behavior under the condition that its resultant behavior and past observations are given. The proposed method makes use of a dynamic Bayesian network and the junction tree algorithm for probabilistic inference. The method is applied to behavior prediction for a vehicle assumed to stop at an intersection. Such a predictive system would facilitate warning and assistance to prevent dangerous activities, such as red-light violations, by allowing detection of a deviation from normal behavior.