Algorithmically Effective Differentially Private Synthetic Data

doi:10.48550/arXiv.2302.05552

Journal Article10.48550/arXiv.2302.05552

Algorithmically Effective Differentially Private Synthetic Data

Yi He, +2 more

- 11 Feb 2023

- arXiv.org

- Vol. abs/2302.05552

5

TL;DR: In this article , the authors presented a highly effective algorithm for generating differentially private synthetic data in a bounded metric space with near-optimal utility guarantees under the 1-Wasserstein distance.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arXiv.2302.09680

Sample-efficient private data release for Lipschitz functions under sparsity assumptions

Konstantin Donhauser, +5 more

- 19 Feb 2023

- arXiv.org

TL;DR: In this paper , the authors presented a differentially private data release algorithm that achieves optimal rates of order $n^{-1/d} , where d being the size of the dataset and n being the dimension, for the worst case error over all Lipschitz continuous statistics.

...read moreread less

4

Journal Article•10.48550/arXiv.2305.17148

Differentially private low-dimensional representation of high-dimensional data

Yi He, +3 more

- 26 May 2023

- arXiv.org

TL;DR: In this article , a differentially private algorithm is proposed to generate low-dimensional synthetic data efficiently from a high-dimensional dataset with a utility guarantee with respect to the Wasserstein distance.

...read moreread less

3

•Posted Content•10.48550/arxiv.2302.09680

Sample-efficient private data release for Lipschitz functions under sparsity assumptions

19 Feb 2023

TL;DR: In this article , the authors presented a differentially private data release algorithm that achieves optimal rates of order $n^{-1/d} , where d being the size of the dataset and n being the dimension, for the worst case error over all Lipschitz continuous statistics.

...read moreread less

3

Journal Article•10.48550/arXiv.2305.12100

Stability, Generalization and Privacy: Precise Analysis for Random and NTK Features

Simone Bombari, +1 more

- 20 May 2023

- arXiv.org

TL;DR: In this article , the authors study the safety of ERM-trained models against a family of powerful black-box attacks and quantifies this safety via two separate terms: (i) the model stability with respect to individual training samples, and (ii) the feature alignment between the attacker query and the original data.

...read moreread less

2

Journal Article•10.48550/arxiv.2402.08012

Online Differentially Private Synthetic Data Generation

Yiyun He, +2 more

- 12 Feb 2024

- arXiv.org

TL;DR: An online algorithm is developed that generates a differentially private synthetic dataset at each time $t$ that achieves a near-optimal accuracy bound of O(t^{-1/d}\log(t) for d\geq 2 and $O(t^{-1}\log^{4.5}(t) for d=1$ in the 1-Wasserstein distance.

...read moreread less

References

•Book

Optimal Transport: Old and New

Cédric Villani

- 02 Jan 2013

TL;DR: In this paper, the authors provide a detailed description of the basic properties of optimal transport, including cyclical monotonicity and Kantorovich duality, and three examples of coupling techniques.

...read moreread less

7.4K

•Book

The Algorithmic Foundations of Differential Privacy

Cynthia Dwork, +1 more

- 11 Aug 2014

TL;DR: The preponderance of this monograph is devoted to fundamental techniques for achieving differential privacy, and application of these techniques in creative combinations, using the query-release problem as an ongoing example.

...read moreread less

7.2K

•Proceedings Article•10.1145/2976749.2978318

Deep Learning with Differential Privacy

Martín Abadi, +6 more

- 01 Jul 2016

- arXiv: Machine Learning

TL;DR: This work develops new algorithmic techniques for learning and a refined analysis of privacy costs within the framework of differential privacy, and demonstrates that deep neural networks can be trained with non-convex objectives, under a modest privacy budget, and at a manageable cost in software complexity, training efficiency, and model quality.

...read moreread less

4.3K

Book Chapter•10.1007/3-540-44581-1_15

Rademacher and gaussian complexities: risk bounds and structural results

Peter L. Bartlett, +1 more

- 01 Mar 2003

TL;DR: In this paper, the authors investigate the use of data-dependent estimates of the complexity of a function class, called Rademacher and Gaussian complexities, in a decision theoretic setting and prove general risk bounds in terms of these complexities.

...read moreread less

3K