OpBoost

doi:10.14778/3565816.3565823

Journal Article10.14778/3565816.3565823

OpBoost

Xiaochen Li, +7 more

- 01 Oct 2022

- Proceedings of The Vldb Endowment

- Vol. 16, Iss: 2, pp 202-215

15

TL;DR: In this paper , Xu et al. proposed three order-preserving desensitization algorithms satisfying a variant of Local Differential Privacy (LDP), called distance-based LDP (dLDP), to improve the accuracy of tree boosting algorithms satisfying differential privacy under vertical FL.

Abstract: Vertical Federated Learning (FL) is a new paradigm that enables users with non-overlapping attributes of the same data samples to jointly train a model without directly sharing the raw data. Nevertheless, recent works show that it's still not sufficient to prevent privacy leakage from the training process or the trained model. This paper focuses on studying the privacy-preserving tree boosting algorithms under the vertical FL. The existing solutions based on cryptography involve heavy computation and communication overhead and are vulnerable to inference attacks. Although the solution based on Local Differential Privacy (LDP) addresses the above problems, it leads to the low accuracy of the trained model. This paper explores to improve the accuracy of the widely deployed tree boosting algorithms satisfying differential privacy under vertical FL. Specifically, we introduce a framework called OpBoost. Three order-preserving desensitization algorithms satisfying a variant of LDP called distance-based LDP (dLDP) are designed to desensitize the training data. In particular, we optimize the dLDP definition and study efficient sampling distributions to further improve the accuracy and efficiency of the proposed algorithms. The proposed algorithms provide a trade-off between the privacy of pairs with large distance and the utility of desensitized values. Comprehensive evaluations show that OpBoost has a better performance on prediction accuracy of trained models compared with existing LDP approaches on reasonable settings. Our code is open source.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/tkde.2024.3352628

Vertical Federated Learning: Concepts, Advances, and Challenges

Yang Liu, +8 more

- 01 Jan 2024

- IEEE Transactions on Knowledge and Data ...

TL;DR: VFL is a federated learning setting where multiple parties train machine learning models without exposing their raw data or model parameters. It involves a comprehensive review of concepts, advances, and challenges in VFL, including effectiveness, efficiency, privacy, and fairness.

...read moreread less

51

Journal Article•10.1109/jas.2024.124215

A Tutorial on Federated Learning from Theory to Practice: Foundations, Software Frameworks, Exemplary Use Cases, and Selected Trends

M. Victoria Luzón, +7 more

- 01 Apr 2024

- IEEE/CAA Journal of Automatica Sinica

TL;DR: A comprehensive tutorial on federated learning (FL) covering foundations, software frameworks, exemplary use cases, and selected trends. FL enables distributed model training without centralized data transfer, preserving data privacy.

...read moreread less

20

Journal Article•10.3390/blockchains2010003

Decision Tree-Based Federated Learning: A Survey

Zijun Wang, +1 more

- 07 Mar 2024

TL;DR: Federated learning with decision tree models enhances performance and privacy, but faces challenges in training and prediction. The survey explores recent advancements and emphasizes data security and communication efficiency as key areas for improvement.

...read moreread less

11

Journal Article•10.1109/tkde.2024.3382002

A Survey for Federated Learning Evaluations: Goals and Measures

Di Chai, +5 more

- 01 Jan 2024

- IEEE Transactions on Knowledge and Data ...

7

Journal Article•10.1109/tsc.2023.3279839

Privet: A Privacy-Preserving Vertical Federated Learning Service for Gradient Boosted Decision Tables

01 Jan 2023

- IEEE Transactions on Services Computing

TL;DR: In this paper , the authors proposed a framework for privacy-preserving VFL service for gradient-boosted decision tables, which allows an arbitrary number of participants holding vertically partitioned datasets to securely train gradient boosted decision tables.

...read moreread less

6

References

•Proceedings Article•10.1145/2939672.2939785

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, +1 more

- 09 Mar 2016

- arXiv: Learning

TL;DR: This paper proposes a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning and provides insights on cache access patterns, data compression and sharding to build a scalable tree boosting system called XGBoost.

...read moreread less

32.8K

•Proceedings Article•10.1145/2939672.2939785

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, +1 more

- 13 Aug 2016

TL;DR: XGBoost as discussed by the authors proposes a sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning to achieve state-of-the-art results on many machine learning challenges.

...read moreread less

14.8K

•Book

The Algorithmic Foundations of Differential Privacy

Cynthia Dwork, +1 more

- 11 Aug 2014

TL;DR: The preponderance of this monograph is devoted to fundamental techniques for achieving differential privacy, and application of these techniques in creative combinations, using the query-release problem as an ongoing example.

...read moreread less

7.2K

Journal Article•10.1145/3298981

Federated Machine Learning: Concept and Applications

Qiang Yang, +3 more

- 28 Jan 2019

- ACM Transactions on Intelligent Systems ...

TL;DR: This work introduces a comprehensive secure federated-learning framework, which includes horizontal federated learning, vertical federatedLearning, and federated transfer learning, and provides a comprehensive survey of existing works on this subject.

...read moreread less

4.3K

Book Chapter•10.1007/978-3-540-79228-4_1

Differential privacy: a survey of results

Cynthia Dwork

- 25 Apr 2008

TL;DR: This survey recalls the definition of differential privacy and two basic techniques for achieving it, and shows some interesting applications of these techniques, presenting algorithms for three specific tasks and three general results on differentially private learning.

...read moreread less

4.2K