Journal Article10.14778/3565816.3565823
OpBoost
15
TL;DR: In this paper , Xu et al. proposed three order-preserving desensitization algorithms satisfying a variant of Local Differential Privacy (LDP), called distance-based LDP (dLDP), to improve the accuracy of tree boosting algorithms satisfying differential privacy under vertical FL.
read more
Abstract: Vertical Federated Learning (FL) is a new paradigm that enables users with non-overlapping attributes of the same data samples to jointly train a model without directly sharing the raw data. Nevertheless, recent works show that it's still not sufficient to prevent privacy leakage from the training process or the trained model. This paper focuses on studying the privacy-preserving tree boosting algorithms under the vertical FL. The existing solutions based on cryptography involve heavy computation and communication overhead and are vulnerable to inference attacks. Although the solution based on Local Differential Privacy (LDP) addresses the above problems, it leads to the low accuracy of the trained model.
This paper explores to improve the accuracy of the widely deployed tree boosting algorithms satisfying differential privacy under vertical FL. Specifically, we introduce a framework called OpBoost. Three order-preserving desensitization algorithms satisfying a variant of LDP called distance-based LDP (dLDP) are designed to desensitize the training data. In particular, we optimize the dLDP definition and study efficient sampling distributions to further improve the accuracy and efficiency of the proposed algorithms. The proposed algorithms provide a trade-off between the privacy of pairs with large distance and the utility of desensitized values. Comprehensive evaluations show that OpBoost has a better performance on prediction accuracy of trained models compared with existing LDP approaches on reasonable settings. Our code is open source.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Vertical Federated Learning: Concepts, Advances, and Challenges
Yang Liu,Yan Kang,Tianyuan Zou,Yanhong Pu,Yuanqin He,Xiaozhou Ye,Ye Ouyang,Yaqin Zhang,Qiang Yang +8 more
TL;DR: VFL is a federated learning setting where multiple parties train machine learning models without exposing their raw data or model parameters. It involves a comprehensive review of concepts, advances, and challenges in VFL, including effectiveness, efficiency, privacy, and fairness.
51
A Tutorial on Federated Learning from Theory to Practice: Foundations, Software Frameworks, Exemplary Use Cases, and Selected Trends
M. Victoria Luzón,Nuria Rodríguez-Barroso,Alberto Argente-Garrido,Daniel Jiménez-López,Jose M. Moyano,Javier Del Ser,Weiping Ding,Francisco Herrera +7 more
TL;DR: A comprehensive tutorial on federated learning (FL) covering foundations, software frameworks, exemplary use cases, and selected trends. FL enables distributed model training without centralized data transfer, preserving data privacy.
20
Decision Tree-Based Federated Learning: A Survey
Zijun Wang,Keke Gai +1 more
- 07 Mar 2024
TL;DR: Federated learning with decision tree models enhances performance and privacy, but faces challenges in training and prediction. The survey explores recent advancements and emphasizes data security and communication efficiency as key areas for improvement.
11
A Survey for Federated Learning Evaluations: Goals and Measures
Di Chai,Leye Wang,Yang Liu,Junxue Zhang,Kai Chen,Qiang Yang +5 more
7
Privet: A Privacy-Preserving Vertical Federated Learning Service for Gradient Boosted Decision Tables
TL;DR: In this paper , the authors proposed a framework for privacy-preserving VFL service for gradient-boosted decision tables, which allows an arbitrary number of participants holding vertically partitioned datasets to securely train gradient boosted decision tables.
6
References
XGBoost: A Scalable Tree Boosting System
Tianqi Chen,Carlos Guestrin +1 more
TL;DR: This paper proposes a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning and provides insights on cache access patterns, data compression and sharding to build a scalable tree boosting system called XGBoost.
XGBoost: A Scalable Tree Boosting System
Tianqi Chen,Carlos Guestrin +1 more
- 13 Aug 2016
TL;DR: XGBoost as discussed by the authors proposes a sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning to achieve state-of-the-art results on many machine learning challenges.
•Book
The Algorithmic Foundations of Differential Privacy
Cynthia Dwork,Aaron Roth +1 more
- 11 Aug 2014
TL;DR: The preponderance of this monograph is devoted to fundamental techniques for achieving differential privacy, and application of these techniques in creative combinations, using the query-release problem as an ongoing example.
Federated Machine Learning: Concept and Applications
TL;DR: This work introduces a comprehensive secure federated-learning framework, which includes horizontal federated learning, vertical federatedLearning, and federated transfer learning, and provides a comprehensive survey of existing works on this subject.
4.3K
Differential privacy: a survey of results
Cynthia Dwork
- 25 Apr 2008
TL;DR: This survey recalls the definition of differential privacy and two basic techniques for achieving it, and shows some interesting applications of these techniques, presenting algorithms for three specific tasks and three general results on differentially private learning.