Attribute-Distributed Learning: Models, Limits, and Algorithms

doi:10.1109/TSP.2010.2088393

Journal Article10.1109/TSP.2010.2088393

Attribute-Distributed Learning: Models, Limits, and Algorithms

Haipeng Zheng, +2 more

- 01 Jan 2011

- IEEE Transactions on Signal Processing

- Vol. 59, Iss: 1, pp 386-398

66

TL;DR: A framework for distributed learning (regression) on attribute-distributed data by taking residual refitting (or boosting) as a prototype algorithm, three different schemes, Simple Iterative Projection, a greedy algorithm, and a parallel algorithm (with its derivatives), are proposed and compared.

Abstract: This paper introduces a framework for distributed learning (regression) on attribute-distributed data. First, the convergence properties of attribute-distributed regression with an additive model and a fusion center are discussed, and the convergence rate and uniqueness of the limit are shown for some special cases. Then, taking residual refitting (or boosting) as a prototype algorithm, three different schemes, Simple Iterative Projection, a greedy algorithm, and a parallel algorithm (with its derivatives), are proposed and compared. Among these algorithms, the first two are sequential and have low communication overhead, but are susceptible to overtraining. The parallel algorithm has the best performance, but has significant communication requirements. Instead of directly refitting the ensemble residual sequentially, the parallel algorithm redistributes the residual to each agent in proportion to the coefficients of the optimal linear combination of the current individual estimators. Designing residual redistribution schemes also improves the ability to eliminate irrelevant attributes. The performance of the algorithms is compared via extensive simulations. Communication issues are also considered: the amount of data to be exchanged among the three algorithms is compared, and the three methods are generalized to scenarios without a fusion center.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1186/S13634-016-0355-X

A survey of machine learning for big data processing

Junfei Qiu, +4 more

- 28 May 2016

- EURASIP Journal on Advances in Signal Pr...

TL;DR: A literature survey of the latest advances in researches on machine learning for big data processing finds some promising learning methods in recent studies, such as representation learning, deep learning, distributed and parallel learning, transfer learning, active learning, and kernel-based learning.

...read moreread less

907

•Proceedings Article

Privacy-preserving SVM classification

Jaideep Vaidya, +2 more

- 01 Jan 2008

TL;DR: In this article, a privacy-preserving solution for support vector machine (SVM) classification, PP-SVM for short, is proposed, which constructs the global SVM classification model from data distributed at multiple parties, without disclosing the data of each party to others.

...read moreread less

174

•Journal Article•10.1109/MWC.2012.6155875

Information and inference in the wireless physical layer

H. Vincent Poor

- 23 Feb 2012

- IEEE Wireless Communications

TL;DR: Four research areas are explored briefly, primarily involving information theoretic or inferential problems, each of which is motivated by a wireless application-layer issue: security in data networks, distributed inference in sensor networks, finite-blocklength capacity in multimedia networks, and connectivity in small-world networks.

...read moreread less

142

•Journal Article•10.1002/GAMM.202100001

Combining machine learning and domain decomposition methods for the solution of partial differential equations—A review

Alexander Heinlein, +3 more

- 01 Mar 2021

- Gamm-mitteilungen

TL;DR: An approach is presented which uses neural networks to reduce the computational effort in adaptive DDMs while retaining their robustness, and two recently published deep domain decomposition approaches are presented in a unified framework.

...read moreread less

74

•Journal Article•10.1109/JSTSP.2015.2389196

Mining the Situation: Spatiotemporal Traffic Prediction With Big Data

Jie Xu, +4 more

- 06 Jan 2015

- IEEE Journal of Selected Topics in Signa...

TL;DR: A novel online framework that could learn from the current traffic situation (or context) in real-time and predict the future traffic by matching the current situation to the most effective prediction model trained using historical data is proposed.

...read moreread less

66

...

Expand

References

Journal Article•10.1111/J.2517-6161.1996.TB02080.X

Regression Shrinkage and Selection via the Lasso

Robert Tibshirani

- 01 Jan 1996

- Journal of the royal statistical society...

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.

...read moreread less

45.4K

•Journal Article•10.1214/AOS/1013203451

Greedy function approximation: A gradient boosting machine.

Jerome H. Friedman

- 01 Oct 2001

- Annals of Statistics

TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.

...read moreread less

26.4K

Journal Article•10.1198/016214503000125

Boosting With the L2 Loss

Peter Bühlmann, +1 more

- 01 Jun 2003

- Journal of the American Statistical Asso...

TL;DR: In this paper, a computationally simple variant of boosting, L2Boost, which is constructed from a functional gradient descent algorithm with the L2-loss function, is investigated in both regression and classification.

...read moreread less

899

Proceedings Article•10.1145/1081870.1081942

Privacy-preserving distributed k-means clustering over arbitrarily partitioned data

Geetha Jagannathan, +1 more

- 21 Aug 2005

TL;DR: The concept of arbitrarily partitioned data is introduced, which is a generalization of both horizontally and vertically partitionedData, and an efficient privacy-preserving protocol for k-means clustering in the setting of arbitrarily partitions data is provided.

...read moreread less

501

•Journal Article•10.1109/MSP.2006.1657817

Distributed learning in wireless sensor networks

Joel B. Predd, +2 more

- 17 Jul 2006

- IEEE Signal Processing Magazine

TL;DR: In this article, the authors discuss nonparametric distributed learning in WSNs and discuss the challenges that wireless sensor networks pose for distributed learning, and research aimed at addressing these challenges is surveyed.

...read moreread less

468

...

Expand

Attribute-Distributed Learning: Models, Limits, and Algorithms

Chat with Paper

AI Agents for this Paper

Citations

A survey of machine learning for big data processing

Privacy-preserving SVM classification

Information and inference in the wireless physical layer

Combining machine learning and domain decomposition methods for the solution of partial differential equations—A review

Mining the Situation: Spatiotemporal Traffic Prediction With Big Data

References

Regression Shrinkage and Selection via the Lasso

Greedy function approximation: A gradient boosting machine.

Boosting With the L2 Loss

Privacy-preserving distributed k-means clustering over arbitrarily partitioned data

Distributed learning in wireless sensor networks

Related Papers (5)

A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting

Distributed Sparse Linear Regression

Distributed learning in wireless sensor networks

Tracking the Best Expert

The weighted majority algorithm