Active learning for logistic regression: an evaluation

doi:10.1007/S10994-007-5019-5

Open AccessJournal Article10.1007/S10994-007-5019-5

Active learning for logistic regression: an evaluation

Andrew I. Schein, +1 more

- 01 Oct 2007

- Machine Learning

- Vol. 68, Iss: 3, pp 235-265

411

TL;DR: A re-derive of the variance reduction method known in experimental design circles as ‘A-optimality’ and comparisons against different variations of the most widely used heuristic schemes are run to discover which methods work best for different classes of problems and why.

Abstract: Which active learning methods can we expect to yield good performance in learning binary and multi-category logistic regression classifiers? Addressing this question is a natural first step in providing robust solutions for active learning across a wide variety of exponential models including maximum entropy, generalized linear, log-linear, and conditional random field models. For the logistic regression model we re-derive the variance reduction method known in experimental design circles as `A-optimality.' We then run comparisons against different variations of the most widely used heuristic schemes: query by committee and uncertainty sampling, to discover which methods work best for different classes of problems and why. We find that among the strategies tested, the experimental design methods are most likely to match or beat a random sample baseline. The heuristic alternatives produced mixed results, with an uncertainty sampling variant called margin sampling and a derivative method called QBB-MM providing the most promising performance at very low computational cost. Computational running times of the experimental design methods were a bottleneck to the evaluations. Meanwhile, evaluation of the heuristic methods lead to an accumulation of negative results. We explore alternative evaluation design parameters to test whether these negative results are merely an artifact of settings where experimental design methods can be applied. The results demonstrate a need for improved active learning methods that will provide reliable performance at a reasonable computational cost.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1145/242224.242229

Machine learning

Thomas G. Dietterich

- 01 Dec 1996

- ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

14K

Active Learning Literature Survey

Burr Settles

- 01 Jan 2009

TL;DR: This report provides a general introduction to active learning and a survey of the literature, including a discussion of the scenarios in which queries can be formulated, and an overview of the query strategy frameworks proposed in the literature to date.

...read moreread less

6.7K

•Journal Article•10.1016/J.ROBOT.2012.05.008

Active learning of inverse models with intrinsically motivated goal exploration in robots

Adrien Baranes, +1 more

- 01 Jan 2013

- Robotics and Autonomous Systems

TL;DR: The Self-Adaptive Goal Generation Robust Intelligent Adaptive Curiosity (SAGG-RIAC) architecture is introduced as an intrinsically motivated goal exploration mechanism which allows active learning of inverse models in high-dimensional redundant robots.

...read moreread less

582

Journal Article•10.1016/J.PATCOG.2008.10.028

Feature selection with dynamic mutual information

Huawen Liu, +3 more

- 01 Jul 2009

- Pattern Recognition

TL;DR: A new feature selection algorithm based on dynamic mutual information, which is only estimated on unlabeled instances is proposed, which can bring most information measurements in previous algorithms together.

...read moreread less

411

•Posted Content

On The Power of Curriculum Learning in Training Deep Networks

Guy Hacohen, +1 more

- 07 Apr 2019

- arXiv: Learning

TL;DR: This work analyzes the effect of curriculum learning, which involves the non-uniform sampling of mini-batches, on the training of deep networks, and specifically CNNs trained for image recognition, and defines the concept of an ideal curriculum.

...read moreread less

358

...

Expand

References

Journal Article•10.2307/2532419

Applied Logistic Regression.

A. J. Scott, +2 more

- 01 Dec 1991

- Biometrics

TL;DR: Applied Logistic Regression, Third Edition provides an easily accessible introduction to the logistic regression model and highlights the power of this model by examining the relationship between a dichotomous outcome and a set of covariables.

...read moreread less

40.1K

•Book

Neural networks for pattern recognition

Christopher M. Bishop

- 01 Jan 1995

TL;DR: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition, and is designed as a text, with over 100 exercises, to benefit anyone involved in the fields of neural computation and pattern recognition.

...read moreread less

19.9K

•Journal Article•10.1023/A:1018054314350

Bagging predictors

Leo Breiman

- 01 Aug 1996

TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.

...read moreread less

16.6K

•Proceedings Article

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

John Lafferty, +2 more

- 28 Jun 2001

TL;DR: This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.

...read moreread less

15.4K