Journal Article10.1109/TGRS.2013.2258468
Bayesian Active Remote Sensing Image Classification
TL;DR: This paper exploits the Bayesian modeling and inference paradigm to tackle the problem of kernel-based remote sensing image classification and proposes an incremental/active learning approach based on three different approaches: the maximum differential of entropies; the minimum distance to decision boundary; and the minimum normalized distance.
read more
Abstract: In recent years, kernel methods, in particular support vector machines (SVMs), have been successfully introduced to remote sensing image classification. Their properties make them appropriate for dealing with a high number of image features and a low number of available labeled spectra. The introduction of alternative approaches based on (parametric) Bayesian inference has been quite scarce in the more recent years. Assuming a particular prior data distribution may lead to poor results in remote sensing problems because of the specificities and complexity of the data. In this context, the emerging field of nonparametric Bayesian methods constitutes a proper theoretical framework to tackle the remote sensing image classification problem. This paper exploits the Bayesian modeling and inference paradigm to tackle the problem of kernel-based remote sensing image classification. This Bayesian methodology is appropriate for both finite- and infinite-dimensional feature spaces. The particular problem of active learning is addressed by proposing an incremental/active learning approach based on three different approaches: 1) the maximum differential of entropies; 2) the minimum distance to decision boundary; and 3) the minimum normalized distance. Parameters are estimated by using the evidence Bayesian approach, the kernel trick, and the marginal distribution of the observations instead of the posterior distribution of the adaptive parameters. This approach allows us to deal with infinite-dimensional feature spaces. The proposed approach is tested on the challenging problem of urban monitoring from multispectral and synthetic aperture radar data and in multiclass land cover classification of hyperspectral images, in both purely supervised and active learning settings. Similar results are obtained when compared to SVMs in the supervised mode, with the advantage of providing posterior estimates for classification and automatic parameter learning. Comparison with random sampling as well as standard active learning methods such as margin sampling and entropy-query-by-bagging reveals a systematic overall accuracy gain and faster convergence with the number of queries.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Pattern Recognition and Machine Learning
Christopher M. Bishop
- 01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
10.1K
A review of machine learning in processing remote sensing data for mineral exploration
TL;DR: In this paper, a comprehensive review of the implementation and adaptation of some popular and recently established machine learning methods for processing different types of remote sensing data and investigates their applications for detecting various ore deposit types.
207
A survey of remote-sensing big data
TL;DR: It is pointed out that the dynamic-state, multi-scale and non-linear characteristics are intrinsic characteristics of remote-sensing big data while the multi-source, high-dimensional and isomer characteristics are extrinsic characteristics ofremote- sensing big data.
Remote sensing image classification based on the optimal support vector machine and modified binary coded ant colony optimization algorithm
TL;DR: A remote sensing image classification technique based on the optimal SVM is proposed, in which the parameters of SVM and feature selection are handled integrally by a modified coded ant colony optimization algorithm combined with genetic algorithm.
112
Active Learning With Gaussian Process Classifier for Hyperspectral Image Classification
TL;DR: Three new AL heuristics based on the probabilistic output of GP classifiers aimed at actively selecting the most uncertain and confusing candidate samples from the unlabeled data are proposed and an incremental model updating scheme is developed to avoid the repeated training of the GP classifier during the AL process.
94
References
•Book
Pattern Recognition and Machine Learning
Christopher M. Bishop
- 17 Aug 2006
TL;DR: Probability Distributions, linear models for Regression, Linear Models for Classification, Neural Networks, Graphical Models, Mixture Models and EM, Sampling Methods, Continuous Latent Variables, Sequential Data are studied.
A Tutorial on Support Vector Machines for Pattern Recognition
TL;DR: There are several arguments which support the observed high accuracy of SVMs, which are reviewed and numerous examples and proofs of most of the key theorems are given.
Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond
Bernhard Schölkopf,Alexander J. Smola +1 more
- 01 Dec 2001
TL;DR: Learning with Kernels provides an introduction to SVMs and related kernel methods that provide all of the concepts necessary to enable a reader equipped with some basic mathematical knowledge to enter the world of machine learning using theoretically well-founded yet easy-to-use kernel algorithms.
10.2K
•Book
Pattern Recognition and Machine Learning (Information Science and Statistics)
Christopher M. Bishop
- 01 Aug 2006
TL;DR: Looking for competent reading resources?
10.1K
Pattern Recognition and Machine Learning
Christopher M. Bishop
- 01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
10.1K