Towards Practical Active Learning for Classification
Y. Yang
- 20 Nov 2018
TL;DR: This thesis addresses the particular challenge of using as few annotations as possible, while at the same time, maintaining a good learning performance, and proposes two novel active learning methods that show a clear advantage over passive learning.
read more
Abstract: In recent decades, the availability of a large amount of data has propelled the field of machine learning enormously. Machine learning, however, relies heavily on the availability of annotated data, typically labels indicating to which class a data instance belongs. With the huge amounts of data, this raises the question of how to efficiently annotate data, certainly when having limited resources. This thesis addresses the particular challenge of using as few annotations as possible, while at the same time, maintaining a good learning performance. For that we utilize active learning, which iteratively chooses the most valuable instances as to obtain the labels froman oracle (e.g. a human expert). Though many studies have demonstrated that active learning can reduce the annotation cost, there are still several issues that limit its practical use. This thesis makes a further step forwards making active learning more practical for real-world applications. We first provide a benchmark and comparison of six different categories of active learning algorithms built on logistic regression. This work provides a better understanding of the underlying characteristics of various active learners and illustrates the potential benefits of using such techniques, but it also provides many cases for which active learning fails to outperform passive learning (i.e. randomly selecting instances for labeling). Those failed cases motivate us to propose two novel active learning methods that show a clear advantage over passive learning. The first one proposes to weight the so-called retraining-based criteria with an uncertainty score that is measured by the estimated posterior probability. The second one measures the usefulness of unlabeled instances according to the variance of the predictive probability. This method takes an additional step towards practical active learning, clearly outperforming current state of the art on binary andmulti-class classification tasks. We further consider two realistic issues when applying active learning to real-world problems. One is how to find an initial set that contains at least one instance per class to start the active labeling cycle. The other one is dealing with the absence of human annotators in the interactive labeling loop. We propose new approaches to tackle the above problems and observe good performance compared to existing methods. This thesis concludes with an analysis of the contributions and limitations of our work, as well as research directions that deserve further studies. We hope that this thesis also inspires others to make active learning more suitable for real-world applications.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
TROMPA-MER: an open dataset for personalized music emotion recognition
Juan Sebastián Gómez-Cañón,Nicolás Gutiérrez-Páez,Lorenzo Porcaro,Alastair Porter,Estefania Cono,Perfecto Herrera-Boyer,Aggelos Gkiokas,Patricia Santos,Davinia Hernández-Leo,Casper Karreman,Emilia Gómez +10 more
TL;DR: In this article , the authors present a platform and a dataset to help research on music emotion recognition (MER) using citizen science strategies and generate music emotion annotations, where participants annotated each music excerpt with single free-text emotion words (in native language), distinct forced-choice emotion categories, preference and familiarity.
Enhancing Deep Active Learning Using Selective Self-Training For Image Classification
Emmeleia Panagiota Mastoropoulou
- 01 Jan 2019
TL;DR: A high quality and large scale training data-set is an important guarantee to teach an ideal classifier for image classification.
9
TROMPA-MER: an open dataset for personalized Music Emotion Recognition
10 Aug 2022
TL;DR: In this paper , the authors present a platform and a dataset to help research on music emotion recognition (MER) using citizen science strategies and generate music emotion annotations, where participants annotated each music excerpt with single free-text emotion words (in native language), distinct forced-choice emotion categories, preference and familiarity.
Let's agree to disagree: Consensus Entropy Active Learning for Personalized Music Emotion Recognition
Juan S. Gómez-Cañón,Estefanía Cano,Yi-Hsuan Yang,Perfecto Herrera,Emilia Gómez +4 more
- 07 Nov 2021
TL;DR: This work proposes a methodology based on uncertainty sampling and query-by-committee, adopting prior knowledge from the agreement of human annotations as an oracle for active learning (AL), and suggests that this methodology can be beneficial to produce personalized classification models that exhibit different results depending on the algorithms’ complexity.
5
•Journal Article
A variance minimization criterion to active learning on graphs
Ming Ji,Jiawei Han +1 more
TL;DR: In this paper, the authors considered the problem of active learning over the vertices in a graph, without feature representation, based on the common graph smoothness assumption, which is formulated in a Gaussian random field model and analyzed the probability distribution over the unlabeled vertices conditioned on the label information.
3
References
ImageNet classification with deep convolutional neural networks
TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.
•Proceedings Article
ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky,Ilya Sutskever,Geoffrey E. Hinton +2 more
- 03 Dec 2012
TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Gradient-based learning applied to document recognition
Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner +6 more
- 01 Jan 1998
TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
53.5K
LIBSVM: A library for support vector machines
Chih-Chung Chang,Chih-Jen Lin +1 more
TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Histograms of oriented gradients for human detection
Navneet Dalal,Bill Triggs +1 more
- 20 Jun 2005
TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Related Papers (5)
Mustafa Bilgic,Lise Getoor +1 more
- 01 Jan 2009
Hao Shao,Hao Shao,Bin Tong,Einoshin Suzuki +3 more
- 15 Dec 2012
Buyue Qian,Xiang Wang,Fei Wang,Hongfei Li,Jieping Ye,Ian Davidson +5 more
- 03 Aug 2013