Learning from ambiguous examples

Open Access

Learning from ambiguous examples

- 01 Jan 2007

15

TL;DR: This thesis demonstrates that the burden of collecting a large number of examples may be supplanted with algorithms that learn from readily available inputs that are ambiguous and suggests that the approach is best suited for tasks in which one and only one interpretation of each ambiguous example is valid, which is a reasonable assumption when the ambiguity is due to missing labels in the training data.

Abstract: The main drawback of the supervised learning approach to solving pattern classification problems is that the initial instance-label pairs are often expensive to collect due to required human effort or comprehensive testing. In many applications however, it is evidently more practical and sometimes essential to collect training examples that are ambiguous due to polymorphism or missing labels. Since these ambiguous examples have a small number of interpretations as instance-label pairs, they are still informative. It is therefore of great interest to practitioners and machine learning researchers alike to develop principled methods that can utilize such examples. This thesis demonstrates that the burden of collecting a large number of examples may be supplanted with algorithms that learn from readily available inputs that are ambiguous. This thesis presents a statistical learning theoretic framework for learning from ambiguous examples that is based on a novel formalization of disambiguation consistency. Intuitively, a valid interpretation and concept hypothesis must be mutually reinforcing. Using this principle, disambiguation and learning are jointly formulated as a non-convex maximum-margin problem. The first presented algorithmic approach for solving the disambiguation and learning problem uses a 2-stage mixed-integer local search technique that leverages state-of-the-art support vector machine software. The subsequent two algorithms use novel specializations of methods from disjunctive programming, a branch of combinatorial optimization. The first and third algorithms are of practical importance because they are efficient and scale to large data sets. Empirical results on benchmark data sets from the multi-instance and transductive learning domains are provided. These results demonstrate that, by accounting for ambiguity explicitly, classifier accuracy does not degrade and can improve significantly over techniques that ignore ambiguity. The results also suggest that our approach is best suited for tasks in which one and only one interpretation of each ambiguous example is valid, which is a reasonable assumption when the ambiguity is due to missing labels in the training data.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article

Multi-instance learning with any hypothesis class

Sivan Sabato, +1 more

- 01 Jan 2012

- Journal of Machine Learning Research

TL;DR: This work provides a unified theoretical analysis for MIL, which shows that the sample complexity of MIL is only poly-logarithmically dependent on the size of the bag, for any underlying hypothesis class, regardless of a specific application or problem domain.

...read moreread less

47

Proceedings Article•10.1109/ICDMW.2013.18

Handling Class Overlap and Imbalance to Detect Prompt Situations in Smart Homes

Barnan Das, +2 more

- 07 Dec 2013

TL;DR: The solution, ClusBUS, is a clustering-based under sampling technique that identifies data regions where minority class samples are embedded deep inside majority class, and preprocesses the data in order to give more importance to the minority class during classification.

...read moreread less

42

Journal Article•10.1016/J.KNOSYS.2014.01.008

A multi-instance learning wrapper based on the Rocchio classifier for web index recommendation

Dánel Sánchez Tarragó, +4 more

- 01 Mar 2014

- Knowledge Based Systems

TL;DR: In this article, a multi-instance learning wrapper method using the Rocchio classifier was proposed to recommend web index pages, which is a generalization of the traditional supervised learning where each example is a labeled bag that is composed of unlabeled instances.

...read moreread less

17

Proceedings Article•10.1109/MMSP.2008.4665145

Region-based image categorization with reduced feature set

Gunawan Herman, +3 more

- 05 Nov 2008

TL;DR: The proposed algorithm transforms the MIL problem into a traditional supervised learning problem, and solves it using a standard supervised learning method using the hyperclique patterns, which are a significantly small set of features which are empirically more discriminative than the PCA features.

...read moreread less

12

•Proceedings Article

Homogeneous Multi-Instance Learning with Arbitrary Dependence.

Sivan Sabato, +1 more

- 01 Jan 2009

TL;DR: This work formally defines a new setting which is more relevant for MIL applications than previous theoretic assumptions, and the sample complexity of this setting is shown to be only logarithmically dependent on the size of the bag.

...read moreread less

12