Open Access
Learning from ambiguous examples
Thomas Hofmann,Stuart Andrews +1 more
- 01 Jan 2007
15
TL;DR: This thesis demonstrates that the burden of collecting a large number of examples may be supplanted with algorithms that learn from readily available inputs that are ambiguous and suggests that the approach is best suited for tasks in which one and only one interpretation of each ambiguous example is valid, which is a reasonable assumption when the ambiguity is due to missing labels in the training data.
read more
Abstract: The main drawback of the supervised learning approach to solving pattern classification problems is that the initial instance-label pairs are often expensive to collect due to required human effort or comprehensive testing. In many applications however, it is evidently more practical and sometimes essential to collect training examples that are ambiguous due to polymorphism or missing labels. Since these ambiguous examples have a small number of interpretations as instance-label pairs, they are still informative. It is therefore of great interest to practitioners and machine learning researchers alike to develop principled methods that can utilize such examples. This thesis demonstrates that the burden of collecting a large number of examples may be supplanted with algorithms that learn from readily available inputs that are ambiguous. This thesis presents a statistical learning theoretic framework for learning from ambiguous examples that is based on a novel formalization of disambiguation consistency. Intuitively, a valid interpretation and concept hypothesis must be mutually reinforcing. Using this principle, disambiguation and learning are jointly formulated as a non-convex maximum-margin problem. The first presented algorithmic approach for solving the disambiguation and learning problem uses a 2-stage mixed-integer local search technique that leverages state-of-the-art support vector machine software. The subsequent two algorithms use novel specializations of methods from disjunctive programming, a branch of combinatorial optimization. The first and third algorithms are of practical importance because they are efficient and scale to large data sets. Empirical results on benchmark data sets from the multi-instance and transductive learning domains are provided. These results demonstrate that, by accounting for ambiguity explicitly, classifier accuracy does not degrade and can improve significantly over techniques that ignore ambiguity. The results also suggest that our approach is best suited for tasks in which one and only one interpretation of each ambiguous example is valid, which is a reasonable assumption when the ambiguity is due to missing labels in the training data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Journal Article
Multi-instance learning with any hypothesis class
Sivan Sabato,Naftali Tishby +1 more
TL;DR: This work provides a unified theoretical analysis for MIL, which shows that the sample complexity of MIL is only poly-logarithmically dependent on the size of the bag, for any underlying hypothesis class, regardless of a specific application or problem domain.
Handling Class Overlap and Imbalance to Detect Prompt Situations in Smart Homes
Barnan Das,Narayanan C. Krishnan,Diane J. Cook +2 more
- 07 Dec 2013
TL;DR: The solution, ClusBUS, is a clustering-based under sampling technique that identifies data regions where minority class samples are embedded deep inside majority class, and preprocesses the data in order to give more importance to the minority class during classification.
42
A multi-instance learning wrapper based on the Rocchio classifier for web index recommendation
TL;DR: In this article, a multi-instance learning wrapper method using the Rocchio classifier was proposed to recommend web index pages, which is a generalization of the traditional supervised learning where each example is a labeled bag that is composed of unlabeled instances.
Region-based image categorization with reduced feature set
Gunawan Herman,Getian Ye,Jie Xu,Bang Zhang +3 more
- 05 Nov 2008
TL;DR: The proposed algorithm transforms the MIL problem into a traditional supervised learning problem, and solves it using a standard supervised learning method using the hyperclique patterns, which are a significantly small set of features which are empirically more discriminative than the PCA features.
12
•Proceedings Article
Homogeneous Multi-Instance Learning with Arbitrary Dependence.
Sivan Sabato,Naftali Tishby +1 more
- 01 Jan 2009
TL;DR: This work formally defines a new setting which is more relevant for MIL applications than previous theoretic assumptions, and the sample complexity of this setting is shown to be only logarithmically dependent on the size of the bag.
Related Papers (5)
Qi Zhang,Sally A. Goldman +1 more
- 03 Jan 2001
Oded Maron,Aparna Lakshmi Ratan +1 more
- 24 Jul 1998
Oded Maron,Tomás Lozano-Pérez +1 more
- 01 Dec 1997