Top 770 papers published in the topic of Unsupervised learning in 2009

Showing papers on "Unsupervised learning published in 2009"

Book•

Learning Deep Architectures for AI

[...]

Yoshua Bengio¹•Institutions (1)

1 Jan 2009

TL;DR: The motivations and principles regarding learning algorithms for deep architectures, in particular those exploiting as building blocks unsupervised learning of single-layer modelssuch as Restricted Boltzmann Machines, used to construct deeper models such as Deep Belief Networks are discussed.

...read moreread less

Abstract: Can machine learning deliver AI? Theoretical results, inspiration from the brain and cognition, as well as machine learning experiments suggest that in order to learn the kind of complicated functions that can represent high-level abstractions (e.g. in vision, language, and other AI-level tasks), one would need deep architectures. Deep architectures are composed of multiple levels of non-linear operations, such as in neural nets with many hidden layers, graphical models with many levels of latent variables, or in complicated propositional formulae re-using many sub-formulae. Each level of the architecture represents features at a different level of abstraction, defined as a composition of lower-level features. Searching the parameter space of deep architectures is a difficult task, but new algorithms have been discovered and a new sub-area has emerged in the machine learning community since 2006, following these discoveries. Learning algorithms such as those for Deep Belief Networks and other related unsupervised learning algorithms have recently been proposed to train deep architectures, yielding exciting results and beating the state-of-the-art in certain areas. Learning Deep Architectures for AI discusses the motivations for and principles of learning algorithms for deep architectures. By analyzing and comparing recent results with different learning algorithms for deep architectures, explanations for their success are proposed and discussed, highlighting challenges and suggesting avenues for future explorations in this area.

...read moreread less

8,546 citations

Proceedings Article•10.1145/1553374.1553453•

Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations

[...]

Honglak Lee¹, Roger Grosse¹, Rajesh Ranganath¹, Andrew Y. Ng¹•Institutions (1)

Stanford University¹

14 Jun 2009

TL;DR: The convolutional deep belief network is presented, a hierarchical generative model which scales to realistic image sizes and is translation-invariant and supports efficient bottom-up and top-down probabilistic inference.

...read moreread less

Abstract: There has been much interest in unsupervised learning of hierarchical generative models such as deep belief networks. Scaling such models to full-sized, high-dimensional images remains a difficult problem. To address this problem, we present the convolutional deep belief network, a hierarchical generative model which scales to realistic image sizes. This model is translation-invariant and supports efficient bottom-up and top-down probabilistic inference. Key to our approach is probabilistic max-pooling, a novel technique which shrinks the representations of higher layers in a probabilistically sound way. Our experiments show that the algorithm learns useful high-level visual features, such as object parts, from unlabeled images of objects and natural scenes. We demonstrate excellent performance on several visual recognition tasks and show that our model can perform hierarchical (bottom-up and top-down) inference over full-sized images.

...read moreread less

2,918 citations

Proceedings Article•10.1109/ICCV.2009.5459469•

What is the best multi-stage architecture for object recognition?

[...]

Kevin Jarrett¹, Koray Kavukcuoglu¹, Marc'Aurelio Ranzato¹, Yann LeCun¹•Institutions (1)

Courant Institute of Mathematical Sciences¹

1 Sep 2009

TL;DR: It is shown that using non-linearities that include rectification and local contrast normalization is the single most important ingredient for good accuracy on object recognition benchmarks and that two stages of feature extraction yield better accuracy than one.

...read moreread less

Abstract: In many recent object recognition systems, feature extraction stages are generally composed of a filter bank, a non-linear transformation, and some sort of feature pooling layer Most systems use only one stage of feature extraction in which the filters are hard-wired, or two stages where the filters in one or both stages are learned in supervised or unsupervised mode This paper addresses three questions: 1 How does the non-linearities that follow the filter banks influence the recognition accuracy? 2 does learning the filter banks in an unsupervised or supervised manner improve the performance over random filters or hardwired filters? 3 Is there any advantage to using an architecture with two stages of feature extraction, rather than one? We show that using non-linearities that include rectification and local contrast normalization is the single most important ingredient for good accuracy on object recognition benchmarks We show that two stages of feature extraction yield better accuracy than one Most surprisingly, we show that a two-stage system with random filters can yield almost 63% recognition rate on Caltech-101, provided that the proper non-linearities and pooling layers are used Finally, we show that with supervised refinement, the system achieves state-of-the-art performance on NORB dataset (56%) and unsupervised pre-training followed by supervised refinement produces good accuracy on Caltech-101 (≫ 65%), and the lowest known error rate on the undistorted, unprocessed MNIST dataset (053%)

...read moreread less

2,847 citations

Book•

Machine Learning: Neural and Statistical Classification

[...]

Donald Michie¹, David Spiegelhalter, Charles C. Taylor², John A. Campbell³•Institutions (3)

University of Edinburgh¹, University of Leeds², University College London³

1 Jan 2009

TL;DR: A survey of previous comparisons and theoretical work descriptions of methods dataset descriptions criteria for comparison and methodology (including validation) empirical results machine learning on machine learning can be found in this article, where the authors also discuss their own work.

...read moreread less

Abstract: Survey of previous comparisons and theoretical work descriptions of methods dataset descriptions criteria for comparison and methodology (including validation) empirical results machine learning on machine learning.

...read moreread less

2,643 citations

Book•

Introduction to Semi-Supervised Learning

[...]

Xiaojin Zhu¹, Andrew Goldberg¹, Ronald Brachman, Thomas G. Dietterich•Institutions (1)

University of Wisconsin-Madison¹

29 Jun 2009

TL;DR: This introductory book presents some popular semi-supervised learning models, including self-training, mixture models, co-training and multiview learning, graph-based methods, and semi- supervised support vector machines, and discusses their basic mathematical formulation.

...read moreread less

Abstract: Semi-supervised learning is a learning paradigm concerned with the study of how computers and natural systems such as humans learn in the presence of both labeled and unlabeled data. Traditionally, learning has been studied either in the unsupervised paradigm (e.g., clustering, outlier detection) where all the data is unlabeled, or in the supervised paradigm (e.g., classification, regression) where all the data is labeled.The goal of semi-supervised learning is to understand how combining labeled and unlabeled data may change the learning behavior, and design algorithms that take advantage of such a combination. Semi-supervised learning is of great interest in machine learning and data mining because it can use readily available unlabeled data to improve supervised learning tasks when the labeled data is scarce or expensive. Semi-supervised learning also shows potential as a quantitative tool to understand human category learning, where most of the input is self-evidently unlabeled. In this introductory book, we present some popular semi-supervised learning models, including self-training, mixture models, co-training and multiview learning, graph-based methods, and semi-supervised support vector machines. For each model, we discuss its basic mathematical formulation. The success of semi-supervised learning depends critically on some underlying assumptions. We emphasize the assumptions made by each model and give counterexamples when appropriate to demonstrate the limitations of the different models. In addition, we discuss semi-supervised learning for cognitive psychology. Finally, we give a computational learning theoretic perspective on semi-supervised learning, and we conclude the book with a brief discussion of open questions in the field.

...read moreread less

2,406 citations

Journal Article•10.5555/1577069.1755839•

Transfer Learning for Reinforcement Learning Domains: A Survey

[...]

Matthew D. Taylor¹, Peter Stone•Institutions (1)

University of Southern California¹

01 Dec 2009-Journal of Machine Learning Research

TL;DR: This article presents a framework that classifies transfer learning methods in terms of their capabilities and goals, and then uses it to survey the existing literature, as well as to suggest future directions for transfer learning work.

...read moreread less

Abstract: The reinforcement learning paradigm is a popular way to address problems that have only limited environmental feedback, rather than correctly labeled examples, as is common in other machine learning contexts. While significant progress has been made to improve learning in a single task, the idea of transfer learning has only recently been applied to reinforcement learning tasks. The core idea of transfer is that experience gained in learning to perform one task can help improve learning performance in a related, but different, task. In this article we present a framework that classifies transfer learning methods in terms of their capabilities and goals, and then use it to survey the existing literature, as well as to suggest future directions for transfer learning work.

...read moreread less

2,023 citations

Journal Article•10.1145/1577069.1577070•

Exploring Strategies for Training Deep Neural Networks

[...]

Hugo Larochelle, Yoshua Bengio, Jérôme Louradour, Pascal Lamblin

01 Dec 2009-Journal of Machine Learning Research

TL;DR: These experiments confirm the hypothesis that the greedy layer-wise unsupervised training strategy helps the optimization by initializing weights in a region near a good local minimum, but also implicitly acts as a sort of regularization that brings better generalization and encourages internal distributed representations that are high-level abstractions of the input.

...read moreread less

Abstract: Deep multi-layer neural networks have many levels of non-linearities allowing them to compactly represent highly non-linear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization often appears to get stuck in poor solutions. Hinton et al. recently proposed a greedy layer-wise unsupervised learning procedure relying on the training algorithm of restricted Boltzmann machines (RBM) to initialize the parameters of a deep belief network (DBN), a generative model with many layers of hidden causal variables. This was followed by the proposal of another greedy layer-wise procedure, relying on the usage of autoassociator networks. In the context of the above optimization problem, we study these algorithms empirically to better understand their success. Our experiments confirm the hypothesis that the greedy layer-wise unsupervised training strategy helps the optimization by initializing weights in a region near a good local minimum, but also implicitly acts as a sort of regularization that brings better generalization and encourages internal distributed representations that are high-level abstractions of the input. We also present a series of experiments aimed at evaluating the link between the performance of deep neural networks and practical aspects of their topology, for example, demonstrating cases where the addition of more depth helps. Finally, we empirically explore simple variants of these training algorithms, such as the use of different RBM input unit distributions, a simple way of combining gradient estimators to improve performance, as well as on-line versions of those algorithms.

...read moreread less

1,320 citations

Journal Article•10.1016/J.JOI.2009.01.003•

Sentiment analysis: A combined approach

[...]

Rudy Prabowo¹, Mike Thelwall¹•Institutions (1)

Information Technology University¹

01 Apr 2009-Journal of Informetrics

TL;DR: This paper combines rule-based classification, supervised learning and machine learning into a new combined method, and proposes a semi-automatic, complementary approach in which each classifier can contribute to other classifiers to achieve a good level of effectiveness.

...read moreread less

890 citations

Proceedings Article•10.1145/1553374.1553486•

Large-scale deep unsupervised learning using graphics processors

[...]

Rajat Raina¹, Anand Madhavan¹, Andrew Y. Ng¹•Institutions (1)

Stanford University¹

14 Jun 2009

TL;DR: It is argued that modern graphics processors far surpass the computational capabilities of multicore CPUs, and have the potential to revolutionize the applicability of deep unsupervised learning methods.

...read moreread less

Abstract: The promise of unsupervised learning methods lies in their potential to use vast amounts of unlabeled data to learn complex, highly nonlinear models with millions of free parameters. We consider two well-known unsupervised learning models, deep belief networks (DBNs) and sparse coding, that have recently been applied to a flurry of machine learning applications (Hinton & Salakhutdinov, 2006; Raina et al., 2007). Unfortunately, current learning algorithms for both models are too slow for large-scale applications, forcing researchers to focus on smaller-scale models, or to use fewer training examples.In this paper, we suggest massively parallel methods to help resolve these problems. We argue that modern graphics processors far surpass the computational capabilities of multicore CPUs, and have the potential to revolutionize the applicability of deep unsupervised learning methods. We develop general principles for massively parallelizing unsupervised learning tasks using graphics processors. We show that these principles can be applied to successfully scaling up learning algorithms for both DBNs and sparse coding. Our implementation of DBN learning is up to 70 times faster than a dual-core CPU implementation for large models. For example, we are able to reduce the time required to learn a four-layer DBN with 100 million free parameters from several weeks to around a single day. For sparse coding, we develop a simple, inherently parallel algorithm, that leads to a 5 to 15-fold speedup over previous methods.

...read moreread less

865 citations

Journal Article•10.1109/TPAMI.2008.87•

Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models

[...]

Xiaogang Wang¹, Xiaoxu Ma¹, W.E.L. Grimson¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Mar 2009-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A novel unsupervised learning framework to model activities and interactions in crowded and complicated scenes with many kinds of activities co-occurring, and three hierarchical Bayesian models are proposed that advance existing language models, such as LDA and HDP.

...read moreread less

Abstract: We propose a novel unsupervised learning framework to model activities and interactions in crowded and complicated scenes. Hierarchical Bayesian models are used to connect three elements in visual surveillance: low-level visual features, simple "atomic" activities, and interactions. Atomic activities are modeled as distributions over low-level visual features, and multi-agent interactions are modeled as distributions over atomic activities. These models are learnt in an unsupervised way. Given a long video sequence, moving pixels are clustered into different atomic activities and short video clips are clustered into different interactions. In this paper, we propose three hierarchical Bayesian models, Latent Dirichlet Allocation (LDA) mixture model, Hierarchical Dirichlet Process (HDP) mixture model, and Dual Hierarchical Dirichlet Processes (Dual-HDP) model. They advance existing language models, such as LDA [1] and HDP [2]. Our data sets are challenging video sequences from crowded traffic scenes and train station scenes with many kinds of activities co-occurring. Without tracking and human labeling effort, our framework completes many challenging visual surveillance tasks of board interest such as: (1) discovering typical atomic activities and interactions; (2) segmenting long video sequences into different interactions; (3) segmenting motions into different activities; (4) detecting abnormality; and (5) supporting high-level queries on activities and interactions.

...read moreread less

631 citations

Journal Article•10.1162/JOCN.2009.21131•

Neural evidence of statistical learning: Efficient detection of visual regularities without awareness

[...]

Nicholas B. Turk-Browne¹, Brian J. Scholl¹, Marvin M. Chun¹, Marcia K. Johnson¹•Institutions (1)

Yale University¹

21 Aug 2009-Journal of Cognitive Neuroscience

TL;DR: Evidence of learning emerged early during familiarization, showing that statistical learning can operate very quickly and with little exposure, and the findings help elucidate the underlying nature of statistical learning.

...read moreread less

Abstract: Our environment contains regularities distributed in space and time that can be detected by way of statistical learning. This unsupervised learning occurs without intent or awareness, but little is known about how it relates to other types of learning, how it affects perceptual processing, and how quickly it can occur. Here we use fMRI during statistical learning to explore these questions. Participants viewed statistically structured versus unstructured sequences of shapes while performing a task unrelated to the structure. Robust neural responses to statistical structure were observed, and these responses were notable in four ways: First, responses to structure were observed in the striatum and medial temporal lobe, suggesting that statistical learning may be related to other forms of associative learning and relational memory. Second, statistical regularities yielded greater activation in category-specific visual regions (object-selective lateral occipital cortex and word-selective ventral occipito-temporal cortex), demonstrating that these regions are sensitive to information distributed in time. Third, evidence of learning emerged early during familiarization, showing that statistical learning can operate very quickly and with little exposure. Finally, neural signatures of learning were dissociable from subsequent explicit familiarity, suggesting that learning can occur in the absence of awareness. Overall, our findings help elucidate the underlying nature of statistical learning.

...read moreread less

Proceedings Article•10.1145/1553374.1553469•

Deep learning from temporal coherence in video

[...]

Hossein Mobahi¹, Ronan Collobert², Jason Weston²•Institutions (2)

University of Illinois at Urbana–Champaign¹, Princeton University²

14 Jun 2009

TL;DR: A learning method for deep architectures that takes advantage of sequential data, in particular from the temporal coherence that naturally exists in unlabeled video recordings, and is used to improve the performance on a supervised task of interest.

...read moreread less

Abstract: This work proposes a learning method for deep architectures that takes advantage of sequential data, in particular from the temporal coherence that naturally exists in unlabeled video recordings. That is, two successive frames are likely to contain the same object or objects. This coherence is used as a supervisory signal over the unlabeled data, and is used to improve the performance on a supervised task of interest. We demonstrate the effectiveness of this method on some pose invariant object and face recognition tasks.

...read moreread less

Journal Article•10.1186/1743-0003-6-5•

Robotic neurorehabilitation: a computational motor learning perspective

[...]

Vincent S. Huang¹, John W. Krakauer¹•Institutions (1)

Columbia University¹

25 Feb 2009-Journal of Neuroengineering and Rehabilitation

TL;DR: Robots are particularly suitable for both rigorous testing and application of motor learning principles to neurorehabilitation and are considered as a general learning problem from the perspective of theoretical learning frameworks such as supervised and unsupervised learning.

...read moreread less

Abstract: Conventional neurorehabilitation appears to have little impact on impairment over and above that of spontaneous biological recovery Robotic neurorehabilitation has the potential for a greater impact on impairment due to easy deployment, its applicability across of a wide range of motor impairment, its high measurement reliability, and the capacity to deliver high dosage and high intensity training protocols We first describe current knowledge of the natural history of arm recovery after stroke and of outcome prediction in individual patients Rehabilitation strategies and outcome measures for impairment versus function are compared The topics of dosage, intensity, and time of rehabilitation are then discussed Robots are particularly suitable for both rigorous testing and application of motor learning principles to neurorehabilitation Computational motor control and learning principles derived from studies in healthy subjects are introduced in the context of robotic neurorehabilitation Particular attention is paid to the idea of context, task generalization and training schedule The assumptions that underlie the choice of both movement trajectory programmed into the robot and the degree of active participation required by subjects are examined We consider rehabilitation as a general learning problem, and examine it from the perspective of theoretical learning frameworks such as supervised and unsupervised learning We discuss the limitations of current robotic neurorehabilitation paradigms and suggest new research directions from the perspective of computational motor learning

...read moreread less

Proceedings Article•10.1109/ASRU.2009.5372931•

Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams

[...]

Yaodong Zhang¹, James Glass¹•Institutions (1)

Vassar College¹

1 Dec 2009

TL;DR: An unsupervised learning framework is presented to address the problem of detecting spoken keywords by using segmental dynamic time warping to compare the Gaussian posteriorgrams between keyword samples and test utterances and obtaining the keyword detection result.

...read moreread less

Abstract: In this paper, we present an unsupervised learning framework to address the problem of detecting spoken keywords. Without any transcription information, a Gaussian Mixture Model is trained to label speech frames with a Gaussian posteriorgram. Given one or more spoken examples of a keyword, we use segmental dynamic time warping to compare the Gaussian posteriorgrams between keyword samples and test utterances. The keyword detection result is then obtained by ranking the distortion scores of all the test utterances. We examine the TIMIT corpus as a development set to tune the parameters in our system, and the MIT Lecture corpus for more substantial evaluation. The results demonstrate the viability and effectiveness of our unsupervised learning framework on the keyword spotting task.

...read moreread less

Journal Article•10.1109/TPAMI.2008.235•

SemiBoost: Boosting for Semi-Supervised Learning

[...]

Pavan Kumar Mallapragada¹, Rong Jin¹, Anil K. Jain¹, Yi Liu¹•Institutions (1)

Michigan State University¹

01 Nov 2009-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A boosting framework for semi-supervised learning, termed as SemiBoost, that improves the performance of several commonly used supervised learning algorithms, given a large number of unlabeled examples and is comparable to the state-of-the-art semi- supervised learning algorithms.

...read moreread less

Abstract: Semi-supervised learning has attracted a significant amount of attention in pattern recognition and machine learning. Most previous studies have focused on designing special algorithms to effectively exploit the unlabeled data in conjunction with labeled data. Our goal is to improve the classification accuracy of any given supervised learning algorithm by using the available unlabeled examples. We call this as the Semi-supervised improvement problem, to distinguish the proposed approach from the existing approaches. We design a metasemi-supervised learning algorithm that wraps around the underlying supervised algorithm and improves its performance using unlabeled data. This problem is particularly important when we need to train a supervised learning algorithm with a limited number of labeled examples and a multitude of unlabeled examples. We present a boosting framework for semi-supervised learning, termed as SemiBoost. The key advantages of the proposed semi-supervised learning approach are: 1) performance improvement of any supervised learning algorithm with a multitude of unlabeled data, 2) efficient computation by the iterative boosting algorithm, and 3) exploiting both manifold and cluster assumption in training classification models. An empirical study on 16 different data sets and text categorization demonstrates that the proposed framework improves the performance of several commonly used supervised learning algorithms, given a large number of unlabeled examples. We also show that the performance of the proposed algorithm, SemiBoost, is comparable to the state-of-the-art semi-supervised learning algorithms.

...read moreread less

Proceedings Article•10.1109/ICCV.2009.5459342•

A Markov Clustering Topic Model for mining behaviour in video

[...]

Timothy M. Hospedales¹, Shaogang Gong¹, Tao Xiang¹•Institutions (1)

Queen Mary University of London¹

1 Sep 2009

TL;DR: A novel Markov Clustering Topic Model (MCTM) is introduced which builds on existing Dynamic Bayesian Network models and Bayesian topic models, and overcomes their drawbacks on accuracy, robustness and computational efficiency.

...read moreread less

Abstract: This paper addresses the problem of fully automated mining of public space video data. A novel Markov Clustering Topic Model (MCTM) is introduced which builds on existing Dynamic Bayesian Network models (e.g. HMMs) and Bayesian topic models (e.g. Latent Dirichlet Allocation), and overcomes their drawbacks on accuracy, robustness and computational efficiency. Specifically, our model profiles complex dynamic scenes by robustly clustering visual events into activities and these activities into global behaviours, and correlates behaviours over time. A collapsed Gibbs sampler is derived for offline learning with unlabeled training data, and significantly, a new approximation to online Bayesian inference is formulated to enable dynamic scene understanding and behaviour mining in new video data online in real-time. The strength of this model is demonstrated by unsupervised learning of dynamic scene models, mining behaviours and detecting salient events in three complex and crowded public scenes.

...read moreread less

Proceedings Article•

Semi-supervised Learning by Sparse Representation.

[...]

Shuicheng Yan¹, Huan Wang²•Institutions (2)

National University of Singapore¹, Yale University²

1 Jan 2009

TL;DR: This paper proposes a semi-supervised learning framework based on `1 graph to utilize both labeled and unlabeled data for inference on a graph and demonstrates the superiority of this framework over the counterparts based on traditional graphs.

...read moreread less

Abstract: In this paper, we present a novel semi-supervised learning framework based on `1 graph. The `1 graph is motivated by that each datum can be reconstructed by the sparse linear superposition of the training data. The sparse reconstruction coefficients, used to deduce the weights of the directed `1 graph, are derived by solving an `1 optimization problem on sparse representation. Different from conventional graph construction processes which are generally divided into two independent steps, i.e., adjacency searching and weight selection, the graph adjacency structure as well as the graph weights of the `1 graph is derived simultaneously and in a parameter-free manner. Illuminated by the validated discriminating power of sparse representation in [16], we propose a semi-supervised learning framework based on `1 graph to utilize both labeled and unlabeled data for inference on a graph. Extensive experiments on semi-supervised face recognition and image classification demonstrate the superiority of our proposed semi-supervised learning framework based on `1 graph over the counterparts based on traditional graphs.

...read moreread less

A literature survey of active machine learning in the context of natural language processing

[...]

Fredrik Olsson¹•Institutions (1)

Swedish Institute of Computer Science¹

1 Apr 2009

TL;DR: Active learning has been successfully applied to a number of natural language processing tasks, such as, information extraction, named entity recognition, text categorization, part-of-speech tagging, parsing, and word sense disambiguation.

...read moreread less

Abstract: Active learning is a supervised machine learning technique in which the learner is in control of the data used for learning. That control is utilized by the learner to ask an oracle, typically a human with extensive knowledge of the domain at hand, about the classes of the instances for which the model learned so far makes unreliable predictions. The active learning process takes as input a set of labeled examples, as well as a larger set of unlabeled examples, and produces a classifier and a relatively small set of newly labeled data. The overall goal is to create as good a classifier as possible, without having to mark-up and supply the learner with more data than necessary. The learning process aims at keeping the human annotation effort to a minimum, only asking for advice where the training utility of the result of such a query is high. Active learning has been successfully applied to a number of natural language processing tasks, such as, information extraction, named entity recognition, text categorization, part-of-speech tagging, parsing, and word sense disambiguation. This report is a literature survey of active learning from the perspective of natural language processing.

...read moreread less

Journal Article•10.1109/TSMCB.2008.2007630•

Reinforcement Learning Versus Model Predictive Control: A Comparison on a Power System Problem

[...]

Damien Ernst, Mevludin Glavic¹, Florin Capitanescu¹, Louis Wehenkel¹•Institutions (1)

University of Liège¹

1 Apr 2009

TL;DR: This paper compares reinforcement learning with model predictive control in a unified framework and reports experimental results of their application to the synthesis of a controller for a nonlinear and deterministic electrical power oscillations damping problem.

...read moreread less

Abstract: This paper compares reinforcement learning (RL) with model predictive control (MPC) in a unified framework and reports experimental results of their application to the synthesis of a controller for a nonlinear and deterministic electrical power oscillations damping problem. Both families of methods are based on the formulation of the control problem as a discrete-time optimal control problem. The considered MPC approach exploits an analytical model of the system dynamics and cost function and computes open-loop policies by applying an interior-point solver to a minimization problem in which the system dynamics are represented by equality constraints. The considered RL approach infers in a model-free way closed-loop policies from a set of system trajectories and instantaneous cost values by solving a sequence of batch-mode supervised learning problems. The results obtained provide insight into the pros and cons of the two approaches and show that RL may certainly be competitive with MPC even in contexts where a good deterministic system model is available.

...read moreread less

Journal Article•10.1039/B907946G•

Supervised learning with decision tree-based methods in computational and systems biology

[...]

Pierre Geurts¹, Alexandre Irrthum¹, Louis Wehenkel¹•Institutions (1)

University of Liège¹

12 Nov 2009-Molecular BioSystems

TL;DR: The goal of this paper is to provide an accessible and comprehensive introduction to decision tree-based methods and a survey of their applications in the context of computational and systems biology.

...read moreread less

Abstract: At the intersection between artificial intelligence and statistics, supervised learning allows algorithms to automatically build predictive models from just observations of a system. During the last twenty years, supervised learning has been a tool of choice to analyze the always increasing and complexifying data generated in the context of molecular biology, with successful applications in genome annotation, function prediction, or biomarker discovery. Among supervised learning methods, decision tree-based methods stand out as non parametric methods that have the unique feature of combining interpretability, efficiency, and, when used in ensembles of trees, excellent accuracy. The goal of this paper is to provide an accessible and comprehensive introduction to this class of methods. The first part of the review is devoted to an intuitive but complete description of decision tree-based methods and a discussion of their strengths and limitations with respect to other supervised learning methods. The second part of the review provides a survey of their applications in the context of computational and systems biology.

...read moreread less

Journal Article•10.1016/J.NEUCOM.2008.10.017•

Rule induction for forecasting method selection: Meta-learning the characteristics of univariate time series

[...]

Xiaozhe Wang¹, Kate Smith-Miles², Rob J. Hyndman²•Institutions (2)

La Trobe University¹, Monash University²

01 Jun 2009-Neurocomputing

TL;DR: A novel approach for selecting a forecasting method for univariate time series based on measurable data characteristics is presented that combines elements of data mining, meta-learning, clustering, classification and statistical measurement.

...read moreread less

Book Chapter•10.1007/978-3-642-04174-7_3•

Active Learning for Reward Estimation in Inverse Reinforcement Learning

[...]

Manuel Lopes¹, Francisco S. Melo², Luis Montesano³•Institutions (3)

Instituto Superior Técnico¹, Carnegie Mellon University², University of Zaragoza³

27 Aug 2009

TL;DR: An algorithm is proposed that allows the agent to query the demonstrator for samples at specific states, instead of relying only on samples provided at "arbitrary" states, to estimate the reward function with similar accuracy as other methods from the literature while reducing the amount of policy samples required from the expert.

...read moreread less

Abstract: Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, we introduce active learning for inverse reinforcement learning. We propose an algorithm that allows the agent to query the demonstrator for samples at specific states, instead of relying only on samples provided at "arbitrary" states. The purpose of our algorithm is to estimate the reward function with similar accuracy as other methods from the literature while reducing the amount of policy samples required from the expert. We also discuss the use of our algorithm in higher dimensional problems, using both Monte Carlo and gradient methods. We present illustrative results of our algorithm in several simulated examples of different complexities.

...read moreread less

Challenge-Based Learning: An Approach for Our Time

[...]

Laurence F. Johnson, Rachel Smith, J. Troy Smythe, Rachel Varon

1 Jan 2009

TL;DR: The most recent study of global math and science performance shows US students making some gains in the last four years, with fourth graders moving from 12th to 11th place, and eighth graders in from 15th to 9th place in math results, but what the rankings do not show is that that is largely due to erosions in performance around the world, not in the US making great strides.

...read moreread less

Abstract: Permission is granted under a Creative Commons Attribution-NoDerivs license to replicate and distribute this report freely provided that it is distributed only in its entirety. To view a copy of this license, visit creativecommons. org/licenses/by-nd/3. 0/ or send a letter to Public education in America is in trouble. We've known about it for more than 25 years now, since the publication of A Nation at Risk in 1983, and despite billions of dollars of investment and massive reform projects like No Child Left Behind (NCLB), we still find that three of ten kids drop out of school without a diploma. 1 Each year the US sees its children do worse in math and science than countries such as Kazakhstan, Latvia, and Lithuania. The most recent study of global math and science performance 2 shows US students making some gains in the last four years, with fourth graders moving from 12th to 11th place, and eighth graders in from 15th to 9th place in math results, but what the rankings do not show is that that is largely due to erosions in performance around the world, not in the US making great strides. In fact, there is no significant difference in science performance among US students in the last four years at all. At the same time, the world has never had a greater urgency in ensuring that our children are equipped to tackle the serious challenges that lay before them. The world, to a teenager, is a place rife with serious issues — a global financial meltdown, planetary warming, dependence on fossil fuels, wars on two continents. When polled, dropouts report that they leave school because it has no relevance in their lives. Employers sponsor study after study documenting the skills the American workforce needs to stay competitive in a global marketplace, yet increasingly employers are left looking overseas for those skills, as US schools are by and large not cultivating them. It is not that we don't know we have a problem. It is not that plenty of good people are not working on the challenges. And we are not alone. Most of the industrial world is experiencing many of the same issues. We have seen some gains in the quarter century this problem has been in the public eye, but they have not been nearly enough. We need to think differently. What if we focused our energy not …

...read moreread less

Journal Article•10.2174/138620709788167980•

Machine learning in virtual screening.

[...]

James L. Melville¹, Edmund K. Burke, Jonathan D. Hirst¹•Institutions (1)

University of Nottingham¹

01 May 2009-Combinatorial Chemistry & High Throughput Screening

TL;DR: This review highlights recent applications of machine learning to virtual screening, focusing on the use of supervised techniques to train statistical learning algorithms to prioritize databases of molecules as active against a particular protein target.

...read moreread less

Abstract: In this review, we highlight recent applications of machine learning to virtual screening, focusing on the use of supervised techniques to train statistical learning algorithms to prioritize databases of molecules as active against a particular protein target. Both ligand-based similarity searching and structure-based docking have benefited from machine learning algorithms, including naive Bayesian classifiers, support vector machines, neural networks, and decision trees, as well as more traditional regression techniques. Effective application of these methodologies requires an appreciation of data preparation, validation, optimization, and search methodologies, and we also survey developments in these areas.

...read moreread less

Book Chapter•10.1007/978-3-642-04747-3_5•

The Two Faces of Active Learning

[...]

Sanjoy Dasgupta¹•Institutions (1)

University of California, San Diego¹

7 Oct 2009

TL;DR: The active learning model is motivated by scenarios in which it is easy to amass vast quantities of unlabeled data but costly to obtain their labels, and thereby to obtain an accurate classifier at significantly lower cost than regular supervised learning.

...read moreread less

Abstract: The active learning model is motivated by scenarios in which it is easy to amass vast quantities of unlabeled data (images and videos off the web, speech signals from microphone recordings, and so on) but costly to obtain their labels. Like supervised learning, the goal is ultimately to learn a classifier. But like unsupervised learning, the data come unlabeled. More precisely, the labels are hidden, and each of them can be revealed only at a cost. The idea is to query the labels of just a few points that are especially informative about the decision boundary, and thereby to obtain an accurate classifier at significantly lower cost than regular supervised learning.

...read moreread less

Journal Article•10.1007/S11036-008-0139-0•

Unsupervised Learning for Solving RSS Hardware Variance Problem in WiFi Localization

[...]

Arvin Wen Tsui¹, Yu-Hsiang Chuang¹, Hao-Hua Chu²•Institutions (2)

Industrial Technology Research Institute¹, National Taiwan University²

01 Oct 2009-Mobile Networks and Applications

TL;DR: Experimental results demonstrate that the proposed learning method improves positional accuracy within 100 s of learning time, and was designed and implemented in a working WiFi positioning system and evaluated using different WiFi devices with diverse RSS signal patterns.

...read moreread less

Abstract: Hardware variance can significantly degrade the positional accuracy of RSS-based WiFi localization systems. Although manual adjustment can reduce positional error, this solution is not scalable as the number of new WiFi devices increases. We propose an unsupervised learning method to automatically solve the hardware variance problem in WiFi localization. This method was designed and implemented in a working WiFi positioning system and evaluated using different WiFi devices with diverse RSS signal patterns. Experimental results demonstrate that the proposed learning method improves positional accuracy within 100 s of learning time.

...read moreread less

Journal Article•10.1162/NECO.2009.05-08-785•

A binary variable model for affinity propagation

[...]

Inmar E. Givoni¹, Brendan J. Frey¹•Institutions (1)

University of Toronto¹

01 Jun 2009-Neural Computation

TL;DR: A derivation of AP that is much simpler than the original one and is based on a quite different graphical model is presented, which allows easy derivations of message updates for extensions and modifications of the standard AP algorithm.

...read moreread less

Abstract: Affinity propagation (AP) was recently introduced as an unsupervised learning algorithm for exemplar-based clustering. We present a derivation of AP that is much simpler than the original one and is based on a quite different graphical model. The new model allows easy derivations of message updates for extensions and modifications of the standard AP algorithm. We demonstrate this by adjusting the new AP model to represent the capacitated clustering problem. For those wishing to investigate or extend the graphical model of the AP algorithm, we suggest using this new formulation since it allows a simpler and more intuitive model manipulation.

...read moreread less

Journal Article•10.1109/TNN.2008.2010620•

Maximum Margin Clustering Made Practical

[...]

Kai Zhang¹, Ivor W. Tsang², James T. Kwok³•Institutions (3)

Lawrence Berkeley National Laboratory¹, Nanyang Technological University², Hong Kong University of Science and Technology³

01 Apr 2009-IEEE Transactions on Neural Networks

TL;DR: Experiments on a number of synthetic and real-world data sets demonstrate that the proposed approach is more accurate, much faster, and can handle data sets that are hundreds of times larger than the largest data set reported in the MMC literature.

...read moreread less

Abstract: Motivated by the success of large margin methods in supervised learning, maximum margin clustering (MMC) is a recent approach that aims at extending large margin methods to unsupervised learning. However, its optimization problem is nonconvex and existing MMC methods all rely on reformulating and relaxing the nonconvex optimization problem as semidefinite programs (SDP). Though SDP is convex and standard solvers are available, they are computationally very expensive and only small data sets can be handled. To make MMC more practical, we avoid SDP relaxations and propose in this paper an efficient approach that performs alternating optimization directly on the original nonconvex problem. A key step to avoid premature convergence in the resultant iterative procedure is to change the loss function from the hinge loss to the Laplacian/square loss so that overconfident predictions are penalized. Experiments on a number of synthetic and real-world data sets demonstrate that the proposed approach is more accurate, much faster (hundreds to tens of thousands of times faster), and can handle data sets that are hundreds of times larger than the largest data set reported in the MMC literature.

...read moreread less

Proceedings Article•10.1109/IFITA.2009.34•

The Application on Intrusion Detection Based on K-means Cluster Algorithm

[...]

Meng Jianliang, Shang Haikun, Bian Ling

15 May 2009

TL;DR: Computer simulations show that the K-means algorithm is used to cluster and analyze the data and can detect unknown intrusions efficiently in the real network connections.

...read moreread less

Abstract: Internet security has been one of the most important problems in the world. Anomaly detection is the basic method to defend new attack in Intrusion Detection. Network intrusion detection is the process of monitoring the events occurring in a computing system or network and analyzing them for signs of intrusions, defined as attempts to compromise the confidentiality. A wide variety of data mining techniques have been applied to intrusion detections. In data mining, clustering is the most important unsupervised learning process used to find the structures or patterns in a collection of unlabeled data. We use the K-means algorithm to cluster and analyze the data in this paper. Computer simulations show that this method can detect unknown intrusions efficiently in the real network connections.

...read moreread less

Proceedings Article•

Multi-Manifold Semi-Supervised Learning

[...]

Andrew Goldberg¹, Xiaojin Zhu¹, Aarti Singh², Zhiting Xu³, Robert Nowak² - Show less +1 more•Institutions (3)

Carnegie Mellon University¹, University of Wisconsin-Madison², Princeton University³

15 Apr 2009

TL;DR: A semi-supervised learning algorithm is proposed that separates different manifolds into decision sets, and performs supervised learning within each set, and involves a novel application of Hellinger distance and size-constrained spectral clustering.

...read moreread less

Abstract: We study semi-supervised learning when the data consists of multiple intersecting manifolds. We give a finite sample analysis to quantify the potential gain of using unlabeled data in this multi-manifold setting. We then propose a semi-supervised learning algorithm that separates different manifolds into decision sets, and performs supervised learning within each set. Our algorithm involves a novel application of Hellinger distance and size-constrained spectral clustering. Experiments demonstrate the benefit of our multimanifold semi-supervised learning approach.

...read moreread less

...

Expand