Top 911 papers published in the topic of Unsupervised learning in 2014

Showing papers on "Unsupervised learning published in 2014"

Book•

Deep Learning: Methods and Applications

[...]

Li Deng¹, Dong Yu¹•Institutions (1)

12 Jun 2014

TL;DR: This monograph provides an overview of general deep learning methodology and its applications to a variety of signal and information processing tasks, including natural language and text processing, information retrieval, and multimodal information processing empowered by multi-task deep learning.

...read moreread less

Abstract: This monograph provides an overview of general deep learning methodology and its applications to a variety of signal and information processing tasks. The application areas are chosen with the following three criteria in mind: (1) expertise or knowledge of the authors; (2) the application areas that have already been transformed by the successful use of deep learning technology, such as speech recognition and computer vision; and (3) the application areas that have the potential to be impacted significantly by deep learning and that have been experiencing research growth, including natural language and text processing, information retrieval, and multimodal information processing empowered by multi-task deep learning.

...read moreread less

3,392 citations

Journal Article•10.3389/FNINF.2014.00014•

Machine learning for neuroimaging with scikit-learn.

[...]

Alexandre Abraham¹, Alexandre Abraham², Fabian Pedregosa², Fabian Pedregosa¹, Michael Eickenberg¹, Michael Eickenberg², Philippe Gervais¹, Philippe Gervais², Andreas Mueller³, Jean Kossaifi⁴, Alexandre Gramfort⁵, Alexandre Gramfort¹, Alexandre Gramfort², Bertrand Thirion², Bertrand Thirion¹, Gaël Varoquaux¹, Gaël Varoquaux² - Show less +13 more•Institutions (5)

IBM¹, École Polytechnique², University of Bonn³, Imperial College London⁴, Institut Mines-Télécom⁵

21 Feb 2014-Frontiers in Neuroinformatics

TL;DR: It is illustrated how scikit-learn, a Python machine learning library, can be used to perform some key analysis steps and its application to neuroimaging data provides a versatile tool to study the brain.

...read moreread less

Abstract: Statistical machine learning methods are increasingly used for neuroimaging data analysis. Their main virtue is their ability to model high-dimensional datasets, e.g. multivariate analysis of activation images or resting-state time series. Supervised learning is typically used in decoding or encoding settings to relate brain images to behavioral or clinical observations, while unsupervised learning can uncover hidden structures in sets of images (e.g. resting state functional MRI) or find sub-populations in large cohorts. By considering different functional neuroimaging applications, we illustrate how scikit-learn, a Python machine learning library, can be used to perform some key analysis steps. Scikit-learn contains a very large set of statistical learning algorithms, both supervised and unsupervised, and its application to neuroimaging data provides a versatile tool to study the brain.

...read moreread less

2,275 citations

Proceedings Article•10.1145/2623330.2623732•

DeepWalk: Online Learning of Social Representations

[...]

Bryan Perozzi¹, Rami Al-Rfou¹, Steven Skiena¹•Institutions (1)

Stony Brook University¹

26 Mar 2014-arXiv: Social and Information Networks

TL;DR: DeepWalk is an online learning algorithm which builds useful incremental results, and is trivially parallelizable, which make it suitable for a broad class of real world applications such as network classification, and anomaly detection.

...read moreread less

Abstract: We present DeepWalk, a novel approach for learning latent representations of vertices in a network. These latent representations encode social relations in a continuous vector space, which is easily exploited by statistical models. DeepWalk generalizes recent advancements in language modeling and unsupervised feature learning (or deep learning) from sequences of words to graphs. DeepWalk uses local information obtained from truncated random walks to learn latent representations by treating walks as the equivalent of sentences. We demonstrate DeepWalk's latent representations on several multi-label network classification tasks for social networks such as BlogCatalog, Flickr, and YouTube. Our results show that DeepWalk outperforms challenging baselines which are allowed a global view of the network, especially in the presence of missing information. DeepWalk's representations can provide $F_1$ scores up to 10% higher than competing methods when labeled data is sparse. In some experiments, DeepWalk's representations are able to outperform all baseline methods while using 60% less training data. DeepWalk is also scalable. It is an online learning algorithm which builds useful incremental results, and is trivially parallelizable. These qualities make it suitable for a broad class of real world applications such as network classification, and anomaly detection.

...read moreread less

1,629 citations

Journal Article•10.1016/J.PATREC.2014.01.008•

A review of unsupervised feature learning and deep learning for time-series modeling ☆

[...]

Martin Längkvist¹, Lars Karlsson¹, Amy Loutfi¹•Institutions (1)

Örebro University¹

01 Jun 2014-Pattern Recognition Letters

TL;DR: This paper overviews the particular challenges present in time-series data and provides a review of the works that have either applied time- series data to unsupervised feature learning algorithms or alternatively have contributed to modifications of featurelearning algorithms to take into account the challenges present.

...read moreread less

1,344 citations

Posted Content•

Discriminative Unsupervised Feature Learning with Convolutional Neural Networks

[...]

Alexey Dosovitskiy¹, Jost Tobias Springenberg¹, Martin Riedmiller¹, Thomas Brox¹•Institutions (1)

University of Freiburg¹

26 Jun 2014

TL;DR: In this article, a set of surrogate classes are formed by applying a variety of transformations to a randomly sampled'seed' image patch, and the resulting feature representation is not class specific.

...read moreread less

Abstract: Deep convolutional networks have proven to be very successful in learning task specific features that allow for unprecedented performance on various computer vision tasks. Training of such networks follows mostly the supervised learning paradigm, where sufficiently many input-output pairs are required for training. Acquisition of large training sets is one of the key challenges, when approaching a new task. In this paper, we aim for generic feature learning and present an approach for training a convolutional network using only unlabeled data. To this end, we train the network to discriminate between a set of surrogate classes. Each surrogate class is formed by applying a variety of transformations to a randomly sampled 'seed' image patch. In contrast to supervised network training, the resulting feature representation is not class specific. It rather provides robustness to the transformations that have been applied during training. This generic feature representation allows for classification results that outperform the state of the art for unsupervised learning on several popular datasets (STL-10, CIFAR-10, Caltech-101, Caltech-256). While such generic features cannot compete with class specific features from supervised training on a classification task, we show that they are advantageous on geometric matching problems, where they also outperform the SIFT descriptor.

...read moreread less

854 citations

Posted Content•

Deep metric learning using Triplet network

[...]

Elad Hoffer¹, Nir Ailon¹•Institutions (1)

Technion – Israel Institute of Technology¹

20 Dec 2014-arXiv: Learning

TL;DR: In this paper, Wang et al. proposed the triplet network model, which aims to learn useful representations by distance comparisons, and demonstrate using various datasets that their model learns a better representation than that of its immediate competitor, the Siamese network.

...read moreread less

Abstract: Deep learning has proven itself as a successful set of models for learning useful semantic representations of data. These, however, are mostly implicitly learned as part of a classification task. In this paper we propose the triplet network model, which aims to learn useful representations by distance comparisons. A similar model was defined by Wang et al. (2014), tailor made for learning a ranking for image information retrieval. Here we demonstrate using various datasets that our model learns a better representation than that of its immediate competitor, the Siamese network. We also discuss future possible usage as a framework for unsupervised learning.

...read moreread less

825 citations

Journal Article•10.1109/TCYB.2014.2307349•

Semi-Supervised and Unsupervised Extreme Learning Machines

[...]

Gao Huang¹, Shiji Song¹, Jatinder N. D. Gupta², Cheng Wu¹•Institutions (2)

Tsinghua University¹, University of Alabama in Huntsville²

12 Mar 2014-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: It is shown in this paper that all the supervised, semi-supervised, and unsupervised ELMs can actually be put into a unified framework, which provides new perspectives for understanding the mechanism of random feature mapping, which is the key concept in ELM theory.

...read moreread less

Abstract: Extreme learning machines (ELMs) have proven to be efficient and effective learning mechanisms for pattern classification and regression. However, ELMs are primarily applied to supervised learning problems. Only a few existing research papers have used ELMs to explore unlabeled data. In this paper, we extend ELMs for both semi-supervised and unsupervised tasks based on the manifold regularization, thus greatly expanding the applicability of ELMs. The key advantages of the proposed algorithms are as follows: 1) both the semi-supervised ELM (SS-ELM) and the unsupervised ELM (US-ELM) exhibit learning capability and computational efficiency of ELMs; 2) both algorithms naturally handle multiclass classification or multicluster clustering; and 3) both algorithms are inductive and can handle unseen data at test time directly. Moreover, it is shown in this paper that all the supervised, semi-supervised, and unsupervised ELMs can actually be put into a unified framework. This provides new perspectives for understanding the mechanism of random feature mapping, which is the key concept in ELM theory. Empirical study on a wide range of data sets demonstrates that the proposed algorithms are competitive with the state-of-the-art semi-supervised or unsupervised learning algorithms in terms of accuracy and efficiency.

...read moreread less

815 citations

Proceedings Article•

Discriminative Unsupervised Feature Learning with Convolutional Neural Networks

[...]

Alexey Dosovitskiy¹, Jost Tobias Springenberg¹, Martin Riedmiller¹, Thomas Brox¹•Institutions (1)

University of Freiburg¹

8 Dec 2014

TL;DR: This paper presents an approach for training a convolutional neural network using only unlabeled data and trains the network to discriminate between a set of surrogate classes, finding that this simple feature learning algorithm is surprisingly successful when applied to visual object recognition.

...read moreread less

Abstract: Current methods for training convolutional neural networks depend on large amounts of labeled samples for supervised training. In this paper we present an approach for training a convolutional neural network using only unlabeled data. We train the network to discriminate between a set of surrogate classes. Each surrogate class is formed by applying a variety of transformations to a randomly sampled 'seed' image patch. We find that this simple feature learning algorithm is surprisingly successful when applied to visual object recognition. The feature representation learned by our algorithm achieves classification results matching or outperforming the current state-of-the-art for unsupervised learning on several popular datasets (STL-10, CIFAR-10, Caltech-101).

...read moreread less

734 citations

Journal Article•10.3389/FNINS.2014.00229•

Deep learning for neuroimaging: A validation study

[...]

Sergey M. Plis¹, R Devon Hjelm², Ruslan Salakhutdinov³, Elena A. Allen⁴, Elena A. Allen¹, HJ Bockholt⁵, Jeffrey D. Long⁶, Jeffrey D. Long⁵, Hans J. Johnson⁵, Hans J. Johnson⁶, Jane S. Paulsen⁵, Jane S. Paulsen⁶, Jessica A. Turner⁷, Vince D. Calhoun², Vince D. Calhoun¹ - Show less +11 more•Institutions (7)

The Mind Research Network¹, University of New Mexico², University of Toronto³, University of Bergen⁴, University of Iowa⁵, Roy J. and Lucille A. Carver College of Medicine⁶, Georgia State University⁷

20 Aug 2014-Frontiers in Neuroscience

TL;DR: In this article, a constraint-based approach to visualizing high dimensional data was proposed to analyze the effect of parameter choices on data transformations and showed that deep learning methods are able to learn physiologically important representations and detect latent relations in neuroimaging data.

...read moreread less

Abstract: Deep learning methods have recently made notable advances in the tasks of classification and representation learning. These tasks are important for brain imaging and neuroscience discovery, making the methods attractive for porting to a neuroimager's toolbox. Success of these methods is, in part, explained by the flexibility of deep learning models. However, this flexibility makes the process of porting to new areas a difficult parameter optimization problem. In this work we demonstrate our results (and feasible parameter ranges) in application of deep learning methods to structural and functional brain imaging data. These methods include deep belief networks and their building block the restricted Boltzmann machine. We also describe a novel constraint-based approach to visualizing high dimensional data. We use it to analyze the effect of parameter choices on data transformations. Our results show that deep learning methods are able to learn physiologically important representations and detect latent relations in neuroimaging data.

...read moreread less

564 citations

Journal Article•10.1109/TPAMI.2014.2307881•

Active Learning by Querying Informative and Representative Examples

[...]

Sheng-Jun Huang¹, Rong Jin², Zhi-Hua Zhou¹•Institutions (2)

Nanjing University¹, Michigan State University²

24 Feb 2014-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: In this paper, the min-max view of active learning is used to measure and combine the informativeness and representativeness of an unlabeled instance, and the QUIRE approach is extended to multi-label learning by actively querying instance-label pairs.

...read moreread less

Abstract: Active learning reduces the labeling cost by iteratively selecting the most valuable data to query their labels. It has attracted a lot of interests given the abundance of unlabeled data and the high cost of labeling. Most active learning approaches select either informative or representative unlabeled instances to query their labels, which could significantly limit their performance. Although several active learning algorithms were proposed to combine the two query selection criteria, they are usually ad hoc in finding unlabeled instances that are both informative and representative. We address this limitation by developing a principled approach, termed QUIRE, based on the min-max view of active learning. The proposed approach provides a systematic way for measuring and combining the informativeness and representativeness of an unlabeled instance. Further, by incorporating the correlation among labels, we extend the QUIRE approach to multi-label learning by actively querying instance-label pairs. Extensive experimental results show that the proposed QUIRE approach outperforms several state-of-the-art active learning approaches in both single-label and multi-label learning.

...read moreread less

457 citations

Book•

Quantum Machine Learning: What Quantum Computing Means to Data Mining

[...]

Peter Wittek

28 Aug 2014

TL;DR: Quantum Machine Learning bridges the gap between abstract developments in quantum computing and the applied research on machine learning by paring down the complexity of the disciplines involved.

...read moreread less

Abstract: Quantum Machine Learning bridges the gap between abstract developments in quantum computing and the applied research on machine learning. Paring down the complexity of the disciplines involved, it ...

...read moreread less

Proceedings Article•10.1109/ICASSP.2014.6853873•

Deep learning of feature representation with multiple instance learning for medical image analysis

[...]

Yan Xu¹, Tao Mo², Qiwei Feng², Peilin Zhong², Maode Lai³, Eric Chang² - Show less +2 more•Institutions (3)

Beihang University¹, Microsoft², Zhejiang University³

4 May 2014

TL;DR: In this article, the authors used multiple instance learning (MIL) framework in classification training with deep learning features and found that automatic feature learning outperformed manual feature learning and achieved performance that's close to fully supervised approach.

...read moreread less

Abstract: This paper studies the effectiveness of accomplishing high-level tasks with a minimum of manual annotation and good feature representations for medical images. In medical image analysis, objects like cells are characterized by significant clinical features. Previously developed features like SIFT and HARR are unable to comprehensively represent such objects. Therefore, feature representation is especially important. In this paper, we study automatic extraction of feature representation through deep learning (DNN). Furthermore, detailed annotation of objects is often an ambiguous and challenging task. We use multiple instance learning (MIL) framework in classification training with deep learning features. Several interesting conclusions can be drawn from our work: (1) automatic feature learning outperforms manual feature; (2) the unsupervised approach can achieve performance that's close to fully supervised approach (93.56%) vs. (94.52%); and (3) the MIL performance of coarse label (96.30%) outweighs the supervised performance of fine label (95.40%) in supervised deep learning features.

...read moreread less

Journal Article•10.1080/09332480.2014.914768•

Machine Learning, a Probabilistic Perspective

[...]

Christian P. Robert

23 Apr 2014-Chance

TL;DR: Overall, the chapter on Bayesian inference does not spend much time on prior specification, and both Chib’s method and the Savage-Dickey density ratio are suggested for the approximation of marginal likelihoods.

...read moreread less

Abstract: Kevin P. MurphyHardcover: 1104 pagesYear: 2012Publisher: The MIT PressISBN-13: 978-0262018029I have to admit the rather embarrassing fact that Machine Learning, a Probabilistic Perspective is the f...

...read moreread less

Proceedings Article•

Analysis of Learning from Positive and Unlabeled Data

[...]

Marthinus Christoffel du Plessis¹, Gang Niu², Masashi Sugiyama¹•Institutions (2)

University of Tokyo¹, Baidu²

8 Dec 2014

TL;DR: This paper first shows that this problem can be solved by cost-sensitive learning between positive and unlabeled data, and shows that convex surrogate loss functions such as the hinge loss may lead to a wrong classification boundary due to an intrinsic bias, but this can be avoided by using non-convex loss functionssuch as the ramp loss.

...read moreread less

Abstract: Learning a classifier from positive and unlabeled data is an important class of classification problems that are conceivable in many practical applications. In this paper, we first show that this problem can be solved by cost-sensitive learning between positive and unlabeled data. We then show that convex surrogate loss functions such as the hinge loss may lead to a wrong classification boundary due to an intrinsic bias, but the problem can be avoided by using non-convex loss functions such as the ramp loss. We next analyze the excess risk when the class prior is estimated from data, and show that the classification accuracy is not sensitive to class prior estimation if the unlabeled data is dominated by the positive data (this is naturally satisfied in inlier-based outlier detection because inliers are dominant in the unlabeled dataset). Finally, we provide generalization error bounds and show that, for an equal number of labeled and unlabeled samples, the generalization error of learning only from positive and unlabeled samples is no worse than 2√2 times the fully supervised case. These theoretical findings are also validated through experiments.

...read moreread less

Journal Article•10.1002/WICS.1317•

Matching and record linkage

[...]

William E. Winkler¹•Institutions (1)

United States Census Bureau¹

01 Sep 2014-Wiley Interdisciplinary Reviews: Computational Statistics

TL;DR: This overview gives background on a number of statistical methods that have been proven effective for record linkage and describes ongoing research for adjusting standard statistical analyses for linkage error.

...read moreread less

Abstract: This overview gives background on a number of statistical methods that have been proven effective for record linkage. To prepare data for the main computational algorithms, we need parsing/standardization that allows us to structure the free-form names, addresses, and other fields into corresponding components. The main parameter-estimation methods are unsupervised methods that yield 'optimal' record linkage parameters. Extended methods provide estimates of false match rates in both unsupervised and, with greater accuracy, in semi-supervised situations. Finally, the paper describes ongoing research for adjusting standard statistical analyses for linkage error. WIREs Comput Stat 2014, 6:313-325. doi: 10.1002/wics.1317

...read moreread less

Journal Article•10.1016/J.PATREC.2013.06.010•

A bagging SVM to learn from positive and unlabeled examples

[...]

Fantine Mordelet¹, Jean-Philippe Vert², Jean-Philippe Vert³, Jean-Philippe Vert⁴•Institutions (4)

Duke University¹, Curie Institute², Mines ParisTech³, French Institute of Health and Medical Research⁴

01 Feb 2014-Pattern Recognition Letters

TL;DR: It is shown theoretically and experimentally that the proposed method can match and even outperform the performance of state-of-the-art methods for PU learning, particularly when the number of positive examples is limited and the fraction of negatives among the unlabeled examples is small.

...read moreread less

Journal Article•10.1016/J.NEURON.2013.11.030•

Learning by the Dendritic Prediction of Somatic Spiking

[...]

Robert Urbanczik¹, Walter Senn¹•Institutions (1)

University of Bern¹

05 Feb 2014-Neuron

TL;DR: A simple compartmental neuron model is presented together with a non-Hebbian, biologically plausible learning rule for dendritic synapses where plasticity is modulated by these three factors, and a single plasticity rule supports diverse learning paradigms.

...read moreread less

Book•

Theory of Disagreement-Based Active Learning

[...]

Steve Hanneke

30 May 2014

TL;DR: Recent advances in the understanding of the theoretical benefits of active learning are described, and implications for the design of effective active learning algorithms are described.

...read moreread less

Abstract: Active learning is a protocol for supervised machine learning, in which a learning algorithm sequentially requests the labels of selected data points from a large pool of unlabeled data. This contrasts with passive learning, where the labeled data are taken at random. The objective in active learning is to produce a highly-accurate classifier, ideally using fewer labels than the number of random labeled data sufficient for passive learning to achieve the same. This article describes recent advances in our understanding of the theoretical benefits of active learning, and implications for the design of effective active learning algorithms. Much of the article focuses on a particular technique, namely disagreement-based active learning, which by now has amassed a mature and coherent literature. It also briefly surveys several alternative approaches from the literature. The emphasis is on theorems regarding the performance of a few general algorithms, including rigorous proofs where appropriate. However, the presentation is intended to be pedagogical, focusing on results that illustrate fundamental ideas, rather than obtaining the strongest or most general known theorems. The intended audience includes researchers and advanced graduate students in machine learning and statistics, interested in gaining a deeper understanding of the recent and ongoing developments in the theory of active learning.

...read moreread less

Book•

Mastering Machine Learning With scikit-learn

[...]

Gavin Hackeling

10 Nov 2014

TL;DR: This book examines machine learning models including logistic regression, decision trees, and support vector machines, and applies them to common problems such as categorizing documents and classifying images and uses an unsupervised Hidden Markov Model to predict stock prices.

...read moreread less

Abstract: Apply effective learning algorithms to real-world problems using scikit-learn About This BookDesign and troubleshoot machine learning systems for common tasks including regression, classification, and clusteringAcquaint yourself with popular machine learning algorithms, including decision trees, logistic regression, and support vector machinesA practical example-based guide to help you gain expertise in implementing and evaluating machine learning systems using scikit-learnWho This Book Is ForIf you are a software developer who wants to learn how machine learning models work and how to apply them effectively, this book is for you. Familiarity with machine learning fundamentals and Python will be helpful, but is not essential. In Detail This book examines machine learning models including logistic regression, decision trees, and support vector machines, and applies them to common problems such as categorizing documents and classifying images. It begins with the fundamentals of machine learning, introducing you to the supervised-unsupervised spectrum, the uses of training and test data, and evaluating models. You will learn how to use generalized linear models in regression problems, as well as solve problems with text and categorical features.You will be acquainted with the use of logistic regression, regularization, and the various loss functions that are used by generalized linear models. The book will also walk you through an example project that prompts you to label the most uncertain training examples. You will also use an unsupervised Hidden Markov Model to predict stock prices.By the end of the book, you will be an expert in scikit-learn and will be well versed in machine learning

...read moreread less

Proceedings Article•10.3850/978-981-09-5247-1_017•

Methods to Avoid Over-Fitting and Under-Fitting in Supervised Machine Learning (Comparative Study)

[...]

Haider Khalaf Jabbar, Rafiqul Zaman Khan

1 Jan 2014

Posted Content•

Variational Recurrent Auto-Encoders

[...]

Otto Fabius, Joost van Amersfoort

20 Dec 2014-arXiv: Machine Learning

TL;DR: In this paper, the authors propose a model that combines the strengths of RNNs and SGVB called Variational Recurrent Auto-Encoder (VRAE), which can be used for efficient, large scale unsupervised learning on time series data, mapping the time-series data to a latent vector representation.

...read moreread less

Abstract: In this paper we propose a model that combines the strengths of RNNs and SGVB: the Variational Recurrent Auto-Encoder (VRAE). Such a model can be used for efficient, large scale unsupervised learning on time series data, mapping the time series data to a latent vector representation. The model is generative, such that data can be generated from samples of the latent space. An important contribution of this work is that the model can make use of unlabeled data in order to facilitate supervised training of RNNs by initialising the weights and network state.

...read moreread less

Book Chapter•10.1016/B978-0-12-802806-3.00008-7•

From neural PCA to deep unsupervised learning

[...]

Harri Valpola

28 Nov 2014-arXiv: Machine Learning

TL;DR: A network supporting deep unsupervised learning is presented that is an autoencoder with lateral shortcut connections from the encoder to the decoder at each level of the hierarchy, analogous to hierarchical latent variable models.

...read moreread less

Abstract: A network supporting deep unsupervised learning is presented. The network is an autoencoder with lateral shortcut connections from the encoder to the decoder at each level of the hierarchy. The lateral shortcut connections allow the higher levels of the hierarchy to focus on abstract invariant features. Whereas autoencoders are analogous to latent variable models with a single layer of stochastic variables, the proposed network is analogous to hierarchical latent variable models. Learning combines denoising autoencoder and denoising sources separation frameworks. Each layer of the network contributes to the cost function a term which measures the distance of the representations produced by the encoder and the decoder. Since training signals originate from all levels of the network, all layers can learn efficiently even in deep networks. The speedup offered by cost terms from higher levels of the hierarchy and the ability to learn invariant features are demonstrated in experiments.

...read moreread less

Posted Content•

Reinforcement and Imitation Learning via Interactive No-Regret Learning

[...]

Stephane Ross, J. Andrew Bagnell

23 Jun 2014-arXiv: Learning

TL;DR: This work develops an interactive imitation learning approach that leverages cost information and extends the technique to address reinforcement learning, suggesting a broad new family of algorithms and providing a unifying view of existing techniques for imitation and reinforcement learning.

...read moreread less

Abstract: Recent work has demonstrated that problems-- particularly imitation learning and structured prediction-- where a learner's predictions influence the input-distribution it is tested on can be naturally addressed by an interactive approach and analyzed using no-regret online learning. These approaches to imitation learning, however, neither require nor benefit from information about the cost of actions. We extend existing results in two directions: first, we develop an interactive imitation learning approach that leverages cost information; second, we extend the technique to address reinforcement learning. The results provide theoretical support to the commonly observed successes of online approximate policy iteration. Our approach suggests a broad new family of algorithms and provides a unifying view of existing techniques for imitation and reinforcement learning.

...read moreread less

Journal Article•10.1109/TIP.2014.2363423•

Spectral Unmixing via Data-Guided Sparsity

[...]

Feiyun Zhu¹, Ying Wang¹, Bin Fan¹, Shiming Xiang¹, Geofeng Meng¹, Chunhong Pan¹ - Show less +2 more•Institutions (1)

Chinese Academy of Sciences¹

01 Dec 2014-IEEE Transactions on Image Processing

TL;DR: This paper proposes a novel sparsity-based method by learning a data-guided map (DgMap) to describe the individual mixed level of each pixel and applies the ℓp (0 <; p <; 1) constraint in an adaptive manner.

...read moreread less

Abstract: Hyperspectral unmixing, the process of estimating a common set of spectral bases and their corresponding composite percentages at each pixel, is an important task for hyperspectral analysis, visualization, and understanding. From an unsupervised learning perspective, this problem is very challenging—both the spectral bases and their composite percentages are unknown, making the solution space too large. To reduce the solution space, many approaches have been proposed by exploiting various priors. In practice, these priors would easily lead to some unsuitable solution. This is because they are achieved by applying an identical strength of constraints to all the factors, which does not hold in practice. To overcome this limitation, we propose a novel sparsity-based method by learning a data-guided map (DgMap) to describe the individual mixed level of each pixel. Through this DgMap, the $\ell _{p}\left (0 constraint is applied in an adaptive manner. Such implementation not only meets the practical situation, but also guides the spectral bases toward the pixels under highly sparse constraint. What is more, an elegant optimization scheme as well as its convergence proof have been provided in this paper. Extensive experiments on several datasets also demonstrate that the DgMap is feasible, and high quality unmixing results could be obtained by our method.

...read moreread less

Journal Article•10.1109/TITS.2013.2290285•

Spatiotemporal Patterns in Large-Scale Traffic Speed Prediction

[...]

Muhammad Tayyab Asif¹, Justin Dauwels¹, Chong Yang Goh², Ali Oran³, Esmail Fathi⁴, Muye Xu⁵, Menoth Mohan Dhanya¹, Nikola Mitrovic¹, Patrick Jaillet² - Show less +5 more•Institutions (5)

Nanyang Technological University¹, Massachusetts Institute of Technology², Singapore–MIT alliance³, Siemens⁴, Deutsche Bank⁵

01 Apr 2014-IEEE Transactions on Intelligent Transportation Systems

TL;DR: This work proposes unsupervised learning methods, such as k-means clustering, principal component analysis, and self-organizing maps, to mine spatiotemporal performance trends at the network level and for individual links to improve the performance of intelligent transportation systems.

...read moreread less

Abstract: The ability to accurately predict traffic speed in a large and heterogeneous road network has many useful applications, such as route guidance and congestion avoidance. In principle, data-driven methods, such as support vector regression (SVR), can predict traffic with high accuracy because traffic tends to exhibit regular patterns over time. However, in practice, the prediction performance can significantly vary across the network and during different time periods. Insight into those spatiotemporal trends can improve the performance of intelligent transportation systems. Traditional prediction error measures, such as the mean absolute percentage error, provide information about the individual links in the network but do not capture global trends. We propose unsupervised learning methods, such as k-means clustering, principal component analysis, and self-organizing maps, to mine spatiotemporal performance trends at the network level and for individual links. We perform prediction for a large interconnected road network and for multiple prediction horizons with an SVR-based algorithm. We show the effectiveness of the proposed performance analysis methods by applying them to the prediction data of the SVR.

...read moreread less

Book•

Hierarchical Relative Entropy Policy Search

[...]

Christian Daniel¹, Gerhard Neumann¹, Oliver Kroemer¹, Jan Peters¹•Institutions (1)

Technische Universität Darmstadt¹

4 Jan 2014

TL;DR: In this article, the problem of learning sub-policies in continuous state action spaces is defined as finding a hierarchical policy that is composed of a high-level gating policy to select the low-level sub-tasks for execution by the agent.

...read moreread less

Abstract: Many reinforcement learning (RL) tasks, especially in robotics, consist of multiple sub-tasks that are strongly structured. Such task structures can be exploited by incorporating hierarchical policies that consist of gating networks and sub-policies. However, this concept has only been partially explored for real world settings and complete methods, derived from first principles, are needed. Real world settings are challenging due to large and continuous state-action spaces that are prohibitive for exhaustive sampling methods. We define the problem of learning sub-policies in continuous state action spaces as finding a hierarchical policy that is composed of a high-level gating policy to select the low-level sub-policies for execution by the agent. In order to efficiently share experience with all sub-policies, also called inter-policy learning, we treat these sub-policies as latent variables which allows for distribution of the update information between the sub-policies. We present three different variants of our algorithm, designed to be suitable for a wide variety of real world robot learning tasks and evaluate our algorithms in two real robot learning scenarios as well as several simulations and comparisons.

...read moreread less

Posted Content•

Quantum Algorithms for Nearest-Neighbor Methods for Supervised and Unsupervised Learning

[...]

Nathan Wiebe¹, Ashish Kapoor¹, Krysta M. Svore¹•Institutions (1)

Microsoft¹

09 Jan 2014-arXiv: Quantum Physics

TL;DR: In this paper, the authors present several quantum algorithms for nearest-neighbor learning, which are fast and coherent quantum methods for computing distance metrics such as the inner product and Euclidean distance.

...read moreread less

Abstract: We present several quantum algorithms for performing nearest-neighbor learning. At the core of our algorithms are fast and coherent quantum methods for computing distance metrics such as the inner product and Euclidean distance. We prove upper bounds on the number of queries to the input data required to compute these metrics. In the worst case, our quantum algorithms lead to polynomial reductions in query complexity relative to the corresponding classical algorithm. In certain cases, we show exponential or even super-exponential reductions over the classical analog. We study the performance of our quantum nearest-neighbor algorithms on several real-world binary classification tasks and find that the classification accuracy is competitive with classical methods.

...read moreread less

Proceedings Article•10.1109/.40•

An Intrusion Detection Model Based on Deep Belief Networks

[...]

Ni Gao¹, Ling Gao¹, Quanli Gao¹, Hai Wang¹•Institutions (1)

Northwest University (China)¹

20 Nov 2014

TL;DR: This paper focuses on an important research problem of Big Data classification in intrusion detection system, and an intrusion detection model based on Deep Belief Networks is proposed to apply in intrusion recognition domain.

...read moreread less

Abstract: This paper focuses on an important research problem of Big Data classification in intrusion detection system. Deep Belief Networks is introduced to the field of intrusion detection, and an intrusion detection model based on Deep Belief Networks is proposed to apply in intrusion recognition domain. The deep hierarchical model is a deep neural network classifier of a combination of multilayer unsupervised learning networks, which is called as Restricted Boltzmann Machine, and a supervised learning network, which is called as Back-propagation network. The experimental results on KDD CUP 1999 dataset demonstrate that the performance of Deep Belief Networks model is better than that of SVM and ANN.

...read moreread less

Journal Article•10.1093/BIB/BBT034•

Supervised, semi-supervised and unsupervised inference of gene regulatory networks

[...]

Stefan Maetschke, Piyush B. Madhamshettiwar, Melissa J. Davis, Mark A. Ragan

01 Mar 2014-Briefings in Bioinformatics

TL;DR: An extensive evaluation of inference methods on simulated and experimental expression data reveals low prediction accuracies for unsupervised techniques with the notable exception of the Z-SCORE method on knockout data.

...read moreread less

Abstract: Inference of gene regulatory network from expression data is a challenging task. Many methods have been developed to this purpose but a comprehensive evaluation that covers unsupervised, semi-supervised and supervised methods, and provides guidelines for their practical application, is lacking. We performed an extensive evaluation of inference methods on simulated and experimental expression data. The results reveal low prediction accuracies for unsupervised techniques with the notable exception of the Z-SCORE method on knockout data. In all other cases, the supervised approach achieved the highest accuracies and even in a semi-supervised setting with small numbers of only positive samples, outperformed the unsupervised techniques.

...read moreread less

Journal Article•10.1016/J.NICL.2013.11.002•

Dissecting psychiatric spectrum disorders by generative embedding

[...]

Kay H. Brodersen¹, Kay H. Brodersen², Lorenz Deserno³, Lorenz Deserno⁴, Florian Schlagenhauf³, Florian Schlagenhauf⁴, Zhihao Lin¹, Zhihao Lin², William D. Penny⁵, Joachim M. Buhmann², Klaas E. Stephan⁵, Klaas E. Stephan¹ - Show less +8 more•Institutions (5)

University of Zurich¹, ETH Zurich², Charité³, Max Planck Society⁴, Wellcome Trust Centre for Neuroimaging⁵

01 Jan 2014-NeuroImage: Clinical

TL;DR: The results corroborate the previous findings in stroke patients that generative embedding, compared to analyses of more conventional measures such as functional connectivity or regional activity, can significantly enhance both the interpretability and performance of computational approaches to clinical classification.

...read moreread less

...

Expand