Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Unsupervised learning
  4. 2014
  1. Home
  2. Topics
  3. Unsupervised learning
  4. 2014
Showing papers on "Unsupervised learning published in 2014"
Book•
Deep Learning: Methods and Applications

[...]

Li Deng1, Dong Yu1•
Microsoft1
12 Jun 2014
TL;DR: This monograph provides an overview of general deep learning methodology and its applications to a variety of signal and information processing tasks, including natural language and text processing, information retrieval, and multimodal information processing empowered by multi-task deep learning.
Abstract: This monograph provides an overview of general deep learning methodology and its applications to a variety of signal and information processing tasks. The application areas are chosen with the following three criteria in mind: (1) expertise or knowledge of the authors; (2) the application areas that have already been transformed by the successful use of deep learning technology, such as speech recognition and computer vision; and (3) the application areas that have the potential to be impacted significantly by deep learning and that have been experiencing research growth, including natural language and text processing, information retrieval, and multimodal information processing empowered by multi-task deep learning.

3,392 citations

Journal Article•10.3389/FNINF.2014.00014•
Machine learning for neuroimaging with scikit-learn.

[...]

Alexandre Abraham1, Alexandre Abraham2, Fabian Pedregosa2, Fabian Pedregosa1, Michael Eickenberg1, Michael Eickenberg2, Philippe Gervais1, Philippe Gervais2, Andreas Mueller3, Jean Kossaifi4, Alexandre Gramfort5, Alexandre Gramfort1, Alexandre Gramfort2, Bertrand Thirion2, Bertrand Thirion1, Gaël Varoquaux1, Gaël Varoquaux2 •
IBM1, École Polytechnique2, University of Bonn3, Imperial College London4, Institut Mines-Télécom5
21 Feb 2014-Frontiers in Neuroinformatics
TL;DR: It is illustrated how scikit-learn, a Python machine learning library, can be used to perform some key analysis steps and its application to neuroimaging data provides a versatile tool to study the brain.
Abstract: Statistical machine learning methods are increasingly used for neuroimaging data analysis. Their main virtue is their ability to model high-dimensional datasets, e.g. multivariate analysis of activation images or resting-state time series. Supervised learning is typically used in decoding or encoding settings to relate brain images to behavioral or clinical observations, while unsupervised learning can uncover hidden structures in sets of images (e.g. resting state functional MRI) or find sub-populations in large cohorts. By considering different functional neuroimaging applications, we illustrate how scikit-learn, a Python machine learning library, can be used to perform some key analysis steps. Scikit-learn contains a very large set of statistical learning algorithms, both supervised and unsupervised, and its application to neuroimaging data provides a versatile tool to study the brain.

2,275 citations

Proceedings Article•10.1145/2623330.2623732•
DeepWalk: Online Learning of Social Representations

[...]

Bryan Perozzi1, Rami Al-Rfou1, Steven Skiena1•
Stony Brook University1
26 Mar 2014-arXiv: Social and Information Networks
TL;DR: DeepWalk is an online learning algorithm which builds useful incremental results, and is trivially parallelizable, which make it suitable for a broad class of real world applications such as network classification, and anomaly detection.
Abstract: We present DeepWalk, a novel approach for learning latent representations of vertices in a network. These latent representations encode social relations in a continuous vector space, which is easily exploited by statistical models. DeepWalk generalizes recent advancements in language modeling and unsupervised feature learning (or deep learning) from sequences of words to graphs. DeepWalk uses local information obtained from truncated random walks to learn latent representations by treating walks as the equivalent of sentences. We demonstrate DeepWalk's latent representations on several multi-label network classification tasks for social networks such as BlogCatalog, Flickr, and YouTube. Our results show that DeepWalk outperforms challenging baselines which are allowed a global view of the network, especially in the presence of missing information. DeepWalk's representations can provide $F_1$ scores up to 10% higher than competing methods when labeled data is sparse. In some experiments, DeepWalk's representations are able to outperform all baseline methods while using 60% less training data. DeepWalk is also scalable. It is an online learning algorithm which builds useful incremental results, and is trivially parallelizable. These qualities make it suitable for a broad class of real world applications such as network classification, and anomaly detection.

1,629 citations

Journal Article•10.1016/J.PATREC.2014.01.008•
A review of unsupervised feature learning and deep learning for time-series modeling ☆

[...]

Martin Längkvist1, Lars Karlsson1, Amy Loutfi1•
Örebro University1
01 Jun 2014-Pattern Recognition Letters
TL;DR: This paper overviews the particular challenges present in time-series data and provides a review of the works that have either applied time- series data to unsupervised feature learning algorithms or alternatively have contributed to modifications of featurelearning algorithms to take into account the challenges present.

1,344 citations

Posted Content•
Discriminative Unsupervised Feature Learning with Convolutional Neural Networks

[...]

Alexey Dosovitskiy1, Jost Tobias Springenberg1, Martin Riedmiller1, Thomas Brox1•
University of Freiburg1
26 Jun 2014
TL;DR: In this article, a set of surrogate classes are formed by applying a variety of transformations to a randomly sampled'seed' image patch, and the resulting feature representation is not class specific.
Abstract: Deep convolutional networks have proven to be very successful in learning task specific features that allow for unprecedented performance on various computer vision tasks. Training of such networks follows mostly the supervised learning paradigm, where sufficiently many input-output pairs are required for training. Acquisition of large training sets is one of the key challenges, when approaching a new task. In this paper, we aim for generic feature learning and present an approach for training a convolutional network using only unlabeled data. To this end, we train the network to discriminate between a set of surrogate classes. Each surrogate class is formed by applying a variety of transformations to a randomly sampled 'seed' image patch. In contrast to supervised network training, the resulting feature representation is not class specific. It rather provides robustness to the transformations that have been applied during training. This generic feature representation allows for classification results that outperform the state of the art for unsupervised learning on several popular datasets (STL-10, CIFAR-10, Caltech-101, Caltech-256). While such generic features cannot compete with class specific features from supervised training on a classification task, we show that they are advantageous on geometric matching problems, where they also outperform the SIFT descriptor.

854 citations

Posted Content•
Deep metric learning using Triplet network

[...]

Elad Hoffer1, Nir Ailon1•
Technion – Israel Institute of Technology1
20 Dec 2014-arXiv: Learning
TL;DR: In this paper, Wang et al. proposed the triplet network model, which aims to learn useful representations by distance comparisons, and demonstrate using various datasets that their model learns a better representation than that of its immediate competitor, the Siamese network.
Abstract: Deep learning has proven itself as a successful set of models for learning useful semantic representations of data. These, however, are mostly implicitly learned as part of a classification task. In this paper we propose the triplet network model, which aims to learn useful representations by distance comparisons. A similar model was defined by Wang et al. (2014), tailor made for learning a ranking for image information retrieval. Here we demonstrate using various datasets that our model learns a better representation than that of its immediate competitor, the Siamese network. We also discuss future possible usage as a framework for unsupervised learning.

825 citations

Journal Article•10.1109/TCYB.2014.2307349•
Semi-Supervised and Unsupervised Extreme Learning Machines

[...]

Gao Huang1, Shiji Song1, Jatinder N. D. Gupta2, Cheng Wu1•
Tsinghua University1, University of Alabama in Huntsville2
12 Mar 2014-IEEE Transactions on Systems, Man, and Cybernetics
TL;DR: It is shown in this paper that all the supervised, semi-supervised, and unsupervised ELMs can actually be put into a unified framework, which provides new perspectives for understanding the mechanism of random feature mapping, which is the key concept in ELM theory.
Abstract: Extreme learning machines (ELMs) have proven to be efficient and effective learning mechanisms for pattern classification and regression. However, ELMs are primarily applied to supervised learning problems. Only a few existing research papers have used ELMs to explore unlabeled data. In this paper, we extend ELMs for both semi-supervised and unsupervised tasks based on the manifold regularization, thus greatly expanding the applicability of ELMs. The key advantages of the proposed algorithms are as follows: 1) both the semi-supervised ELM (SS-ELM) and the unsupervised ELM (US-ELM) exhibit learning capability and computational efficiency of ELMs; 2) both algorithms naturally handle multiclass classification or multicluster clustering; and 3) both algorithms are inductive and can handle unseen data at test time directly. Moreover, it is shown in this paper that all the supervised, semi-supervised, and unsupervised ELMs can actually be put into a unified framework. This provides new perspectives for understanding the mechanism of random feature mapping, which is the key concept in ELM theory. Empirical study on a wide range of data sets demonstrates that the proposed algorithms are competitive with the state-of-the-art semi-supervised or unsupervised learning algorithms in terms of accuracy and efficiency.

815 citations

Proceedings Article•
Discriminative Unsupervised Feature Learning with Convolutional Neural Networks

[...]

Alexey Dosovitskiy1, Jost Tobias Springenberg1, Martin Riedmiller1, Thomas Brox1•
University of Freiburg1
8 Dec 2014
TL;DR: This paper presents an approach for training a convolutional neural network using only unlabeled data and trains the network to discriminate between a set of surrogate classes, finding that this simple feature learning algorithm is surprisingly successful when applied to visual object recognition.
Abstract: Current methods for training convolutional neural networks depend on large amounts of labeled samples for supervised training. In this paper we present an approach for training a convolutional neural network using only unlabeled data. We train the network to discriminate between a set of surrogate classes. Each surrogate class is formed by applying a variety of transformations to a randomly sampled 'seed' image patch. We find that this simple feature learning algorithm is surprisingly successful when applied to visual object recognition. The feature representation learned by our algorithm achieves classification results matching or outperforming the current state-of-the-art for unsupervised learning on several popular datasets (STL-10, CIFAR-10, Caltech-101).

734 citations

Journal Article•10.3389/FNINS.2014.00229•
Deep learning for neuroimaging: A validation study

[...]

Sergey M. Plis1, R Devon Hjelm2, Ruslan Salakhutdinov3, Elena A. Allen4, Elena A. Allen1, HJ Bockholt5, Jeffrey D. Long6, Jeffrey D. Long5, Hans J. Johnson5, Hans J. Johnson6, Jane S. Paulsen5, Jane S. Paulsen6, Jessica A. Turner7, Vince D. Calhoun2, Vince D. Calhoun1 •
The Mind Research Network1, University of New Mexico2, University of Toronto3, University of Bergen4, University of Iowa5, Roy J. and Lucille A. Carver College of Medicine6, Georgia State University7
20 Aug 2014-Frontiers in Neuroscience
TL;DR: In this article, a constraint-based approach to visualizing high dimensional data was proposed to analyze the effect of parameter choices on data transformations and showed that deep learning methods are able to learn physiologically important representations and detect latent relations in neuroimaging data.
Abstract: Deep learning methods have recently made notable advances in the tasks of classification and representation learning. These tasks are important for brain imaging and neuroscience discovery, making the methods attractive for porting to a neuroimager's toolbox. Success of these methods is, in part, explained by the flexibility of deep learning models. However, this flexibility makes the process of porting to new areas a difficult parameter optimization problem. In this work we demonstrate our results (and feasible parameter ranges) in application of deep learning methods to structural and functional brain imaging data. These methods include deep belief networks and their building block the restricted Boltzmann machine. We also describe a novel constraint-based approach to visualizing high dimensional data. We use it to analyze the effect of parameter choices on data transformations. Our results show that deep learning methods are able to learn physiologically important representations and detect latent relations in neuroimaging data.

564 citations

Journal Article•10.1109/TPAMI.2014.2307881•
Active Learning by Querying Informative and Representative Examples

[...]

Sheng-Jun Huang1, Rong Jin2, Zhi-Hua Zhou1•
Nanjing University1, Michigan State University2
24 Feb 2014-IEEE Transactions on Pattern Analysis and Machine Intelligence
TL;DR: In this paper, the min-max view of active learning is used to measure and combine the informativeness and representativeness of an unlabeled instance, and the QUIRE approach is extended to multi-label learning by actively querying instance-label pairs.
Abstract: Active learning reduces the labeling cost by iteratively selecting the most valuable data to query their labels. It has attracted a lot of interests given the abundance of unlabeled data and the high cost of labeling. Most active learning approaches select either informative or representative unlabeled instances to query their labels, which could significantly limit their performance. Although several active learning algorithms were proposed to combine the two query selection criteria, they are usually ad hoc in finding unlabeled instances that are both informative and representative. We address this limitation by developing a principled approach, termed QUIRE, based on the min-max view of active learning. The proposed approach provides a systematic way for measuring and combining the informativeness and representativeness of an unlabeled instance. Further, by incorporating the correlation among labels, we extend the QUIRE approach to multi-label learning by actively querying instance-label pairs. Extensive experimental results show that the proposed QUIRE approach outperforms several state-of-the-art active learning approaches in both single-label and multi-label learning.

457 citations

Book•
Quantum Machine Learning: What Quantum Computing Means to Data Mining

[...]

Peter Wittek
28 Aug 2014
TL;DR: Quantum Machine Learning bridges the gap between abstract developments in quantum computing and the applied research on machine learning by paring down the complexity of the disciplines involved.
Abstract: Quantum Machine Learning bridges the gap between abstract developments in quantum computing and the applied research on machine learning. Paring down the complexity of the disciplines involved, it ...
Proceedings Article•10.1109/ICASSP.2014.6853873•
Deep learning of feature representation with multiple instance learning for medical image analysis

[...]

Yan Xu1, Tao Mo2, Qiwei Feng2, Peilin Zhong2, Maode Lai3, Eric Chang2 •
Beihang University1, Microsoft2, Zhejiang University3
4 May 2014
TL;DR: In this article, the authors used multiple instance learning (MIL) framework in classification training with deep learning features and found that automatic feature learning outperformed manual feature learning and achieved performance that's close to fully supervised approach.
Abstract: This paper studies the effectiveness of accomplishing high-level tasks with a minimum of manual annotation and good feature representations for medical images. In medical image analysis, objects like cells are characterized by significant clinical features. Previously developed features like SIFT and HARR are unable to comprehensively represent such objects. Therefore, feature representation is especially important. In this paper, we study automatic extraction of feature representation through deep learning (DNN). Furthermore, detailed annotation of objects is often an ambiguous and challenging task. We use multiple instance learning (MIL) framework in classification training with deep learning features. Several interesting conclusions can be drawn from our work: (1) automatic feature learning outperforms manual feature; (2) the unsupervised approach can achieve performance that's close to fully supervised approach (93.56%) vs. (94.52%); and (3) the MIL performance of coarse label (96.30%) outweighs the supervised performance of fine label (95.40%) in supervised deep learning features.
Journal Article•10.1080/09332480.2014.914768•
Machine Learning, a Probabilistic Perspective

[...]

Christian P. Robert
23 Apr 2014-Chance
TL;DR: Overall, the chapter on Bayesian inference does not spend much time on prior specification, and both Chib’s method and the Savage-Dickey density ratio are suggested for the approximation of marginal likelihoods.
Abstract: Kevin P. MurphyHardcover: 1104 pagesYear: 2012Publisher: The MIT PressISBN-13: 978-0262018029I have to admit the rather embarrassing fact that Machine Learning, a Probabilistic Perspective is the f...
Proceedings Article•
Analysis of Learning from Positive and Unlabeled Data

[...]

Marthinus Christoffel du Plessis1, Gang Niu2, Masashi Sugiyama1•
University of Tokyo1, Baidu2
8 Dec 2014
TL;DR: This paper first shows that this problem can be solved by cost-sensitive learning between positive and unlabeled data, and shows that convex surrogate loss functions such as the hinge loss may lead to a wrong classification boundary due to an intrinsic bias, but this can be avoided by using non-convex loss functionssuch as the ramp loss.
Abstract: Learning a classifier from positive and unlabeled data is an important class of classification problems that are conceivable in many practical applications. In this paper, we first show that this problem can be solved by cost-sensitive learning between positive and unlabeled data. We then show that convex surrogate loss functions such as the hinge loss may lead to a wrong classification boundary due to an intrinsic bias, but the problem can be avoided by using non-convex loss functions such as the ramp loss. We next analyze the excess risk when the class prior is estimated from data, and show that the classification accuracy is not sensitive to class prior estimation if the unlabeled data is dominated by the positive data (this is naturally satisfied in inlier-based outlier detection because inliers are dominant in the unlabeled dataset). Finally, we provide generalization error bounds and show that, for an equal number of labeled and unlabeled samples, the generalization error of learning only from positive and unlabeled samples is no worse than 2√2 times the fully supervised case. These theoretical findings are also validated through experiments.
Journal Article•10.1002/WICS.1317•
Matching and record linkage

[...]

William E. Winkler1•
United States Census Bureau1
01 Sep 2014-Wiley Interdisciplinary Reviews: Computational Statistics
TL;DR: This overview gives background on a number of statistical methods that have been proven effective for record linkage and describes ongoing research for adjusting standard statistical analyses for linkage error.
Abstract: This overview gives background on a number of statistical methods that have been proven effective for record linkage. To prepare data for the main computational algorithms, we need parsing/standardization that allows us to structure the free-form names, addresses, and other fields into corresponding components. The main parameter-estimation methods are unsupervised methods that yield 'optimal' record linkage parameters. Extended methods provide estimates of false match rates in both unsupervised and, with greater accuracy, in semi-supervised situations. Finally, the paper describes ongoing research for adjusting standard statistical analyses for linkage error. WIREs Comput Stat 2014, 6:313-325. doi: 10.1002/wics.1317
Journal Article•10.1016/J.PATREC.2013.06.010•
A bagging SVM to learn from positive and unlabeled examples

[...]

Fantine Mordelet1, Jean-Philippe Vert2, Jean-Philippe Vert3, Jean-Philippe Vert4•
Duke University1, Curie Institute2, Mines ParisTech3, French Institute of Health and Medical Research4
01 Feb 2014-Pattern Recognition Letters
TL;DR: It is shown theoretically and experimentally that the proposed method can match and even outperform the performance of state-of-the-art methods for PU learning, particularly when the number of positive examples is limited and the fraction of negatives among the unlabeled examples is small.
Journal Article•10.1016/J.NEURON.2013.11.030•
Learning by the Dendritic Prediction of Somatic Spiking

[...]

Robert Urbanczik1, Walter Senn1•
University of Bern1
05 Feb 2014-Neuron
TL;DR: A simple compartmental neuron model is presented together with a non-Hebbian, biologically plausible learning rule for dendritic synapses where plasticity is modulated by these three factors, and a single plasticity rule supports diverse learning paradigms.
Book•
Theory of Disagreement-Based Active Learning

[...]

Steve Hanneke
30 May 2014
TL;DR: Recent advances in the understanding of the theoretical benefits of active learning are described, and implications for the design of effective active learning algorithms are described.
Abstract: Active learning is a protocol for supervised machine learning, in which a learning algorithm sequentially requests the labels of selected data points from a large pool of unlabeled data. This contrasts with passive learning, where the labeled data are taken at random. The objective in active learning is to produce a highly-accurate classifier, ideally using fewer labels than the number of random labeled data sufficient for passive learning to achieve the same. This article describes recent advances in our understanding of the theoretical benefits of active learning, and implications for the design of effective active learning algorithms. Much of the article focuses on a particular technique, namely disagreement-based active learning, which by now has amassed a mature and coherent literature. It also briefly surveys several alternative approaches from the literature. The emphasis is on theorems regarding the performance of a few general algorithms, including rigorous proofs where appropriate. However, the presentation is intended to be pedagogical, focusing on results that illustrate fundamental ideas, rather than obtaining the strongest or most general known theorems. The intended audience includes researchers and advanced graduate students in machine learning and statistics, interested in gaining a deeper understanding of the recent and ongoing developments in the theory of active learning.
Book•
Mastering Machine Learning With scikit-learn

[...]

Gavin Hackeling
10 Nov 2014
TL;DR: This book examines machine learning models including logistic regression, decision trees, and support vector machines, and applies them to common problems such as categorizing documents and classifying images and uses an unsupervised Hidden Markov Model to predict stock prices.
Abstract: Apply effective learning algorithms to real-world problems using scikit-learn About This BookDesign and troubleshoot machine learning systems for common tasks including regression, classification, and clusteringAcquaint yourself with popular machine learning algorithms, including decision trees, logistic regression, and support vector machinesA practical example-based guide to help you gain expertise in implementing and evaluating machine learning systems using scikit-learnWho This Book Is ForIf you are a software developer who wants to learn how machine learning models work and how to apply them effectively, this book is for you. Familiarity with machine learning fundamentals and Python will be helpful, but is not essential. In Detail This book examines machine learning models including logistic regression, decision trees, and support vector machines, and applies them to common problems such as categorizing documents and classifying images. It begins with the fundamentals of machine learning, introducing you to the supervised-unsupervised spectrum, the uses of training and test data, and evaluating models. You will learn how to use generalized linear models in regression problems, as well as solve problems with text and categorical features.You will be acquainted with the use of logistic regression, regularization, and the various loss functions that are used by generalized linear models. The book will also walk you through an example project that prompts you to label the most uncertain training examples. You will also use an unsupervised Hidden Markov Model to predict stock prices.By the end of the book, you will be an expert in scikit-learn and will be well versed in machine learning
Proceedings Article•10.3850/978-981-09-5247-1_017•
Methods to Avoid Over-Fitting and Under-Fitting in Supervised Machine Learning (Comparative Study)

[...]

Haider Khalaf Jabbar, Rafiqul Zaman Khan
1 Jan 2014
Posted Content•
Variational Recurrent Auto-Encoders

[...]

Otto Fabius, Joost van Amersfoort
20 Dec 2014-arXiv: Machine Learning
TL;DR: In this paper, the authors propose a model that combines the strengths of RNNs and SGVB called Variational Recurrent Auto-Encoder (VRAE), which can be used for efficient, large scale unsupervised learning on time series data, mapping the time-series data to a latent vector representation.
Abstract: In this paper we propose a model that combines the strengths of RNNs and SGVB: the Variational Recurrent Auto-Encoder (VRAE). Such a model can be used for efficient, large scale unsupervised learning on time series data, mapping the time series data to a latent vector representation. The model is generative, such that data can be generated from samples of the latent space. An important contribution of this work is that the model can make use of unlabeled data in order to facilitate supervised training of RNNs by initialising the weights and network state.
Book Chapter•10.1016/B978-0-12-802806-3.00008-7•
From neural PCA to deep unsupervised learning

[...]

Harri Valpola
28 Nov 2014-arXiv: Machine Learning
TL;DR: A network supporting deep unsupervised learning is presented that is an autoencoder with lateral shortcut connections from the encoder to the decoder at each level of the hierarchy, analogous to hierarchical latent variable models.
Abstract: A network supporting deep unsupervised learning is presented. The network is an autoencoder with lateral shortcut connections from the encoder to the decoder at each level of the hierarchy. The lateral shortcut connections allow the higher levels of the hierarchy to focus on abstract invariant features. Whereas autoencoders are analogous to latent variable models with a single layer of stochastic variables, the proposed network is analogous to hierarchical latent variable models. Learning combines denoising autoencoder and denoising sources separation frameworks. Each layer of the network contributes to the cost function a term which measures the distance of the representations produced by the encoder and the decoder. Since training signals originate from all levels of the network, all layers can learn efficiently even in deep networks. The speedup offered by cost terms from higher levels of the hierarchy and the ability to learn invariant features are demonstrated in experiments.
Posted Content•
Reinforcement and Imitation Learning via Interactive No-Regret Learning

[...]

Stephane Ross, J. Andrew Bagnell
23 Jun 2014-arXiv: Learning
TL;DR: This work develops an interactive imitation learning approach that leverages cost information and extends the technique to address reinforcement learning, suggesting a broad new family of algorithms and providing a unifying view of existing techniques for imitation and reinforcement learning.
Abstract: Recent work has demonstrated that problems-- particularly imitation learning and structured prediction-- where a learner's predictions influence the input-distribution it is tested on can be naturally addressed by an interactive approach and analyzed using no-regret online learning. These approaches to imitation learning, however, neither require nor benefit from information about the cost of actions. We extend existing results in two directions: first, we develop an interactive imitation learning approach that leverages cost information; second, we extend the technique to address reinforcement learning. The results provide theoretical support to the commonly observed successes of online approximate policy iteration. Our approach suggests a broad new family of algorithms and provides a unifying view of existing techniques for imitation and reinforcement learning.
Journal Article•10.1109/TIP.2014.2363423•
Spectral Unmixing via Data-Guided Sparsity

[...]

Feiyun Zhu1, Ying Wang1, Bin Fan1, Shiming Xiang1, Geofeng Meng1, Chunhong Pan1 •
Chinese Academy of Sciences1
01 Dec 2014-IEEE Transactions on Image Processing
TL;DR: This paper proposes a novel sparsity-based method by learning a data-guided map (DgMap) to describe the individual mixed level of each pixel and applies the ℓp (0 <; p <; 1) constraint in an adaptive manner.
Abstract: Hyperspectral unmixing, the process of estimating a common set of spectral bases and their corresponding composite percentages at each pixel, is an important task for hyperspectral analysis, visualization, and understanding. From an unsupervised learning perspective, this problem is very challenging—both the spectral bases and their composite percentages are unknown, making the solution space too large. To reduce the solution space, many approaches have been proposed by exploiting various priors. In practice, these priors would easily lead to some unsuitable solution. This is because they are achieved by applying an identical strength of constraints to all the factors, which does not hold in practice. To overcome this limitation, we propose a novel sparsity-based method by learning a data-guided map (DgMap) to describe the individual mixed level of each pixel. Through this DgMap, the $\ell _{p}\left (0 constraint is applied in an adaptive manner. Such implementation not only meets the practical situation, but also guides the spectral bases toward the pixels under highly sparse constraint. What is more, an elegant optimization scheme as well as its convergence proof have been provided in this paper. Extensive experiments on several datasets also demonstrate that the DgMap is feasible, and high quality unmixing results could be obtained by our method.
Journal Article•10.1109/TITS.2013.2290285•
Spatiotemporal Patterns in Large-Scale Traffic Speed Prediction

[...]

Muhammad Tayyab Asif1, Justin Dauwels1, Chong Yang Goh2, Ali Oran3, Esmail Fathi4, Muye Xu5, Menoth Mohan Dhanya1, Nikola Mitrovic1, Patrick Jaillet2 •
Nanyang Technological University1, Massachusetts Institute of Technology2, Singapore–MIT alliance3, Siemens4, Deutsche Bank5
01 Apr 2014-IEEE Transactions on Intelligent Transportation Systems
TL;DR: This work proposes unsupervised learning methods, such as k-means clustering, principal component analysis, and self-organizing maps, to mine spatiotemporal performance trends at the network level and for individual links to improve the performance of intelligent transportation systems.
Abstract: The ability to accurately predict traffic speed in a large and heterogeneous road network has many useful applications, such as route guidance and congestion avoidance. In principle, data-driven methods, such as support vector regression (SVR), can predict traffic with high accuracy because traffic tends to exhibit regular patterns over time. However, in practice, the prediction performance can significantly vary across the network and during different time periods. Insight into those spatiotemporal trends can improve the performance of intelligent transportation systems. Traditional prediction error measures, such as the mean absolute percentage error, provide information about the individual links in the network but do not capture global trends. We propose unsupervised learning methods, such as k-means clustering, principal component analysis, and self-organizing maps, to mine spatiotemporal performance trends at the network level and for individual links. We perform prediction for a large interconnected road network and for multiple prediction horizons with an SVR-based algorithm. We show the effectiveness of the proposed performance analysis methods by applying them to the prediction data of the SVR.
Book•
Hierarchical Relative Entropy Policy Search

[...]

Christian Daniel1, Gerhard Neumann1, Oliver Kroemer1, Jan Peters1•
Technische Universität Darmstadt1
4 Jan 2014
TL;DR: In this article, the problem of learning sub-policies in continuous state action spaces is defined as finding a hierarchical policy that is composed of a high-level gating policy to select the low-level sub-tasks for execution by the agent.
Abstract: Many reinforcement learning (RL) tasks, especially in robotics, consist of multiple sub-tasks that are strongly structured. Such task structures can be exploited by incorporating hierarchical policies that consist of gating networks and sub-policies. However, this concept has only been partially explored for real world settings and complete methods, derived from first principles, are needed. Real world settings are challenging due to large and continuous state-action spaces that are prohibitive for exhaustive sampling methods. We define the problem of learning sub-policies in continuous state action spaces as finding a hierarchical policy that is composed of a high-level gating policy to select the low-level sub-policies for execution by the agent. In order to efficiently share experience with all sub-policies, also called inter-policy learning, we treat these sub-policies as latent variables which allows for distribution of the update information between the sub-policies. We present three different variants of our algorithm, designed to be suitable for a wide variety of real world robot learning tasks and evaluate our algorithms in two real robot learning scenarios as well as several simulations and comparisons.
Posted Content•
Quantum Algorithms for Nearest-Neighbor Methods for Supervised and Unsupervised Learning

[...]

Nathan Wiebe1, Ashish Kapoor1, Krysta M. Svore1•
Microsoft1
09 Jan 2014-arXiv: Quantum Physics
TL;DR: In this paper, the authors present several quantum algorithms for nearest-neighbor learning, which are fast and coherent quantum methods for computing distance metrics such as the inner product and Euclidean distance.
Abstract: We present several quantum algorithms for performing nearest-neighbor learning. At the core of our algorithms are fast and coherent quantum methods for computing distance metrics such as the inner product and Euclidean distance. We prove upper bounds on the number of queries to the input data required to compute these metrics. In the worst case, our quantum algorithms lead to polynomial reductions in query complexity relative to the corresponding classical algorithm. In certain cases, we show exponential or even super-exponential reductions over the classical analog. We study the performance of our quantum nearest-neighbor algorithms on several real-world binary classification tasks and find that the classification accuracy is competitive with classical methods.
Proceedings Article•10.1109/.40•
An Intrusion Detection Model Based on Deep Belief Networks

[...]

Ni Gao1, Ling Gao1, Quanli Gao1, Hai Wang1•
Northwest University (China)1
20 Nov 2014
TL;DR: This paper focuses on an important research problem of Big Data classification in intrusion detection system, and an intrusion detection model based on Deep Belief Networks is proposed to apply in intrusion recognition domain.
Abstract: This paper focuses on an important research problem of Big Data classification in intrusion detection system. Deep Belief Networks is introduced to the field of intrusion detection, and an intrusion detection model based on Deep Belief Networks is proposed to apply in intrusion recognition domain. The deep hierarchical model is a deep neural network classifier of a combination of multilayer unsupervised learning networks, which is called as Restricted Boltzmann Machine, and a supervised learning network, which is called as Back-propagation network. The experimental results on KDD CUP 1999 dataset demonstrate that the performance of Deep Belief Networks model is better than that of SVM and ANN.
Journal Article•10.1093/BIB/BBT034•
Supervised, semi-supervised and unsupervised inference of gene regulatory networks

[...]

Stefan Maetschke, Piyush B. Madhamshettiwar, Melissa J. Davis, Mark A. Ragan
01 Mar 2014-Briefings in Bioinformatics
TL;DR: An extensive evaluation of inference methods on simulated and experimental expression data reveals low prediction accuracies for unsupervised techniques with the notable exception of the Z-SCORE method on knockout data.
Abstract: Inference of gene regulatory network from expression data is a challenging task. Many methods have been developed to this purpose but a comprehensive evaluation that covers unsupervised, semi-supervised and supervised methods, and provides guidelines for their practical application, is lacking. We performed an extensive evaluation of inference methods on simulated and experimental expression data. The results reveal low prediction accuracies for unsupervised techniques with the notable exception of the Z-SCORE method on knockout data. In all other cases, the supervised approach achieved the highest accuracies and even in a semi-supervised setting with small numbers of only positive samples, outperformed the unsupervised techniques.
Journal Article•10.1016/J.NICL.2013.11.002•
Dissecting psychiatric spectrum disorders by generative embedding

[...]

Kay H. Brodersen1, Kay H. Brodersen2, Lorenz Deserno3, Lorenz Deserno4, Florian Schlagenhauf3, Florian Schlagenhauf4, Zhihao Lin1, Zhihao Lin2, William D. Penny5, Joachim M. Buhmann2, Klaas E. Stephan5, Klaas E. Stephan1 •
University of Zurich1, ETH Zurich2, Charité3, Max Planck Society4, Wellcome Trust Centre for Neuroimaging5
01 Jan 2014-NeuroImage: Clinical
TL;DR: The results corroborate the previous findings in stroke patients that generative embedding, compared to analyses of more conventional measures such as functional connectivity or regional activity, can significantly enhance both the interpretability and performance of computational approaches to clinical classification.
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve