Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Unsupervised learning
  4. 2020
  1. Home
  2. Topics
  3. Unsupervised learning
  4. 2020
Showing papers on "Unsupervised learning published in 2020"
Proceedings Article•10.1109/CVPR42600.2020.00975•
Momentum Contrast for Unsupervised Visual Representation Learning

[...]

Kaiming He1, Haoqi Fan1, Yuxin Wu1, Saining Xie1, Ross Girshick1 •
Facebook1
14 Jun 2020
TL;DR: This article proposed Momentum Contrast (MoCo) for unsupervised visual representation learning, which enables building a large and consistent dictionary on-the-fly that facilitates contrastive learning.
Abstract: We present Momentum Contrast (MoCo) for unsupervised visual representation learning. From a perspective on contrastive learning as dictionary look-up, we build a dynamic dictionary with a queue and a moving-averaged encoder. This enables building a large and consistent dictionary on-the-fly that facilitates contrastive unsupervised learning. MoCo provides competitive results under the common linear protocol on ImageNet classification. More importantly, the representations learned by MoCo transfer well to downstream tasks. MoCo can outperform its supervised pre-training counterpart in 7 detection/segmentation tasks on PASCAL VOC, COCO, and other datasets, sometimes surpassing it by large margins. This suggests that the gap between unsupervised and supervised representation learning has been largely closed in many vision tasks.

9,710 citations

Posted Content•
Improved Baselines with Momentum Contrastive Learning

[...]

Xinlei Chen, Haoqi Fan, Ross Girshick, Kaiming He
09 Mar 2020-arXiv: Computer Vision and Pattern Recognition
TL;DR: With simple modifications to MoCo, this note establishes stronger baselines that outperform SimCLR and do not require large training batches, and hopes this will make state-of-the-art unsupervised learning research more accessible.
Abstract: Contrastive unsupervised learning has recently shown encouraging progress, e.g., in Momentum Contrast (MoCo) and SimCLR. In this note, we verify the effectiveness of two of SimCLR's design improvements by implementing them in the MoCo framework. With simple modifications to MoCo---namely, using an MLP projection head and more data augmentation---we establish stronger baselines that outperform SimCLR and do not require large training batches. We hope this will make state-of-the-art unsupervised learning research more accessible. Code will be made public.

3,738 citations

Posted Content•
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

[...]

Mathilde Caron1, Ishan Misra1, Julien Mairal, Priya Goyal1, Piotr Bojanowski1, Armand Joulin1 •
Facebook1
17 Jun 2020-arXiv: Computer Vision and Pattern Recognition
TL;DR: This paper proposes an online algorithm, SwAV, that takes advantage of contrastive methods without requiring to compute pairwise comparisons, and uses a swapped prediction mechanism where it predicts the cluster assignment of a view from the representation of another view.
Abstract: Unsupervised image representations have significantly reduced the gap with supervised pretraining, notably with the recent achievements of contrastive learning methods. These contrastive methods typically work online and rely on a large number of explicit pairwise feature comparisons, which is computationally challenging. In this paper, we propose an online algorithm, SwAV, that takes advantage of contrastive methods without requiring to compute pairwise comparisons. Specifically, our method simultaneously clusters the data while enforcing consistency between cluster assignments produced for different augmentations (or views) of the same image, instead of comparing features directly as in contrastive learning. Simply put, we use a swapped prediction mechanism where we predict the cluster assignment of a view from the representation of another view. Our method can be trained with large and small batches and can scale to unlimited amounts of data. Compared to previous contrastive methods, our method is more memory efficient since it does not require a large memory bank or a special momentum network. In addition, we also propose a new data augmentation strategy, multi-crop, that uses a mix of views with different resolutions in place of two full-resolution views, without increasing the memory or compute requirements much. We validate our findings by achieving 75.3% top-1 accuracy on ImageNet with ResNet-50, as well as surpassing supervised pretraining on all the considered transfer tasks.

3,734 citations

Journal Article•10.1007/S10994-019-05855-6•
A survey on semi-supervised learning

[...]

Jesper E. van Engelen1, Holger H. Hoos1, Holger H. Hoos2•
Leiden University1, University of British Columbia2
01 Feb 2020-Machine Learning
TL;DR: This survey aims to provide researchers and practitioners new to the field as well as more advanced readers with a solid understanding of the main approaches and algorithms developed over the past two decades, with an emphasis on the most prominent and currently relevant work.
Abstract: Semi-supervised learning is the branch of machine learning concerned with using labelled as well as unlabelled data to perform certain learning tasks. Conceptually situated between supervised and unsupervised learning, it permits harnessing the large amounts of unlabelled data available in many use cases in combination with typically smaller sets of labelled data. In recent years, research in this area has followed the general trends observed in machine learning, with much attention directed at neural network-based models and generative learning. The literature on the topic has also expanded in volume and scope, now encompassing a broad spectrum of theory, algorithms and applications. However, no recent surveys exist to collect and organize this knowledge, impeding the ability of researchers and engineers alike to utilize it. Filling this void, we present an up-to-date overview of semi-supervised learning methods, covering earlier work as well as more recent advances. We focus primarily on semi-supervised classification, where the large majority of semi-supervised learning research takes place. Our survey aims to provide researchers and practitioners new to the field as well as more advanced readers with a solid understanding of the main approaches and algorithms developed over the past two decades, with an emphasis on the most prominent and currently relevant work. Furthermore, we propose a new taxonomy of semi-supervised classification algorithms, which sheds light on the different conceptual and methodological approaches for incorporating unlabelled data into the training process. Lastly, we show how the fundamental assumptions underlying most semi-supervised learning algorithms are closely connected to each other, and how they relate to the well-known semi-supervised clustering assumption.

2,388 citations

Journal Article•10.1109/ACCESS.2020.2988796•
Unsupervised K-Means Clustering Algorithm

[...]

Kristina P. Sinaga1, Miin-Shen Yang1•
Chung Yuan Christian University1
20 Apr 2020-IEEE Access
TL;DR: An unsupervised learning schema is constructed for the k-means algorithm so that it is free of initializations without parameter selection and can also simultaneously find an optimal number of clusters.
Abstract: The k-means algorithm is generally the most known and used clustering method. There are various extensions of k-means to be proposed in the literature. Although it is an unsupervised learning to clustering in pattern recognition and machine learning, the k-means algorithm and its extensions are always influenced by initializations with a necessary number of clusters a priori. That is, the k-means algorithm is not exactly an unsupervised clustering method. In this paper, we construct an unsupervised learning schema for the k-means algorithm so that it is free of initializations without parameter selection and can also simultaneously find an optimal number of clusters. That is, we propose a novel unsupervised k-means (U-k-means) clustering algorithm with automatically finding an optimal number of clusters without giving any initialization and parameter selection. The computational complexity of the proposed U-k-means clustering algorithm is also analyzed. Comparisons between the proposed U-k-means and other existing methods are made. Experimental results and comparisons actually demonstrate these good aspects of the proposed U-k-means clustering algorithm.

1,539 citations

Journal Article•10.3390/TECHNOLOGIES9010002•
A Survey on Contrastive Self-Supervised Learning

[...]

Ashish Jaiswal, Ashwin Ramesh Babu, Mohammad Zaki Zadeh, Debapriya Banerjee, Fillia Makedon 
31 Oct 2020
TL;DR: In contrastive self-supervised learning as discussed by the authors, augmented versions of the same sample close to each other while trying to push away embeddings from different samples is used to learn representations for several downstream tasks.
Abstract: Self-supervised learning has gained popularity because of its ability to avoid the cost of annotating large-scale datasets. It is capable of adopting self-defined pseudolabels as supervision and use the learned representations for several downstream tasks. Specifically, contrastive learning has recently become a dominant component in self-supervised learning for computer vision, natural language processing (NLP), and other domains. It aims at embedding augmented versions of the same sample close to each other while trying to push away embeddings from different samples. This paper provides an extensive review of self-supervised methods that follow the contrastive approach. The work explains commonly used pretext tasks in a contrastive learning setup, followed by different architectures that have been proposed so far. Next, we present a performance comparison of different methods for multiple downstream tasks such as image classification, object detection, and action recognition. Finally, we conclude with the limitations of the current methods and the need for further techniques and future directions to make meaningful progress.

1,004 citations

Journal Article•10.1038/S41586-020-2038-X•
Ultrafast machine vision with 2D material neural network image sensors

[...]

Lukas Mennel1, Joanna Symonowicz1, Stefan Wachter1, Dmitry K. Polyushkin1, Aday J. Molina-Mendoza1, Thomas Mueller1 •
Vienna University of Technology1
04 Mar 2020-Nature
TL;DR: It is demonstrated that an image sensor can itself constitute an ANN that can simultaneously sense and process optical images without latency, and is trained to classify and encode images with high throughput, acting as an artificial neural network.
Abstract: Machine vision technology has taken huge leaps in recent years, and is now becoming an integral part of various intelligent systems, including autonomous vehicles and robotics. Usually, visual information is captured by a frame-based camera, converted into a digital format and processed afterwards using a machine-learning algorithm such as an artificial neural network (ANN)1. The large amount of (mostly redundant) data passed through the entire signal chain, however, results in low frame rates and high power consumption. Various visual data preprocessing techniques have thus been developed2-7 to increase the efficiency of the subsequent signal processing in an ANN. Here we demonstrate that an image sensor can itself constitute an ANN that can simultaneously sense and process optical images without latency. Our device is based on a reconfigurable two-dimensional (2D) semiconductor8,9 photodiode10-12 array, and the synaptic weights of the network are stored in a continuously tunable photoresponsivity matrix. We demonstrate both supervised and unsupervised learning and train the sensor to classify and encode images that are optically projected onto the chip with a throughput of 20 million bins per second.

801 citations

Journal Article•10.1109/ACCESS.2020.3031549•
Contrastive Representation Learning: A Framework and Review

[...]

Phuc H. Le-Khac1, Graham Healy1, Alan F. Smeaton1•
Dublin City University1
16 Oct 2020-IEEE Access
TL;DR: A general Contrastive Representation Learning framework is proposed that simplifies and unifies many different contrastive learning methods and a taxonomy for each of the components is provided in order to summarise and distinguish it from other forms of machine learning.
Abstract: Contrastive Learning has recently received interest due to its success in self-supervised representation learning in the computer vision domain. However, the origins of Contrastive Learning date as far back as the 1990s and its development has spanned across many fields and domains including Metric Learning and natural language processing. In this paper, we provide a comprehensive literature review and we propose a general Contrastive Representation Learning framework that simplifies and unifies many different contrastive learning methods. We also provide a taxonomy for each of the components of contrastive learning in order to summarise it and distinguish it from other forms of machine learning. We then discuss the inductive biases which are present in any contrastive learning system and we analyse our framework under different views from various sub-fields of Machine Learning. Examples of how contrastive learning has been applied in computer vision, natural language processing, audio processing, and others, as well as in Reinforcement Learning are also presented. Finally, we discuss the challenges and some of the most promising future research directions ahead.

785 citations

Proceedings Article•10.1145/3394486.3403392•
USAD: UnSupervised Anomaly Detection on Multivariate Time Series

[...]

Julien Audibert1, Pietro Michiardi1, Frédéric Guyard, Sébastien Marti, Maria A. Zuluaga1 •
Institut Eurécom1
23 Aug 2020
TL;DR: A fast and stable method called UnSupervised Anomaly Detection for multivariate time series (USAD) based on adversely trained autoencoders capable of learning in an unsupervised way is proposed.
Abstract: The automatic supervision of IT systems is a current challenge at Orange. Given the size and complexity reached by its IT operations, the number of sensors needed to obtain measurements over time, used to infer normal and abnormal behaviors, has increased dramatically making traditional expert-based supervision methods slow or prone to errors. In this paper, we propose a fast and stable method called UnSupervised Anomaly Detection for multivariate time series (USAD) based on adversely trained autoencoders. Its autoencoder architecture makes it capable of learning in an unsupervised way. The use of adversarial training and its architecture allows it to isolate anomalies while providing fast training. We study the properties of our methods through experiments on five public datasets, thus demonstrating its robustness, training speed and high anomaly detection performance. Through a feasibility study using Orange's proprietary data we have been able to validate Orange's requirements on scalability, stability, robustness, training speed and high performance.

738 citations

Proceedings Article•10.1109/CVPR42600.2020.01438•
Learning Memory-Guided Normality for Anomaly Detection

[...]

Hyunjong Park1, Jongyoun Noh1, Bumsub Ham1•
Yonsei University1
14 Jun 2020
TL;DR: In this article, an unsupervised learning approach to anomaly detection that considers the diversity of normal patterns explicitly, while lessening the representation capacity of CNNs is presented. But the main drawbacks of these approaches are that they do not consider the diversity this article.
Abstract: We address the problem of anomaly detection, that is, detecting anomalous events in a video sequence. Anomaly detection methods based on convolutional neural networks (CNNs) typically leverage proxy tasks, such as reconstructing input video frames, to learn models describing normality without seeing anomalous samples at training time, and quantify the extent of abnormalities using the reconstruction error at test time. The main drawbacks of these approaches are that they do not consider the diversity of normal patterns explicitly, and the powerful representation capacity of CNNs allows to reconstruct abnormal video frames. To address this problem, we present an unsupervised learning approach to anomaly detection that considers the diversity of normal patterns explicitly, while lessening the representation capacity of CNNs. To this end, we propose to use a memory module with a new update scheme where items in the memory record prototypical patterns of normal data. We also present novel feature compactness and separateness losses to train the memory, boosting the discriminative power of both memory items and deeply learned features from normal data. Experimental results on standard benchmarks demonstrate the effectiveness and efficiency of our approach, which outperforms the state of the art.

638 citations

Proceedings Article•10.1145/3366423.3380112•
Graph Representation Learning via Graphical Mutual Information Maximization

[...]

Zhen Peng1, Wenbing Huang2, Minnan Luo1, Qinghua Zheng1, Yu Rong3, Tingyang Xu3, Junzhou Huang3 •
Xi'an Jiaotong University1, Tsinghua University2, Tencent3
20 Apr 2020
TL;DR: An unsupervised learning model trained by maximizing GMI between the input and output of a graph neural encoder is developed, which outperforms state-of-the-art unsuper supervised counterparts, and even sometimes exceeds the performance of supervised ones.
Abstract: The richness in the content of various information networks such as social networks and communication networks provides the unprecedented potential for learning high-quality expressive representations without external supervision. This paper investigates how to preserve and extract the abundant information from graph-structured data into embedding space in an unsupervised manner. To this end, we propose a novel concept, Graphical Mutual Information (GMI), to measure the correlation between input graphs and high-level hidden representations. GMI generalizes the idea of conventional mutual information computations from vector space to the graph domain where measuring mutual information from two aspects of node features and topological structure is indispensable. GMI exhibits several benefits: First, it is invariant to the isomorphic transformation of input graphs—an inevitable constraint in many existing graph representation learning algorithms; Besides, it can be efficiently estimated and maximized by current mutual information estimation methods such as MINE; Finally, our theoretical analysis confirms its correctness and rationality. With the aid of GMI, we develop an unsupervised learning model trained by maximizing GMI between the input and output of a graph neural encoder. Considerable experiments on transductive as well as inductive node classification and link prediction demonstrate that our method outperforms state-of-the-art unsupervised counterparts, and even sometimes exceeds the performance of supervised ones.
Posted Content•
A Transformer-based Framework for Multivariate Time Series Representation Learning

[...]

George Zerveas1, Srideepika Jayaraman2, Dhaval Patel2, Anuradha Bhamidipaty2, Carsten Eickhoff1 •
Brown University1, IBM2
06 Oct 2020-arXiv: Learning
TL;DR: A novel framework for multivariate time series representation learning based on the transformer encoder architecture, which can offer substantial performance benefits over fully supervised learning on downstream tasks, both with but even without leveraging additional unlabeled data, i.e., by reusing the existing data samples.
Abstract: In this work we propose for the first time a transformer-based framework for unsupervised representation learning of multivariate time series. Pre-trained models can be potentially used for downstream tasks such as regression and classification, forecasting and missing value imputation. By evaluating our models on several benchmark datasets for multivariate time series regression and classification, we show that not only does our modeling approach represent the most successful method employing unsupervised learning of multivariate time series presented to date, but also that it exceeds the current state-of-the-art performance of supervised methods; it does so even when the number of training samples is very limited, while offering computational efficiency. Finally, we demonstrate that unsupervised pre-training of our transformer models offers a substantial performance benefit over fully supervised learning, even without leveraging additional unlabeled data, i.e., by reusing the same data samples through the unsupervised objective.
Journal Article•10.1109/COMST.2020.2965856•
Thirty Years of Machine Learning: The Road to Pareto-Optimal Wireless Networks

[...]

Jingjing Wang1, Chunxiao Jiang1, Haijun Zhang2, Yong Ren1, Kwang-Cheng Chen3, Lajos Hanzo4 •
Tsinghua University1, University of Science and Technology Beijing2, University of South Florida3, University of Southampton4
13 Jan 2020-IEEE Communications Surveys and Tutorials
TL;DR: In this article, the authors review the thirty-year history of ML by elaborating on supervised learning, unsupervised learning, reinforcement learning and deep learning and investigate their employment in the compelling applications of wireless networks, including heterogeneous networks, cognitive radios (CR), Internet of Things (IoT), machine to machine networks (M2M), and so on.
Abstract: Future wireless networks have a substantial potential in terms of supporting a broad range of complex compelling applications both in military and civilian fields, where the users are able to enjoy high-rate, low-latency, low-cost and reliable information services. Achieving this ambitious goal requires new radio techniques for adaptive learning and intelligent decision making because of the complex heterogeneous nature of the network structures and wireless services. Machine learning (ML) algorithms have great success in supporting big data analytics, efficient parameter estimation and interactive decision making. Hence, in this article, we review the thirty-year history of ML by elaborating on supervised learning, unsupervised learning, reinforcement learning and deep learning. Furthermore, we investigate their employment in the compelling applications of wireless networks, including heterogeneous networks (HetNets), cognitive radios (CR), Internet of Things (IoT), machine to machine networks (M2M), and so on. This article aims for assisting the readers in clarifying the motivation and methodology of the various ML algorithms, so as to invoke them for hitherto unexplored services as well as scenarios of future wireless networks.
Journal Article•10.3390/S20020342•
Face Recognition Systems: A Survey

[...]

Yassin Kortli, Maher Jridi, Ayman Al Falou, Mohamed Atri1•
King Khalid University1
07 Jan 2020-Sensors
TL;DR: This survey is to review some well-known techniques for each approach and to give the taxonomy of their categories and a solid discussion is given about future directions in terms of techniques to be used for face recognition.
Abstract: Over the past few decades, interest in theories and algorithms for face recognition has been growing rapidly. Video surveillance, criminal identification, building access control, and unmanned and autonomous vehicles are just a few examples of concrete applications that are gaining attraction among industries. Various techniques are being developed including local, holistic, and hybrid approaches, which provide a face image description using only a few face image features or the whole facial features. The main contribution of this survey is to review some well-known techniques for each approach and to give the taxonomy of their categories. In the paper, a detailed comparison between these techniques is exposed by listing the advantages and the disadvantages of their schemes in terms of robustness, accuracy, complexity, and discrimination. One interesting feature mentioned in the paper is about the database used for face recognition. An overview of the most commonly used databases, including those of supervised and unsupervised learning, is given. Numerical results of the most interesting techniques are given along with the context of experiments and challenges handled by these techniques. Finally, a solid discussion is given in the paper about future directions in terms of techniques to be used for face recognition.
Book Chapter•10.1007/978-3-030-22475-2_1•
A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

[...]

Mohamed Alloghani1, Dhiya Al-Jumeily1, Jamila Mustafina2, Abir Hussain1, Ahmed J. Aljaaf1 •
Liverpool John Moores University1, Kazan Federal University2
1 Jan 2020
TL;DR: A systematic review of scholarly articles published between 2015 and 2018 addressing or implementing supervised and unsupervised machine learning techniques in different problem-solving paradigms revealed decision tree, support vector machine, and Naive Bayes algorithms appeared to be the most cited, discussed, and implemented supervised learners.
Abstract: Machine learning is as growing as fast as concepts such as Big data and the field of data science in general. The purpose of the systematic review was to analyze scholarly articles that were published between 2015 and 2018 addressing or implementing supervised and unsupervised machine learning techniques in different problem-solving paradigms. Using the elements of PRISMA, the review process identified 84 scholarly articles that had been published in different journals. Of the 84 articles, 6 were published before 2015 despite their metadata indicating that they were published in 2015. The existence of the six articles in the final papers was attributed to errors in indexing. Nonetheless, from the reviewed papers, decision tree, support vector machine, and Naive Bayes algorithms appeared to be the most cited, discussed, and implemented supervised learners. Conversely, k-means, hierarchical clustering, and principal component analysis also emerged as the commonly used unsupervised learners. The review also revealed other commonly used algorithms that include ensembles and reinforce learners, and future systematic reviews can focus on them because of the developments that machine learning and data science is undergoing at the moment.
Journal Article•10.1007/S40745-020-00253-5•
A Comprehensive Survey of Loss Functions in Machine Learning

[...]

Qi Wang1, Yue Ma1, Kun Zhao2, Yingjie Tian1•
Chinese Academy of Sciences1, Beijing Wuzi University2
12 Apr 2020-Annals of Data Science
TL;DR: This paper summarizes and analyzes 31 classical loss functions in machine learning from the aspects of traditional machine learning and deep learning respectively and mainly selects object detection and face recognition to introduces their loss functions.
Abstract: As one of the important research topics in machine learning, loss function plays an important role in the construction of machine learning algorithms and the improvement of their performance, which has been concerned and explored by many researchers. But it still has a big gap to summarize, analyze and compare the classical loss functions. Therefore, this paper summarizes and analyzes 31 classical loss functions in machine learning. Specifically, we describe the loss functions from the aspects of traditional machine learning and deep learning respectively. The former is divided into classification problem, regression problem and unsupervised learning according to the task type. The latter is subdivided according to the application scenario, and here we mainly select object detection and face recognition to introduces their loss functions. In each task or application, in addition to analyzing each loss function from formula, meaning, image and algorithm, the loss functions under the same task or application are also summarized and compared to deepen the understanding and provide help for the selection and improvement of loss function.
Journal Article•10.1109/TAI.2021.3054609•
A Decade Survey of Transfer Learning (2010–2020)

[...]

Shuteng Niu1, Yongxin Liu1, Jian Wang1, Houbing Song1•
Embry-Riddle Aeronautical University, Daytona Beach1
1 Apr 2020
TL;DR: Transfer Learning (TL) as discussed by the authors can be classified into four classes: transductive learning, inductive, unsupervised learning and negative learning, and each category can be organized into four learning types: learning on instances, learning on features, learningon parameters, and learning on relations.
Abstract: Transfer learning (TL) has been successfully applied to many real-world problems that traditional machine learning (ML) cannot handle, such as image processing, speech recognition, and natural language processing (NLP). Commonly, TL tends to address three main problems of traditional machine learning: (1) insufficient labeled data, (2) incompatible computation power, and (3) distribution mismatch. In general, TL can be organized into four categories: transductive learning, inductive learning, unsupervised learning, and negative learning. Furthermore, each category can be organized into four learning types: learning on instances, learning on features, learning on parameters, and learning on relations. This article presents a comprehensive survey on TL. In addition, this article presents the state of the art, current trends, applications, and open challenges.
Posted Content•
Fairness in Machine Learning: A Survey.

[...]

Simon Caton1, Christian Haas2•
University College Dublin1, University of Nebraska Omaha2
04 Oct 2020-arXiv: Learning
TL;DR: An overview of the different schools of thought and approaches to mitigating (social) biases and increase fairness in the Machine Learning literature is provided, organises approaches into the widely accepted framework of pre-processing, in- processing, and post-processing methods, subcategorizing into a further 11 method areas.
Abstract: As Machine Learning technologies become increasingly used in contexts that affect citizens, companies as well as researchers need to be confident that their application of these methods will not have unexpected social implications, such as bias towards gender, ethnicity, and/or people with disabilities. There is significant literature on approaches to mitigate bias and promote fairness, yet the area is complex and hard to penetrate for newcomers to the domain. This article seeks to provide an overview of the different schools of thought and approaches to mitigating (social) biases and increase fairness in the Machine Learning literature. It organises approaches into the widely accepted framework of pre-processing, in-processing, and post-processing methods, subcategorizing into a further 11 method areas. Although much of the literature emphasizes binary classification, a discussion of fairness in regression, recommender systems, unsupervised learning, and natural language processing is also provided along with a selection of currently available open source libraries. The article concludes by summarising open challenges articulated as four dilemmas for fairness research.
Journal Article•10.1177/0049124117729703•
Computational Grounded Theory: A Methodological Framework

[...]

Laura K. Nelson1•
Northeastern University1
01 Feb 2020-Sociological Methods & Research
TL;DR: A three-step methodological framework called computational grounded theory is proposed, which combines expert human knowledge and hermeneutic skills with the processing power and pattern recognition of computers, producing a more methodologically rigorous but interpretive approach to content analysis.
Abstract: This article proposes a three-step methodological framework called computational grounded theory, which combines expert human knowledge and hermeneutic skills with the processing power and pattern ...
Journal Article•10.1016/J.COMNET.2020.107247•
Building an Efficient Intrusion Detection System Based on Feature Selection and Ensemble Classifier

[...]

Yuyang Zhou1, Yuyang Zhou2, Guang Cheng2, Guang Cheng1, Shanqing Jiang2, Mian Dai1, Mian Dai2 •
Chinese Ministry of Education1, Southeast University2
19 Jun 2020-Computer Networks
TL;DR: Wang et al. as discussed by the authors proposed a new intrusion detection framework based on the feature selection and ensemble learning techniques, and this framework is able to exhibit better performance than other related and state of the art approaches under several metrics.
Proceedings Article•10.1145/3394486.3406704•
Robust Deep Learning Methods for Anomaly Detection

[...]

Raghavendra Chalapathy1, Nguyen Lu Dang Khoa1, Sanjay Chawla2•
Commonwealth Scientific and Industrial Research Organisation1, Qatar Computing Research Institute2
23 Aug 2020
TL;DR: The tutorial will revisit well known unsupervised learning techniques in deep learning including autoencoders and generative adversarial networks (GANs) from the perspective of anomaly detection to give the audience a more grounded perspective on un supervised deep learning methods.
Abstract: Anomaly detection is an important problem that has been well-studied within diverse research areas and application domains. A robust anomaly detection system identifies rare events and patterns in the absence of labelled data. The identified patterns provide crucial insights about both the fidelity of the data and deviations in the underlying data-generating process. For example a surveillance system designed to monitor the emergence of new epidemics will use a robust anomaly detection methods to separate spurious associations from genuine indicators of an epidemic with minimal lag time. The key concept in anomaly detection is the notion of "robustness'', i.e., designing models and representations which are less-sensitive to small changes in the underlying data distribution. The canonical example is that the median is more robust than the mean as an estimator. The tutorial will primarily help researchers and developers design deep learning architectures and loss functions where the learnt representation behave more like the "median'' rather than the "mean.'' The tutorial will revisit well known unsupervised learning techniques in deep learning including autoencoders and generative adversarial networks (GANs) from the perspective of anomaly detection. This in turn will give the audience a more grounded perspective on unsupervised deep learning methods. All the methods will be introduced in a hands-on manner to demonstrate how high-level ideas and concepts get translated to practical real code.
Posted Content•
Learning Memory-guided Normality for Anomaly Detection

[...]

Hyunjong Park1, Jongyoun Noh1, Bumsub Ham1•
Yonsei University1
30 Mar 2020-arXiv: Computer Vision and Pattern Recognition
TL;DR: This work proposes to use a memory module with a new update scheme where items in the memory record prototypical patterns of normal data, boosting the discriminative power of both memory items and deeply learned features from normal data and lessening the representation capacity of CNNs.
Abstract: We address the problem of anomaly detection, that is, detecting anomalous events in a video sequence. Anomaly detection methods based on convolutional neural networks (CNNs) typically leverage proxy tasks, such as reconstructing input video frames, to learn models describing normality without seeing anomalous samples at training time, and quantify the extent of abnormalities using the reconstruction error at test time. The main drawbacks of these approaches are that they do not consider the diversity of normal patterns explicitly, and the powerful representation capacity of CNNs allows to reconstruct abnormal video frames. To address this problem, we present an unsupervised learning approach to anomaly detection that considers the diversity of normal patterns explicitly, while lessening the representation capacity of CNNs. To this end, we propose to use a memory module with a new update scheme where items in the memory record prototypical patterns of normal data. We also present novel feature compactness and separateness losses to train the memory, boosting the discriminative power of both memory items and deeply learned features from normal data. Experimental results on standard benchmarks demonstrate the effectiveness and efficiency of our approach, which outperforms the state of the art.
Journal Article•10.1038/S41524-019-0267-Z•
Machine learning enabled autonomous microstructural characterization in 3D samples

[...]

Henry Chan1, Mathew J. Cherukara1, Troy D. Loeffler1, Badri Narayanan1, Badri Narayanan2, Subramanian K. R. S. Sankaranarayanan1, Subramanian K. R. S. Sankaranarayanan3 •
Argonne National Laboratory1, University of Louisville2, University of Illinois at Chicago3
6 Jan 2020
TL;DR: This work introduces an unsupervised machine learning (ML) based technique for the identification and characterization of microstructures in three-dimensional samples obtained from molecular dynamics simulations, particle tracking data, or experiments that combines topology classification, image processing, and clustering algorithms.
Abstract: We introduce an unsupervised machine learning (ML) based technique for the identification and characterization of microstructures in three-dimensional (3D) samples obtained from molecular dynamics simulations, particle tracking data, or experiments. Our technique combines topology classification, image processing, and clustering algorithms, and can handle a wide range of microstructure types including grains in polycrystalline materials, voids in porous systems, and structures from self/directed assembly in soft-matter complex solutions. Our technique does not require a priori microstructure description of the target system and is insensitive to disorder such as extended defects in polycrystals arising from line and plane defects. We demonstrate quantitively that our technique provides unbiased microstructural information such as precise quantification of grains and their size distributions in 3D polycrystalline samples, characterizes features such as voids and porosity in 3D polymeric samples and micellar size distribution in 3D complex fluids. To demonstrate the efficacy of our ML approach, we benchmark it against a diverse set of synthetic data samples representing nanocrystalline metals, polymers and complex fluids as well as experimentally published characterization data. Our technique is computationally efficient and provides a way to quickly identify, track, and quantify complex microstructural features that impact the observed material behavior.
Journal Article•10.1609/AAAI.V34I07.6936•
FusionDN: A Unified Densely Connected Network for Image Fusion.

[...]

Han Xu1, Jiayi Ma1, Zhuliang Le1, Junjun Jiang2, Xiaojie Guo3 •
Wuhan University1, Harbin Institute of Technology2, Tianjin University3
3 Apr 2020
TL;DR: A new unsupervised and unified densely connected network for different types of image fusion tasks, termed as FusionDN, which obtains a single model applicable to multiple fusion tasks by applying elastic weight consolidation to avoid forgetting what has been learned from previous tasks when training multiple tasks sequentially.
Abstract: In this paper, we present a new unsupervised and unified densely connected network for different types of image fusion tasks, termed as FusionDN. In our method, the densely connected network is trained to generate the fused image conditioned on source images. Meanwhile, a weight block is applied to obtain two data-driven weights as the retention degrees of features in different source images, which are the measurement of the quality and the amount of information in them. Losses of similarities based on these weights are applied for unsupervised learning. In addition, we obtain a single model applicable to multiple fusion tasks by applying elastic weight consolidation to avoid forgetting what has been learned from previous tasks when training multiple tasks sequentially, rather than train individual models for every fusion task or jointly train tasks roughly. Qualitative and quantitative results demonstrate the advantages of FusionDN compared with state-of-the-art methods in different fusion tasks.
Book Chapter•10.1007/978-3-030-58580-8_34•
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding

[...]

Saining Xie1, Jiatao Gu1, Demi Guo1, Charles R. Qi1, Leonidas J. Guibas2, Or Litany2 •
Facebook1, Stanford University2
23 Aug 2020
TL;DR: In this article, a triplet of architecture, source dataset, and contrastive loss for pre-training is used for 3D point cloud point cloud segmentation and detection in indoor and outdoor, real and synthetic datasets.
Abstract: Arguably one of the top success stories of deep learning is transfer learning. The finding that pre-training a network on a rich source set (e.g., ImageNet) can help boost performance once fine-tuned on a usually much smaller target set, has been instrumental to many applications in language and vision. Yet, very little is known about its usefulness in 3D point cloud understanding. We see this as an opportunity considering the effort required for annotating data in 3D. In this work, we aim at facilitating research on 3D representation learning. Different from previous works, we focus on high-level scene understanding tasks. To this end, we select a suit of diverse datasets and tasks to measure the effect of unsupervised pre-training on a large source set of 3D scenes. Our findings are extremely encouraging: using a unified triplet of architecture, source dataset, and contrastive loss for pre-training, we achieve improvement over recent best results in segmentation and detection across 6 different benchmarks for indoor and outdoor, real and synthetic datasets – demonstrating that the learned representation can generalize across domains. Furthermore, the improvement was similar to supervised pre-training, suggesting that future efforts should favor scaling data collection over more detailed annotation. We hope these findings will encourage more research on unsupervised pretext task design for 3D deep learning.
Book Chapter•10.1007/978-3-030-58607-2_16•
SCAN: Learning to Classify Images Without Labels

[...]

Wouter Van Gansbeke1, Simon Vandenhende1, Stamatios Georgoulis2, Marc Proesmans1, Luc Van Gool1 •
Katholieke Universiteit Leuven1, ETH Zurich2
23 Aug 2020
TL;DR: Wang et al. as mentioned in this paper proposed a two-step approach where feature learning and clustering are decoupled, and obtained semantically meaningful features as a prior in a learnable clustering approach.
Abstract: Can we automatically group images into semantically meaningful clusters when ground-truth annotations are absent? The task of unsupervised image classification remains an important, and open challenge in computer vision. Several recent approaches have tried to tackle this problem in an end-to-end fashion. In this paper, we deviate from recent works, and advocate a two-step approach where feature learning and clustering are decoupled. First, a self-supervised task from representation learning is employed to obtain semantically meaningful features. Second, we use the obtained features as a prior in a learnable clustering approach. In doing so, we remove the ability for cluster learning to depend on low-level features, which is present in current end-to-end learning approaches. Experimental evaluation shows that we outperform state-of-the-art methods by large margins, in particular \(+26.6\%\) on CIFAR10, \(+25.0\%\) on CIFAR100-20 and \(+21.3\%\) on STL10 in terms of classification accuracy. Furthermore, our method is the first to perform well on a large-scale dataset for image classification. In particular, we obtain promising results on ImageNet, and outperform several semi-supervised learning methods in the low-data regime without the use of any ground-truth annotations. The code is available at www.github.com/wvangansbeke/Unsupervised-Classification.git.
Proceedings Article•10.1109/ICASSP40776.2020.9053569•
Multi-Task Self-Supervised Learning for Robust Speech Recognition

[...]

Mirco Ravanelli1, Jianyuan Zhong2, Santiago Pascual3, Pawel Swietojanski4, Joao Monteiro5, Jan Trmal6, Yoshua Bengio1 •
Université de Montréal1, University of Rochester2, Polytechnic University of Catalonia3, University of New South Wales4, Institut national de la recherche scientifique5, Johns Hopkins University6
25 Jan 2020
TL;DR: PASE+ is proposed, an improved version of PASE that better learns short- and long-term speech dynamics with an efficient combination of recurrent and convolutional networks and learns transferable representations suitable for highly mismatched acoustic conditions.
Abstract: Despite the growing interest in unsupervised learning, extracting meaningful knowledge from unlabelled audio remains an open challenge. To take a step in this direction, we recently proposed a problem-agnostic speech encoder (PASE), that combines a convolutional encoder followed by multiple neural networks, called workers, tasked to solve self-supervised problems (i.e., ones that do not require manual annotations as ground truth). PASE was shown to capture relevant speech information, including speaker voice-print and phonemes. This paper proposes PASE+, an improved version of PASE for robust speech recognition in noisy and reverberant environments. To this end, we employ an online speech distortion module, that contaminates the input signals with a variety of random disturbances. We then propose a revised encoder that better learns short- and long-term speech dynamics with an efficient combination of recurrent and convolutional networks. Finally, we refine the set of workers used in self-supervision to encourage better cooperation.Results on TIMIT, DIRHA and CHiME-5 show that PASE+ significantly outperforms both the previous version of PASE as well as common acoustic features. Interestingly, PASE+ learns transferable representations suitable for highly mismatched acoustic conditions.
Journal Article•10.1109/TCOMM.2019.2960361•
A Deep Learning Framework for Optimization of MISO Downlink Beamforming

[...]

Wenchao Xia1, Gan Zheng2, Yongxu Zhu3, Jun Zhang1, Jiangzhou Wang4, Athina P. Petropulu5 •
Nanjing University of Posts and Telecommunications1, Loughborough University2, London South Bank University3, University of Kent4, Rutgers University5
14 Jan 2020-IEEE Transactions on Communications
TL;DR: A deep learning framework for the optimization of downlink beamforming is proposed based on convolutional neural networks and exploitation of expert knowledge, such as the uplink-downlink duality and the known structure of optimal solutions, paving the way for fast realization of optimal beamforming in multiuser MISO systems.
Abstract: Beamforming is an effective means to improve the quality of the received signals in multiuser multiple-input-single-output (MISO) systems. Traditionally, finding the optimal beamforming solution relies on iterative algorithms, which introduces high computational delay and is thus not suitable for real-time implementation. In this paper, we propose a deep learning framework for the optimization of downlink beamforming. In particular, the solution is obtained based on convolutional neural networks and exploitation of expert knowledge, such as the uplink-downlink duality and the known structure of optimal solutions. Using this framework, we construct three beamforming neural networks (BNNs) for three typical optimization problems, i.e., the signal-to-interference-plus-noise ratio (SINR) balancing problem, the power minimization problem, and the sum rate maximization problem. For the former two problems the BNNs adopt the supervised learning approach, while for the sum rate maximization problem a hybrid method of supervised and unsupervised learning is employed. Simulation results show that the BNNs can achieve near-optimal solutions to the SINR balancing and power minimization problems, and a performance close to that of the weighted minimum mean squared error algorithm for the sum rate maximization problem, while in all cases enjoy significantly reduced computational complexity. In summary, this work paves the way for fast realization of optimal beamforming in multiuser MISO systems.
Proceedings Article•
On Mutual Information Maximization for Representation Learning

[...]

Michael Tschannen1, Josip Djolonga1, Paul K. Rubenstein2, Sylvain Gelly1, Mario Lucic1 •
Google1, Max Planck Society2
30 Apr 2020
TL;DR: This paper argues, and provides empirical evidence, that the success of these methods cannot be attributed to the properties of MI alone, and that they strongly depend on the inductive bias in both the choice of feature extractor architectures and the parametrization of the employed MI estimators.
Abstract: Many recent methods for unsupervised or self-supervised representation learning train feature extractors by maximizing an estimate of the mutual information (MI) between different views of the data. This comes with several immediate problems: For example, MI is notoriously hard to estimate, and using it as an objective for representation learning may lead to highly entangled representations due to its invariance under arbitrary invertible transformations. Nevertheless, these methods have been repeatedly shown to excel in practice. In this paper we argue, and provide empirical evidence, that the success of these methods cannot be attributed to the properties of MI alone, and that they strongly depend on the inductive bias in both the choice of feature extractor architectures and the parametrization of the employed MI estimators. Finally, we establish a connection to deep metric learning and argue that this interpretation may be a plausible explanation for the success of the recently introduced methods.
Proceedings Article•10.1109/CVPR42600.2020.01077•
Visual Commonsense R-CNN

[...]

Tan Wang1, Jianqiang Huang2, Hanwang Zhang2, Qianru Sun3•
University of Electronic Science and Technology of China1, Nanyang Technological University2, Singapore Management University3
14 Jun 2020
TL;DR: A novel unsupervised feature representation learning method, Visual Commonsense Region-based Convolutional Neural Network (VC R-CNN), is presented to serve as an improved visual region encoder for high-level tasks such as captioning and VQA, and observes consistent performance boosts across them.
Abstract: We present a novel unsupervised feature representation learning method, Visual Commonsense Region-based Convolutional Neural Network (VC R-CNN), to serve as an improved visual region encoder for high-level tasks such as captioning and VQA. Given a set of detected object regions in an image (e.g., using Faster R-CNN), like any other unsupervised feature learning methods (e.g., word2vec), the proxy training objective of VC R-CNN is to predict the contextual objects of a region. However, they are fundamentally different: the prediction of VC R-CNN is by using causal intervention: P(Y|do(X)), while others are by using the conventional likelihood: P(Y|X). This is also the core reason why VC R-CNN can learn “sense-making” knowledge like chair can be sat — while not just “common” co-occurrences such as chair is likely to exist if table is observed. We extensively apply VC R-CNN features in prevailing models of three popular tasks: Image Captioning, VQA, and VCR, and observe consistent performance boosts across them, achieving many new state-of-the-arts.
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve