Top 2342 papers published in the topic of Unsupervised learning in 2020

Showing papers on "Unsupervised learning" published in 2020

Proceedings Article•10.1109/CVPR42600.2020.00975•

Momentum Contrast for Unsupervised Visual Representation Learning

Kaiming He¹, Haoqi Fan¹, Yuxin Wu¹, Saining Xie¹, Ross Girshick¹•Institutions (1)

14 Jun 2020

TL;DR: This article proposes Momentum Contrast (MoCo) for unsupervised visual representation learning, which builds a large and consistent dictionary on-the-fly to facilitate contrastive learning.

Abstract: We present Momentum Contrast (MoCo) for unsupervised visual representation learning. From a perspective on contrastive learning as dictionary look-up, we build a dynamic dictionary with a queue and a moving-averaged encoder. This enables building a large and consistent dictionary on-the-fly that facilitates contrastive unsupervised learning. MoCo provides competitive results under the common linear protocol on ImageNet classification. More importantly, the representations learned by MoCo transfer well to downstream tasks. MoCo can outperform its supervised pre-training counterpart in 7 detection/segmentation tasks on PASCAL VOC, COCO, and other datasets, sometimes surpassing it by large margins. This suggests that the gap between unsupervised and supervised representation learning has been largely closed in many vision tasks.
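
The two ingredients named in the abstract, a queue-based dictionary and a moving-averaged encoder, can be illustrated with a minimal sketch. This is not the authors' implementation: the names `momentum_update` and `KeyQueue`, the flat lists of parameters, and the default coefficient `m` are all assumptions made for illustration.

```python
from collections import deque


def momentum_update(key_params, query_params, m=0.999):
    """Return key-encoder parameters moved slightly toward the query encoder.

    Each parameter follows k <- m * k + (1 - m) * q, i.e. a moving average.
    The default value of m is an illustrative assumption.
    """
    return [m * k + (1.0 - m) * q for k, q in zip(key_params, query_params)]


class KeyQueue:
    """Fixed-size FIFO dictionary: new keys push out the oldest ones."""

    def __init__(self, max_size):
        self._keys = deque(maxlen=max_size)

    def enqueue(self, batch_keys):
        # deque(maxlen=...) silently drops the oldest entries once it is full.
        self._keys.extend(batch_keys)

    def keys(self):
        return list(self._keys)
```

Because the queue size is independent of the batch size, the dictionary can be large, and because the key encoder changes slowly, the keys held in the queue stay roughly consistent with one another.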

9,710 citations

Posted Content•

Improved Baselines with Momentum Contrastive Learning

Xinlei Chen, Haoqi Fan, Ross Girshick, Kaiming He

09 Mar 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: With simple modifications to MoCo, this note establishes stronger baselines that outperform SimCLR and do not require large training batches, and hopes this will make state-of-the-art unsupervised learning research more accessible.

Abstract: Contrastive unsupervised learning has recently shown encouraging progress, e.g., in Momentum Contrast (MoCo) and SimCLR. In this note, we verify the effectiveness of two of SimCLR's design improvements by implementing them in the MoCo framework. With simple modifications to MoCo---namely, using an MLP projection head and more data augmentation---we establish stronger baselines that outperform SimCLR and do not require large training batches. We hope this will make state-of-the-art unsupervised learning research more accessible. Code will be made public.

3,738 citations

Posted Content•

Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

Mathilde Caron¹, Ishan Misra¹, Julien Mairal, Priya Goyal¹, Piotr Bojanowski¹, Armand Joulin¹•Institutions (1)

Facebook¹

17 Jun 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper proposes an online algorithm, SwAV, that takes advantage of contrastive methods without requiring to compute pairwise comparisons, and uses a swapped prediction mechanism where it predicts the cluster assignment of a view from the representation of another view.

Abstract: Unsupervised image representations have significantly reduced the gap with supervised pretraining, notably with the recent achievements of contrastive learning methods. These contrastive methods typically work online and rely on a large number of explicit pairwise feature comparisons, which is computationally challenging. In this paper, we propose an online algorithm, SwAV, that takes advantage of contrastive methods without requiring to compute pairwise comparisons. Specifically, our method simultaneously clusters the data while enforcing consistency between cluster assignments produced for different augmentations (or views) of the same image, instead of comparing features directly as in contrastive learning. Simply put, we use a swapped prediction mechanism where we predict the cluster assignment of a view from the representation of another view. Our method can be trained with large and small batches and can scale to unlimited amounts of data. Compared to previous contrastive methods, our method is more memory efficient since it does not require a large memory bank or a special momentum network. In addition, we also propose a new data augmentation strategy, multi-crop, that uses a mix of views with different resolutions in place of two full-resolution views, without increasing the memory or compute requirements much. We validate our findings by achieving 75.3% top-1 accuracy on ImageNet with ResNet-50, as well as surpassing supervised pretraining on all the considered transfer tasks.
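
The swapped prediction mechanism described above can be sketched as a pair of cross-entropy terms. This is a simplified illustration, not the SwAV code: the cluster assignments (`code_a`, `code_b`) are taken as given rather than computed online, and the function names and the temperature value are assumptions.

```python
import math


def softmax(scores, temperature=0.1):
    """Turn prototype scores into probabilities (temperature is an assumed value)."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(code, probs):
    """Cross-entropy between a target cluster assignment and predicted probabilities."""
    return -sum(q * math.log(p) for q, p in zip(code, probs) if q > 0)


def swapped_prediction_loss(scores_a, scores_b, code_a, code_b, temperature=0.1):
    """Predict view B's assignment from view A's scores, and vice versa."""
    probs_a = softmax(scores_a, temperature)
    probs_b = softmax(scores_b, temperature)
    return cross_entropy(code_b, probs_a) + cross_entropy(code_a, probs_b)
```

The loss is small when both views of an image agree on the cluster, so no explicit pairwise comparison between image features is needed.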

3,734 citations

Journal Article•10.1007/S10994-019-05855-6•

A survey on semi-supervised learning

Jesper E. van Engelen¹, Holger H. Hoos¹,²•Institutions (2)

Leiden University¹, University of British Columbia²

01 Feb 2020-Machine Learning

TL;DR: This survey aims to provide researchers and practitioners new to the field as well as more advanced readers with a solid understanding of the main approaches and algorithms developed over the past two decades, with an emphasis on the most prominent and currently relevant work.

Abstract: Semi-supervised learning is the branch of machine learning concerned with using labelled as well as unlabelled data to perform certain learning tasks. Conceptually situated between supervised and unsupervised learning, it permits harnessing the large amounts of unlabelled data available in many use cases in combination with typically smaller sets of labelled data. In recent years, research in this area has followed the general trends observed in machine learning, with much attention directed at neural network-based models and generative learning. The literature on the topic has also expanded in volume and scope, now encompassing a broad spectrum of theory, algorithms and applications. However, no recent surveys exist to collect and organize this knowledge, impeding the ability of researchers and engineers alike to utilize it. Filling this void, we present an up-to-date overview of semi-supervised learning methods, covering earlier work as well as more recent advances. We focus primarily on semi-supervised classification, where the large majority of semi-supervised learning research takes place. Our survey aims to provide researchers and practitioners new to the field as well as more advanced readers with a solid understanding of the main approaches and algorithms developed over the past two decades, with an emphasis on the most prominent and currently relevant work. Furthermore, we propose a new taxonomy of semi-supervised classification algorithms, which sheds light on the different conceptual and methodological approaches for incorporating unlabelled data into the training process. Lastly, we show how the fundamental assumptions underlying most semi-supervised learning algorithms are closely connected to each other, and how they relate to the well-known semi-supervised clustering assumption.

2,388 citations

Journal Article•10.1109/ACCESS.2020.2988796•

Unsupervised K-Means Clustering Algorithm

Kristina P. Sinaga¹, Miin-Shen Yang¹•Institutions (1)

Chung Yuan Christian University¹

20 Apr 2020-IEEE Access

TL;DR: An unsupervised learning schema is constructed for the k-means algorithm so that it is free of initializations without parameter selection and can also simultaneously find an optimal number of clusters.

Abstract: The k-means algorithm is generally the most known and used clustering method. There are various extensions of k-means to be proposed in the literature. Although it is an unsupervised learning to clustering in pattern recognition and machine learning, the k-means algorithm and its extensions are always influenced by initializations with a necessary number of clusters a priori. That is, the k-means algorithm is not exactly an unsupervised clustering method. In this paper, we construct an unsupervised learning schema for the k-means algorithm so that it is free of initializations without parameter selection and can also simultaneously find an optimal number of clusters. That is, we propose a novel unsupervised k-means (U-k-means) clustering algorithm with automatically finding an optimal number of clusters without giving any initialization and parameter selection. The computational complexity of the proposed U-k-means clustering algorithm is also analyzed. Comparisons between the proposed U-k-means and other existing methods are made. Experimental results and comparisons actually demonstrate these good aspects of the proposed U-k-means clustering algorithm.
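
For context, the dependence of standard k-means on a given number of clusters and on initialization can be seen in a minimal sketch of the classical algorithm. This is the ordinary baseline, not the proposed U-k-means; the function name `kmeans` and its signature are assumptions for illustration.

```python
def kmeans(points, centroids, iterations=10):
    """Plain Lloyd-style k-means on tuples of floats.

    Both the number of clusters and their starting positions come from the
    caller through `centroids`.
    """
    centroids = [tuple(c) for c in centroids]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            distances = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            clusters[distances.index(min(distances))].append(p)
        # Update step: move each centroid to the mean of its points.
        updated = []
        for old, members in zip(centroids, clusters):
            if members:
                updated.append(tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                updated.append(old)  # keep an empty cluster's centroid in place
        if updated == centroids:
            break  # converged
        centroids = updated
    return centroids
```

Passing one initial centroid instead of two yields a single cluster, which shows how much of the result is fixed by the caller's choices before the algorithm starts.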

1,539 citations

Journal Article•10.3390/TECHNOLOGIES9010002•

A Survey on Contrastive Self-Supervised Learning

Ashish Jaiswal, Ashwin Ramesh Babu, Mohammad Zaki Zadeh, Debapriya Banerjee, Fillia Makedon

31 Oct 2020

TL;DR: This survey reviews contrastive self-supervised learning, which embeds augmented versions of the same sample close to each other while pushing away embeddings from different samples, and compares methods on several downstream tasks.

Abstract: Self-supervised learning has gained popularity because of its ability to avoid the cost of annotating large-scale datasets. It is capable of adopting self-defined pseudolabels as supervision and using the learned representations for several downstream tasks. Specifically, contrastive learning has recently become a dominant component in self-supervised learning for computer vision, natural language processing (NLP), and other domains. It aims at embedding augmented versions of the same sample close to each other while trying to push away embeddings from different samples. This paper provides an extensive review of self-supervised methods that follow the contrastive approach. The work explains commonly used pretext tasks in a contrastive learning setup, followed by different architectures that have been proposed so far. Next, we present a performance comparison of different methods for multiple downstream tasks such as image classification, object detection, and action recognition. Finally, we conclude with the limitations of the current methods and the need for further techniques and future directions to make meaningful progress.
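
The pull-together, push-apart objective described above is often written as an InfoNCE-style loss. The sketch below shows one common form for a single anchor; it is not taken from the survey, and the function names, the use of cosine similarity, and the temperature value are assumptions. Embeddings are assumed to be non-zero vectors.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(anchor, positive, negatives, temperature=0.1):
    """Low when the anchor is close to its positive and far from every negative."""
    pos_logit = cosine_similarity(anchor, positive) / temperature
    neg_logits = [cosine_similarity(anchor, n) / temperature for n in negatives]
    logits = [pos_logit] + neg_logits
    peak = max(logits)  # log-sum-exp trick for numerical stability
    log_denominator = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_denominator - pos_logit
```

Here `positive` stands for the embedding of another augmentation of the same sample, and `negatives` for embeddings of different samples.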

1,004 citations

Journal Article•10.1038/S41586-020-2038-X•

Ultrafast machine vision with 2D material neural network image sensors

Lukas Mennel¹, Joanna Symonowicz¹, Stefan Wachter¹, Dmitry K. Polyushkin¹, Aday J. Molina-Mendoza¹, Thomas Mueller¹•Institutions (1)

Vienna University of Technology¹

04 Mar 2020-Nature

TL;DR: It is demonstrated that an image sensor can itself constitute an artificial neural network (ANN) that simultaneously senses and processes optical images without latency; the sensor is trained to classify and encode images with high throughput.

Abstract: Machine vision technology has taken huge leaps in recent years, and is now becoming an integral part of various intelligent systems, including autonomous vehicles and robotics. Usually, visual information is captured by a frame-based camera, converted into a digital format and processed afterwards using a machine-learning algorithm such as an artificial neural network (ANN). The large amount of (mostly redundant) data passed through the entire signal chain, however, results in low frame rates and high power consumption. Various visual data preprocessing techniques have thus been developed to increase the efficiency of the subsequent signal processing in an ANN. Here we demonstrate that an image sensor can itself constitute an ANN that can simultaneously sense and process optical images without latency. Our device is based on a reconfigurable two-dimensional (2D) semiconductor photodiode array, and the synaptic weights of the network are stored in a continuously tunable photoresponsivity matrix. We demonstrate both supervised and unsupervised learning and train the sensor to classify and encode images that are optically projected onto the chip with a throughput of 20 million bins per second.

801 citations

Journal Article•10.1109/ACCESS.2020.3031549•

Contrastive Representation Learning: A Framework and Review

Phuc H. Le-Khac¹, Graham Healy¹, Alan F. Smeaton¹•Institutions (1)

Dublin City University¹

16 Oct 2020-IEEE Access

TL;DR: A general Contrastive Representation Learning framework is proposed that simplifies and unifies many different contrastive learning methods and a taxonomy for each of the components is provided in order to summarise and distinguish it from other forms of machine learning.

Abstract: Contrastive Learning has recently received interest due to its success in self-supervised representation learning in the computer vision domain. However, the origins of Contrastive Learning date as far back as the 1990s and its development has spanned across many fields and domains including Metric Learning and natural language processing. In this paper, we provide a comprehensive literature review and we propose a general Contrastive Representation Learning framework that simplifies and unifies many different contrastive learning methods. We also provide a taxonomy for each of the components of contrastive learning in order to summarise it and distinguish it from other forms of machine learning. We then discuss the inductive biases which are present in any contrastive learning system and we analyse our framework under different views from various sub-fields of Machine Learning. Examples of how contrastive learning has been applied in computer vision, natural language processing, audio processing, and others, as well as in Reinforcement Learning are also presented. Finally, we discuss the challenges and some of the most promising future research directions ahead.

785 citations

Proceedings Article•10.1145/3394486.3403392•

USAD: UnSupervised Anomaly Detection on Multivariate Time Series

Julien Audibert¹, Pietro Michiardi¹, Frédéric Guyard, Sébastien Marti, Maria A. Zuluaga¹•Institutions (1)

Institut Eurécom¹

23 Aug 2020

TL;DR: A fast and stable method called UnSupervised Anomaly Detection for multivariate time series (USAD), based on adversarially trained autoencoders capable of learning in an unsupervised way, is proposed.

Abstract: The automatic supervision of IT systems is a current challenge at Orange. Given the size and complexity reached by its IT operations, the number of sensors needed to obtain measurements over time, used to infer normal and abnormal behaviors, has increased dramatically making traditional expert-based supervision methods slow or prone to errors. In this paper, we propose a fast and stable method called UnSupervised Anomaly Detection for multivariate time series (USAD) based on adversely trained autoencoders. Its autoencoder architecture makes it capable of learning in an unsupervised way. The use of adversarial training and its architecture allows it to isolate anomalies while providing fast training. We study the properties of our methods through experiments on five public datasets, thus demonstrating its robustness, training speed and high anomaly detection performance. Through a feasibility study using Orange's proprietary data we have been able to validate Orange's requirements on scalability, stability, robustness, training speed and high performance.

738 citations

Proceedings Article•10.1109/CVPR42600.2020.01438•

Learning Memory-Guided Normality for Anomaly Detection

Hyunjong Park¹, Jongyoun Noh¹, Bumsub Ham¹•Institutions (1)

Yonsei University¹

14 Jun 2020

TL;DR: This article presents an unsupervised learning approach to anomaly detection that considers the diversity of normal patterns explicitly while lessening the representation capacity of CNNs, using a memory module whose items record prototypical patterns of normal data.

Abstract: We address the problem of anomaly detection, that is, detecting anomalous events in a video sequence. Anomaly detection methods based on convolutional neural networks (CNNs) typically leverage proxy tasks, such as reconstructing input video frames, to learn models describing normality without seeing anomalous samples at training time, and quantify the extent of abnormalities using the reconstruction error at test time. The main drawbacks of these approaches are that they do not consider the diversity of normal patterns explicitly, and the powerful representation capacity of CNNs allows to reconstruct abnormal video frames. To address this problem, we present an unsupervised learning approach to anomaly detection that considers the diversity of normal patterns explicitly, while lessening the representation capacity of CNNs. To this end, we propose to use a memory module with a new update scheme where items in the memory record prototypical patterns of normal data. We also present novel feature compactness and separateness losses to train the memory, boosting the discriminative power of both memory items and deeply learned features from normal data. Experimental results on standard benchmarks demonstrate the effectiveness and efficiency of our approach, which outperforms the state of the art.
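
The general idea of quantifying abnormality through reconstruction error at test time can be sketched as follows. This illustrates only the generic scoring step, not the paper's memory module or its losses; the function names, the use of mean squared error, and the per-sequence min-max normalisation are assumptions.

```python
def reconstruction_error(frame, reconstruction):
    """Mean squared error between a flattened frame and its reconstruction."""
    return sum((x - y) ** 2 for x, y in zip(frame, reconstruction)) / len(frame)


def anomaly_scores(frames, reconstructions):
    """Min-max normalise per-frame errors to [0, 1] across one sequence."""
    errors = [reconstruction_error(f, r) for f, r in zip(frames, reconstructions)]
    low, high = min(errors), max(errors)
    if high == low:
        return [0.0 for _ in errors]  # no frame stands out
    return [(e - low) / (high - low) for e in errors]
```

A model that reconstructs abnormal frames too well produces low errors everywhere, which is the failure mode the abstract attributes to high-capacity CNNs.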

638 citations

Proceedings Article•10.1145/3366423.3380112•

Graph Representation Learning via Graphical Mutual Information Maximization

Zhen Peng¹, Wenbing Huang², Minnan Luo¹, Qinghua Zheng¹, Yu Rong³, Tingyang Xu³, Junzhou Huang³•Institutions (3)

Xi'an Jiaotong University¹, Tsinghua University², Tencent³

20 Apr 2020

TL;DR: An unsupervised learning model trained by maximizing GMI between the input and output of a graph neural encoder is developed, which outperforms state-of-the-art unsupervised counterparts and sometimes even exceeds the performance of supervised ones.

Abstract: The richness in the content of various information networks such as social networks and communication networks provides the unprecedented potential for learning high-quality expressive representations without external supervision. This paper investigates how to preserve and extract the abundant information from graph-structured data into embedding space in an unsupervised manner. To this end, we propose a novel concept, Graphical Mutual Information (GMI), to measure the correlation between input graphs and high-level hidden representations. GMI generalizes the idea of conventional mutual information computations from vector space to the graph domain where measuring mutual information from two aspects of node features and topological structure is indispensable. GMI exhibits several benefits: First, it is invariant to the isomorphic transformation of input graphs—an inevitable constraint in many existing graph representation learning algorithms; Besides, it can be efficiently estimated and maximized by current mutual information estimation methods such as MINE; Finally, our theoretical analysis confirms its correctness and rationality. With the aid of GMI, we develop an unsupervised learning model trained by maximizing GMI between the input and output of a graph neural encoder. Considerable experiments on transductive as well as inductive node classification and link prediction demonstrate that our method outperforms state-of-the-art unsupervised counterparts, and even sometimes exceeds the performance of supervised ones.

Posted Content•

A Transformer-based Framework for Multivariate Time Series Representation Learning

George Zerveas¹, Srideepika Jayaraman², Dhaval Patel², Anuradha Bhamidipaty², Carsten Eickhoff¹•Institutions (2)

Brown University¹, IBM²

06 Oct 2020-arXiv: Learning

TL;DR: A novel framework for multivariate time series representation learning based on the transformer encoder architecture, which can offer substantial performance benefits over fully supervised learning on downstream tasks, even without leveraging additional unlabeled data, i.e., by reusing the existing data samples.

Abstract: In this work we propose for the first time a transformer-based framework for unsupervised representation learning of multivariate time series. Pre-trained models can be potentially used for downstream tasks such as regression and classification, forecasting and missing value imputation. By evaluating our models on several benchmark datasets for multivariate time series regression and classification, we show that not only does our modeling approach represent the most successful method employing unsupervised learning of multivariate time series presented to date, but also that it exceeds the current state-of-the-art performance of supervised methods; it does so even when the number of training samples is very limited, while offering computational efficiency. Finally, we demonstrate that unsupervised pre-training of our transformer models offers a substantial performance benefit over fully supervised learning, even without leveraging additional unlabeled data, i.e., by reusing the same data samples through the unsupervised objective.

Journal Article•10.1109/COMST.2020.2965856•

Thirty Years of Machine Learning: The Road to Pareto-Optimal Wireless Networks

Jingjing Wang¹, Chunxiao Jiang¹, Haijun Zhang², Yong Ren¹, Kwang-Cheng Chen³, Lajos Hanzo⁴•Institutions (4)

Tsinghua University¹, University of Science and Technology Beijing², University of South Florida³, University of Southampton⁴

13 Jan 2020-IEEE Communications Surveys and Tutorials

TL;DR: In this article, the authors review the thirty-year history of ML by elaborating on supervised learning, unsupervised learning, reinforcement learning and deep learning and investigate their employment in the compelling applications of wireless networks, including heterogeneous networks, cognitive radios (CR), Internet of Things (IoT), machine to machine networks (M2M), and so on.

Abstract: Future wireless networks have a substantial potential in terms of supporting a broad range of complex compelling applications both in military and civilian fields, where the users are able to enjoy high-rate, low-latency, low-cost and reliable information services. Achieving this ambitious goal requires new radio techniques for adaptive learning and intelligent decision making because of the complex heterogeneous nature of the network structures and wireless services. Machine learning (ML) algorithms have great success in supporting big data analytics, efficient parameter estimation and interactive decision making. Hence, in this article, we review the thirty-year history of ML by elaborating on supervised learning, unsupervised learning, reinforcement learning and deep learning. Furthermore, we investigate their employment in the compelling applications of wireless networks, including heterogeneous networks (HetNets), cognitive radios (CR), Internet of Things (IoT), machine to machine networks (M2M), and so on. This article aims for assisting the readers in clarifying the motivation and methodology of the various ML algorithms, so as to invoke them for hitherto unexplored services as well as scenarios of future wireless networks.

Journal Article•10.3390/S20020342•

Face Recognition Systems: A Survey

Yassin Kortli, Maher Jridi, Ayman Al Falou, Mohamed Atri¹•Institutions (1)

King Khalid University¹

07 Jan 2020-Sensors

TL;DR: This survey reviews some well-known techniques for each approach, gives a taxonomy of their categories, and discusses future directions in terms of techniques to be used for face recognition.

Abstract: Over the past few decades, interest in theories and algorithms for face recognition has been growing rapidly. Video surveillance, criminal identification, building access control, and unmanned and autonomous vehicles are just a few examples of concrete applications that are gaining attraction among industries. Various techniques are being developed including local, holistic, and hybrid approaches, which provide a face image description using only a few face image features or the whole facial features. The main contribution of this survey is to review some well-known techniques for each approach and to give the taxonomy of their categories. In the paper, a detailed comparison between these techniques is exposed by listing the advantages and the disadvantages of their schemes in terms of robustness, accuracy, complexity, and discrimination. One interesting feature mentioned in the paper is about the database used for face recognition. An overview of the most commonly used databases, including those of supervised and unsupervised learning, is given. Numerical results of the most interesting techniques are given along with the context of experiments and challenges handled by these techniques. Finally, a solid discussion is given in the paper about future directions in terms of techniques to be used for face recognition.

Book Chapter•10.1007/978-3-030-22475-2_1•

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

Mohamed Alloghani¹, Dhiya Al-Jumeily¹, Jamila Mustafina², Abir Hussain¹, Ahmed J. Aljaaf¹•Institutions (2)

Liverpool John Moores University¹, Kazan Federal University²

1 Jan 2020

TL;DR: A systematic review of scholarly articles published between 2015 and 2018 addressing or implementing supervised and unsupervised machine learning techniques in different problem-solving paradigms revealed that decision tree, support vector machine, and Naive Bayes algorithms were the most cited, discussed, and implemented supervised learners.

Abstract: Machine learning is growing as fast as concepts such as Big data and the field of data science in general. The purpose of the systematic review was to analyze scholarly articles that were published between 2015 and 2018 addressing or implementing supervised and unsupervised machine learning techniques in different problem-solving paradigms. Using the elements of PRISMA, the review process identified 84 scholarly articles that had been published in different journals. Of the 84 articles, 6 were published before 2015 despite their metadata indicating that they were published in 2015. The existence of the six articles in the final papers was attributed to errors in indexing. Nonetheless, from the reviewed papers, decision tree, support vector machine, and Naive Bayes algorithms appeared to be the most cited, discussed, and implemented supervised learners. Conversely, k-means, hierarchical clustering, and principal component analysis also emerged as the commonly used unsupervised learners. The review also revealed other commonly used algorithms that include ensembles and reinforce learners, and future systematic reviews can focus on them because of the developments that machine learning and data science are undergoing at the moment.

Journal Article•10.1007/S40745-020-00253-5•

A Comprehensive Survey of Loss Functions in Machine Learning

Qi Wang¹, Yue Ma¹, Kun Zhao², Yingjie Tian¹•Institutions (2)

Chinese Academy of Sciences¹, Beijing Wuzi University²

12 Apr 2020-Annals of Data Science

TL;DR: This paper summarizes and analyzes 31 classical loss functions in machine learning from the aspects of traditional machine learning and deep learning respectively, and mainly selects object detection and face recognition to introduce their loss functions.

Abstract: As one of the important research topics in machine learning, loss function plays an important role in the construction of machine learning algorithms and the improvement of their performance, which has been concerned and explored by many researchers. But it still has a big gap to summarize, analyze and compare the classical loss functions. Therefore, this paper summarizes and analyzes 31 classical loss functions in machine learning. Specifically, we describe the loss functions from the aspects of traditional machine learning and deep learning respectively. The former is divided into classification problem, regression problem and unsupervised learning according to the task type. The latter is subdivided according to the application scenario, and here we mainly select object detection and face recognition to introduce their loss functions. In each task or application, in addition to analyzing each loss function from formula, meaning, image and algorithm, the loss functions under the same task or application are also summarized and compared to deepen the understanding and provide help for the selection and improvement of loss function.

Journal Article•10.1109/TAI.2021.3054609•

A Decade Survey of Transfer Learning (2010–2020)

Shuteng Niu¹, Yongxin Liu¹, Jian Wang¹, Houbing Song¹•Institutions (1)

Embry-Riddle Aeronautical University, Daytona Beach¹

1 Apr 2020

TL;DR: This survey organizes transfer learning (TL) into four categories (transductive learning, inductive learning, unsupervised learning, and negative learning), each of which can be organized into four learning types: learning on instances, learning on features, learning on parameters, and learning on relations.

Abstract: Transfer learning (TL) has been successfully applied to many real-world problems that traditional machine learning (ML) cannot handle, such as image processing, speech recognition, and natural language processing (NLP). Commonly, TL tends to address three main problems of traditional machine learning: (1) insufficient labeled data, (2) incompatible computation power, and (3) distribution mismatch. In general, TL can be organized into four categories: transductive learning, inductive learning, unsupervised learning, and negative learning. Furthermore, each category can be organized into four learning types: learning on instances, learning on features, learning on parameters, and learning on relations. This article presents a comprehensive survey on TL. In addition, this article presents the state of the art, current trends, applications, and open challenges.

Posted Content•

Fairness in Machine Learning: A Survey.

Simon Caton¹, Christian Haas²•Institutions (2)

University College Dublin¹, University of Nebraska Omaha²

04 Oct 2020-arXiv: Learning

TL;DR: This article provides an overview of the different schools of thought and approaches to mitigating (social) biases and increasing fairness in the Machine Learning literature, and organises approaches into the widely accepted framework of pre-processing, in-processing, and post-processing methods, subcategorizing into a further 11 method areas.

Abstract: As Machine Learning technologies become increasingly used in contexts that affect citizens, companies as well as researchers need to be confident that their application of these methods will not have unexpected social implications, such as bias towards gender, ethnicity, and/or people with disabilities. There is significant literature on approaches to mitigate bias and promote fairness, yet the area is complex and hard to penetrate for newcomers to the domain. This article seeks to provide an overview of the different schools of thought and approaches to mitigating (social) biases and increase fairness in the Machine Learning literature. It organises approaches into the widely accepted framework of pre-processing, in-processing, and post-processing methods, subcategorizing into a further 11 method areas. Although much of the literature emphasizes binary classification, a discussion of fairness in regression, recommender systems, unsupervised learning, and natural language processing is also provided along with a selection of currently available open source libraries. The article concludes by summarising open challenges articulated as four dilemmas for fairness research.

Journal Article•10.1177/0049124117729703•

Computational Grounded Theory: A Methodological Framework

Laura K. Nelson¹•Institutions (1)

Northeastern University¹

01 Feb 2020-Sociological Methods & Research

TL;DR: A three-step methodological framework called computational grounded theory is proposed, which combines expert human knowledge and hermeneutic skills with the processing power and pattern recognition of computers, producing a more methodologically rigorous but interpretive approach to content analysis.

Abstract: This article proposes a three-step methodological framework called computational grounded theory, which combines expert human knowledge and hermeneutic skills with the processing power and pattern ...

Journal Article•10.1016/J.COMNET.2020.107247•

Building an Efficient Intrusion Detection System Based on Feature Selection and Ensemble Classifier

Yuyang Zhou¹,², Guang Cheng¹,², Shanqing Jiang², Mian Dai¹,²•Institutions (2)

Chinese Ministry of Education¹, Southeast University²

19 Jun 2020-Computer Networks

TL;DR: This paper proposes a new intrusion detection framework based on feature selection and ensemble learning techniques, which is reported to perform better than other related and state-of-the-art approaches under several metrics.

Proceedings Article•10.1145/3394486.3406704•

Robust Deep Learning Methods for Anomaly Detection

Raghavendra Chalapathy¹, Nguyen Lu Dang Khoa¹, Sanjay Chawla²•Institutions (2)

Commonwealth Scientific and Industrial Research Organisation¹, Qatar Computing Research Institute²

23 Aug 2020

TL;DR: The tutorial will revisit well-known unsupervised learning techniques in deep learning, including autoencoders and generative adversarial networks (GANs), from the perspective of anomaly detection to give the audience a more grounded perspective on unsupervised deep learning methods.

Abstract: Anomaly detection is an important problem that has been well studied within diverse research areas and application domains. A robust anomaly detection system identifies rare events and patterns in the absence of labelled data. The identified patterns provide crucial insights about both the fidelity of the data and deviations in the underlying data-generating process. For example, a surveillance system designed to monitor the emergence of new epidemics will use robust anomaly detection methods to separate spurious associations from genuine indicators of an epidemic with minimal lag time. The key concept in anomaly detection is the notion of "robustness", i.e., designing models and representations which are less sensitive to small changes in the underlying data distribution. The canonical example is that the median is more robust than the mean as an estimator. The tutorial will primarily help researchers and developers design deep learning architectures and loss functions where the learnt representation behaves more like the "median" than the "mean". The tutorial will revisit well-known unsupervised learning techniques in deep learning, including autoencoders and generative adversarial networks (GANs), from the perspective of anomaly detection. This in turn will give the audience a more grounded perspective on unsupervised deep learning methods. All the methods will be introduced in a hands-on manner to demonstrate how high-level ideas and concepts get translated to practical real code.
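
The abstract's remark that the median is more robust than the mean can be illustrated with a minimal sketch. This is not taken from the tutorial itself; the sample values and the helper name `summarize` are assumptions chosen purely for illustration.

```python
import statistics


def summarize(values):
    """Return (mean, median) of a list of numbers (illustrative helper)."""
    return statistics.mean(values), statistics.median(values)


# Hypothetical "normal" readings, and the same readings with the last
# point replaced by one gross outlier.
clean = [10.0, 11.0, 9.0, 10.0, 10.0]
contaminated = clean[:-1] + [1000.0]

clean_mean, clean_median = summarize(clean)
bad_mean, bad_median = summarize(contaminated)

# How far each estimator moved because of the single outlier.
mean_shift = abs(bad_mean - clean_mean)
median_shift = abs(bad_median - clean_median)

print(f"mean shift: {mean_shift:.1f}, median shift: {median_shift:.1f}")
# prints "mean shift: 198.0, median shift: 0.0"
```

A single corrupted value moves the mean by 198 while leaving the median unchanged, which is the sense of "robustness" the abstract refers to.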

Posted Content•

Learning Memory-guided Normality for Anomaly Detection

Hyunjong Park¹, Jongyoun Noh¹, Bumsub Ham¹•Institutions (1)

Yonsei University¹

30 Mar 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work proposes to use a memory module with a new update scheme where items in the memory record prototypical patterns of normal data, boosting the discriminative power of both memory items and deeply learned features from normal data and lessening the representation capacity of CNNs.

Journal Article•10.1038/S41524-019-0267-Z•

Machine learning enabled autonomous microstructural characterization in 3D samples

Henry Chan¹, Mathew J. Cherukara¹, Troy D. Loeffler¹, Badri Narayanan¹,², Subramanian K. R. S. Sankaranarayanan¹,³•Institutions (3)

Argonne National Laboratory¹, University of Louisville², University of Illinois at Chicago³

06 Jan 2020

TL;DR: This work introduces an unsupervised machine learning (ML) based technique for the identification and characterization of microstructures in three-dimensional samples obtained from molecular dynamics simulations, particle tracking data, or experiments that combines topology classification, image processing, and clustering algorithms.

Abstract: We introduce an unsupervised machine learning (ML) based technique for the identification and characterization of microstructures in three-dimensional (3D) samples obtained from molecular dynamics simulations, particle tracking data, or experiments. Our technique combines topology classification, image processing, and clustering algorithms, and can handle a wide range of microstructure types including grains in polycrystalline materials, voids in porous systems, and structures from self/directed assembly in soft-matter complex solutions. Our technique does not require a priori microstructure description of the target system and is insensitive to disorder such as extended defects in polycrystals arising from line and plane defects. We demonstrate quantitatively that our technique provides unbiased microstructural information such as precise quantification of grains and their size distributions in 3D polycrystalline samples, and that it characterizes features such as voids and porosity in 3D polymeric samples and micellar size distribution in 3D complex fluids. To demonstrate the efficacy of our ML approach, we benchmark it against a diverse set of synthetic data samples representing nanocrystalline metals, polymers and complex fluids as well as experimentally published characterization data. Our technique is computationally efficient and provides a way to quickly identify, track, and quantify complex microstructural features that impact the observed material behavior.

Journal Article•10.1609/AAAI.V34I07.6936•

FusionDN: A Unified Densely Connected Network for Image Fusion.

Han Xu¹, Jiayi Ma¹, Zhuliang Le¹, Junjun Jiang², Xiaojie Guo³•Institutions (3)

Wuhan University¹, Harbin Institute of Technology², Tianjin University³

03 Apr 2020

TL;DR: A new unsupervised and unified densely connected network for different types of image fusion tasks, termed FusionDN, which obtains a single model applicable to multiple fusion tasks by applying elastic weight consolidation to avoid forgetting what has been learned from previous tasks when training multiple tasks sequentially.

Abstract: In this paper, we present a new unsupervised and unified densely connected network for different types of image fusion tasks, termed FusionDN. In our method, the densely connected network is trained to generate the fused image conditioned on source images. Meanwhile, a weight block is applied to obtain two data-driven weights as the retention degrees of features in different source images, which are the measurement of the quality and the amount of information in them. Losses of similarities based on these weights are applied for unsupervised learning. In addition, we obtain a single model applicable to multiple fusion tasks by applying elastic weight consolidation to avoid forgetting what has been learned from previous tasks when training multiple tasks sequentially, rather than training individual models for every fusion task or jointly training tasks roughly. Qualitative and quantitative results demonstrate the advantages of FusionDN compared with state-of-the-art methods in different fusion tasks.

Book Chapter•10.1007/978-3-030-58580-8_34•

PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding

Saining Xie¹, Jiatao Gu¹, Demi Guo¹, Charles R. Qi¹, Leonidas J. Guibas², Or Litany²•Institutions (2)

Facebook¹, Stanford University²

23 Aug 2020

TL;DR: In this article, a unified triplet of architecture, source dataset, and contrastive loss is used for unsupervised pre-training, improving 3D point cloud segmentation and detection across indoor and outdoor, real and synthetic datasets.

Abstract: Arguably one of the top success stories of deep learning is transfer learning. The finding that pre-training a network on a rich source set (e.g., ImageNet) can help boost performance once fine-tuned on a usually much smaller target set has been instrumental to many applications in language and vision. Yet, very little is known about its usefulness in 3D point cloud understanding. We see this as an opportunity considering the effort required for annotating data in 3D. In this work, we aim at facilitating research on 3D representation learning. Different from previous works, we focus on high-level scene understanding tasks. To this end, we select a suite of diverse datasets and tasks to measure the effect of unsupervised pre-training on a large source set of 3D scenes. Our findings are extremely encouraging: using a unified triplet of architecture, source dataset, and contrastive loss for pre-training, we achieve improvement over recent best results in segmentation and detection across 6 different benchmarks for indoor and outdoor, real and synthetic datasets, demonstrating that the learned representation can generalize across domains. Furthermore, the improvement was similar to supervised pre-training, suggesting that future efforts should favor scaling data collection over more detailed annotation. We hope these findings will encourage more research on unsupervised pretext task design for 3D deep learning.

Book Chapter•10.1007/978-3-030-58607-2_16•

SCAN: Learning to Classify Images Without Labels

Wouter Van Gansbeke¹, Simon Vandenhende¹, Stamatios Georgoulis², Marc Proesmans¹, Luc Van Gool¹•Institutions (2)

Katholieke Universiteit Leuven¹, ETH Zurich²

23 Aug 2020

TL;DR: This paper advocates a two-step approach in which feature learning and clustering are decoupled: a self-supervised task is first used to obtain semantically meaningful features, which then serve as a prior in a learnable clustering approach.

Abstract: Can we automatically group images into semantically meaningful clusters when ground-truth annotations are absent? The task of unsupervised image classification remains an important and open challenge in computer vision. Several recent approaches have tried to tackle this problem in an end-to-end fashion. In this paper, we deviate from recent works, and advocate a two-step approach where feature learning and clustering are decoupled. First, a self-supervised task from representation learning is employed to obtain semantically meaningful features. Second, we use the obtained features as a prior in a learnable clustering approach. In doing so, we remove the ability for cluster learning to depend on low-level features, which is present in current end-to-end learning approaches. Experimental evaluation shows that we outperform state-of-the-art methods by large margins, in particular +26.6% on CIFAR10, +25.0% on CIFAR100-20 and +21.3% on STL10 in terms of classification accuracy. Furthermore, our method is the first to perform well on a large-scale dataset for image classification. In particular, we obtain promising results on ImageNet, and outperform several semi-supervised learning methods in the low-data regime without the use of any ground-truth annotations. The code is available at www.github.com/wvangansbeke/Unsupervised-Classification.git.

Proceedings Article•10.1109/ICASSP40776.2020.9053569•

Multi-Task Self-Supervised Learning for Robust Speech Recognition

Mirco Ravanelli¹, Jianyuan Zhong², Santiago Pascual³, Pawel Swietojanski⁴, Joao Monteiro⁵, Jan Trmal⁶, Yoshua Bengio¹•Institutions (6)

Université de Montréal¹, University of Rochester², Polytechnic University of Catalonia³, University of New South Wales⁴, Institut national de la recherche scientifique⁵, Johns Hopkins University⁶

25 Jan 2020

TL;DR: PASE+ is proposed, an improved version of PASE that better learns short- and long-term speech dynamics with an efficient combination of recurrent and convolutional networks and learns transferable representations suitable for highly mismatched acoustic conditions.

Abstract: Despite the growing interest in unsupervised learning, extracting meaningful knowledge from unlabelled audio remains an open challenge. To take a step in this direction, we recently proposed a problem-agnostic speech encoder (PASE) that combines a convolutional encoder followed by multiple neural networks, called workers, tasked to solve self-supervised problems (i.e., ones that do not require manual annotations as ground truth). PASE was shown to capture relevant speech information, including speaker voice-print and phonemes. This paper proposes PASE+, an improved version of PASE for robust speech recognition in noisy and reverberant environments. To this end, we employ an online speech distortion module that contaminates the input signals with a variety of random disturbances. We then propose a revised encoder that better learns short- and long-term speech dynamics with an efficient combination of recurrent and convolutional networks. Finally, we refine the set of workers used in self-supervision to encourage better cooperation. Results on TIMIT, DIRHA and CHiME-5 show that PASE+ significantly outperforms both the previous version of PASE and common acoustic features. Interestingly, PASE+ learns transferable representations suitable for highly mismatched acoustic conditions.

Journal Article•10.1109/TCOMM.2019.2960361•

A Deep Learning Framework for Optimization of MISO Downlink Beamforming

Wenchao Xia¹, Gan Zheng², Yongxu Zhu³, Jun Zhang¹, Jiangzhou Wang⁴, Athina P. Petropulu⁵•Institutions (5)

Nanjing University of Posts and Telecommunications¹, Loughborough University², London South Bank University³, University of Kent⁴, Rutgers University⁵

14 Jan 2020-IEEE Transactions on Communications

TL;DR: A deep learning framework for the optimization of downlink beamforming is proposed based on convolutional neural networks and exploitation of expert knowledge, such as the uplink-downlink duality and the known structure of optimal solutions, paving the way for fast realization of optimal beamforming in multiuser MISO systems.

Abstract: Beamforming is an effective means to improve the quality of the received signals in multiuser multiple-input-single-output (MISO) systems. Traditionally, finding the optimal beamforming solution relies on iterative algorithms, which introduce high computational delay and are thus not suitable for real-time implementation. In this paper, we propose a deep learning framework for the optimization of downlink beamforming. In particular, the solution is obtained based on convolutional neural networks and exploitation of expert knowledge, such as the uplink-downlink duality and the known structure of optimal solutions. Using this framework, we construct three beamforming neural networks (BNNs) for three typical optimization problems, i.e., the signal-to-interference-plus-noise ratio (SINR) balancing problem, the power minimization problem, and the sum rate maximization problem. For the former two problems the BNNs adopt the supervised learning approach, while for the sum rate maximization problem a hybrid method of supervised and unsupervised learning is employed. Simulation results show that the BNNs can achieve near-optimal solutions to the SINR balancing and power minimization problems, and a performance close to that of the weighted minimum mean squared error algorithm for the sum rate maximization problem, while in all cases enjoying significantly reduced computational complexity. In summary, this work paves the way for fast realization of optimal beamforming in multiuser MISO systems.

Proceedings Article•

On Mutual Information Maximization for Representation Learning

Michael Tschannen¹, Josip Djolonga¹, Paul K. Rubenstein², Sylvain Gelly¹, Mario Lucic¹•Institutions (2)

Google¹, Max Planck Society²

30 Apr 2020

TL;DR: This paper argues, and provides empirical evidence, that the success of these methods cannot be attributed to the properties of MI alone, and that they strongly depend on the inductive bias in both the choice of feature extractor architectures and the parametrization of the employed MI estimators.

Abstract: Many recent methods for unsupervised or self-supervised representation learning train feature extractors by maximizing an estimate of the mutual information (MI) between different views of the data. This comes with several immediate problems: For example, MI is notoriously hard to estimate, and using it as an objective for representation learning may lead to highly entangled representations due to its invariance under arbitrary invertible transformations. Nevertheless, these methods have been repeatedly shown to excel in practice. In this paper we argue, and provide empirical evidence, that the success of these methods cannot be attributed to the properties of MI alone, and that they strongly depend on the inductive bias in both the choice of feature extractor architectures and the parametrization of the employed MI estimators. Finally, we establish a connection to deep metric learning and argue that this interpretation may be a plausible explanation for the success of the recently introduced methods.
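
The invariance property mentioned in the abstract can be written out as a short worked statement. This is a standard fact about mutual information, not a formula quoted from the paper; the symbols X, Y, f, and g are notation assumed here for illustration, with f and g taken to be smooth invertible maps with smooth inverses.

```latex
% Mutual information between two views X and Y with joint density p(x, y):
I(X; Y) = \iint p(x, y) \, \log \frac{p(x, y)}{p(x)\, p(y)} \, dx \, dy

% Under a change of variables x' = f(x), y' = g(y), the Jacobian factors
% appear in both the numerator and the denominator of the ratio and cancel:
I\bigl(f(X);\, g(Y)\bigr) = I(X; Y)
```

Because any such f and g leave the objective unchanged, maximizing MI alone does not distinguish a well-structured representation from an arbitrarily entangled one, which is the concern the abstract raises.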

Proceedings Article•10.1109/CVPR42600.2020.01077•

Visual Commonsense R-CNN

Tan Wang¹, Jianqiang Huang², Hanwang Zhang², Qianru Sun³•Institutions (3)

University of Electronic Science and Technology of China¹, Nanyang Technological University², Singapore Management University³

14 Jun 2020

TL;DR: A novel unsupervised feature representation learning method, Visual Commonsense Region-based Convolutional Neural Network (VC R-CNN), is presented to serve as an improved visual region encoder for high-level tasks such as captioning and VQA; applying its features to prevailing models for these tasks yields consistent performance boosts.

Abstract: We present a novel unsupervised feature representation learning method, Visual Commonsense Region-based Convolutional Neural Network (VC R-CNN), to serve as an improved visual region encoder for high-level tasks such as captioning and VQA. Given a set of detected object regions in an image (e.g., using Faster R-CNN), like any other unsupervised feature learning method (e.g., word2vec), the proxy training objective of VC R-CNN is to predict the contextual objects of a region. However, they are fundamentally different: the prediction of VC R-CNN uses causal intervention, P(Y|do(X)), while the others use the conventional likelihood, P(Y|X). This is also the core reason why VC R-CNN can learn “sense-making” knowledge, such as that a chair can be sat on, rather than just “common” co-occurrences, such as that a chair is likely to exist if a table is observed. We extensively apply VC R-CNN features in prevailing models of three popular tasks: Image Captioning, VQA, and VCR, and observe consistent performance boosts across them, achieving many new state-of-the-art results.
