Class Incremental Learning for Visual Task using Knowledge Distillation

doi:10.1109/inmic56986.2022.9972924

Proceedings Article10.1109/inmic56986.2022.9972924

Class Incremental Learning for Visual Task using Knowledge Distillation

21 Oct 2022

TL;DR: In this article , the authors proposed class incremental learning using bi-distillation (CILBD) method that effectively learns not only the classes of the new data but also previously learned classes.

Abstract: The Artificial Agent's ability to enhance knowledge incrementally for new data is challenging in class incremental learning because of catastrophic forgetting in which new classes make the trained model quickly forget old classes knowledge. Knowledge distilling techniques and keeping subset of data from the old classes have been proposed to revamp models to accommodate new classes. These techniques allow models to sustain their knowledge without forgetting everything they already know but somewhat alleviate the catastrophic forgetting problem. In this study we propose class incremental learning using bi-distillation (CILBD) method that effectively learn not only the classes of the new data but also previously learned classes. The proposed architecture uses knowledge distillation in such a way that the student model directly learns knowledge from two teacher model and thus alleviate the forgetting of the old class. Our experiments on the iCIFAR-100 dataset showed that the proposed method is more accurate at classifying, forgets less, and works better than state-of-the-art methods.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

References

•Dissertation

Learning Multiple Layers of Features from Tiny Images

Alex Krizhevsky

- 01 Jan 2009

TL;DR: In this paper, the authors describe how to train a multi-layer generative model of natural images, using a dataset of millions of tiny colour images, described in the next section.

...read moreread less

23.7K

•Posted Content

Distilling the Knowledge in a Neural Network

Geoffrey E. Hinton, +2 more

- 09 Mar 2015

- arXiv: Machine Learning

TL;DR: This work shows that it can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.

...read moreread less

21.2K

•Posted Content

Proceedings of the 29th International Conference on Machine Learning (ICML-12)

John Langford, +1 more

- 19 Jul 2012

- arXiv: Learning

TL;DR: This is an index to the papers that appear in the Proceedings of the 29th International Conference on Machine Learning (ICML-12).

...read moreread less

5.5K

Book Chapter•10.1016/S0079-7421(08)60536-8

Catastrophic interference in connectionist networks: the sequential learning problem

Michael McCloskey, +1 more

- 01 Jan 1989

- Psychology of Learning and Motivation

TL;DR: In this article, the authors discuss the catastrophic interference in connectionist networks and show that new learning may interfere catastrophically with old learning when networks are trained sequentially, and the analysis of the causes of interference implies that at least some interference will occur whenever new learning might alter weights involved in representing old learning.

...read moreread less

4.6K

•Journal Article•10.1016/J.NEUNET.2019.01.012

Continual lifelong learning with neural networks: A review.

German Ignacio Parisi, +4 more

- 01 May 2019

- Neural Networks

TL;DR: This review critically summarize the main challenges linked to lifelong learning for artificial learning systems and compare existing neural network approaches that alleviate, to different extents, catastrophic forgetting.

...read moreread less

3.2K