Identifying implementation bugs in machine learning based image classifiers using metamorphic testing

doi:10.1145/3213846.3213858

Open AccessProceedings Article10.1145/3213846.3213858

Identifying implementation bugs in machine learning based image classifiers using metamorphic testing

Anurag Dwarakanath, +6 more

- 12 Jul 2018

- pp 118-128

199

TL;DR: In this article, the authors present an articulation of the challenges in testing ML based applications and present a solution approach based on the concept of metamorphic testing, which aims to identify implementation bugs in ML based image classifiers.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1109/TSE.2019.2962027

Machine Learning Testing: Survey, Landscapes and Horizons

Jie Zhang, +3 more

- 17 Feb 2020

- IEEE Transactions on Software Engineerin...

TL;DR: A comprehensive survey of machine learning testing can be found in this article, which covers 138 papers on testing properties (e.g., correctness, robustness, and fairness), testing components (i.e., the data, learning program, and framework), testing workflow, and application scenarios.

...read moreread less

702

•Posted Content

Machine Learning Testing: Survey, Landscapes and Horizons

Jie Zhang, +3 more

- 19 Jun 2019

- arXiv: Learning

TL;DR: This paper provides a comprehensive survey of techniques for testing machine learning systems; Machine Learning Testing (ML testing) research, covering 144 papers on testing properties, testing components, and application scenarios.

...read moreread less

503

•Posted Content

Testing Deep Neural Networks

Youcheng Sun, +2 more

- 10 Mar 2018

- arXiv: Learning

TL;DR: This paper proposes a family of four novel test criteria that are tailored to structural features of DNNs and their semantics, and validated by demonstrating that the generated test inputs guided via the proposed coverage criteria are able to capture undesired behaviours in a DNN.

...read moreread less

262

•Journal Article•10.1007/S10664-020-09881-0

Testing machine learning based systems: a systematic mapping

Vincenzo Riccio, +5 more

- 01 Nov 2020

- Empirical Software Engineering

TL;DR: A systematic mapping study about testing techniques for MLSs driven by 33 research questions and investigated multiple aspects of the testing approaches, such as the used/proposed adequacy criteria, the algorithms for test input generation, and the test oracles.

...read moreread less

216

•Journal Article•10.1016/J.JSS.2020.110542

On testing machine learning programs

Houssem Ben Braiek, +1 more

- 01 Jun 2020

- Journal of Systems and Software

TL;DR: A comprehensive review of software testing practices for ML programs can be found in this paper, where the authors identify and explain challenges that should be addressed when testing ML programs and report existing solutions found in the literature.

...read moreread less

209

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

•Journal Article•10.1145/3065386

ImageNet classification with deep convolutional neural networks

Alex Krizhevsky, +2 more

- 24 May 2017

- Communications of The ACM

TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.

...read moreread less

98.2K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Book Chapter•10.1007/978-3-319-10602-1_48

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

- 06 Sep 2014

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less

51.7K