Inflation of test accuracy due to data leakage in deep learning-based classification of OCT images

doi:10.1038/s41597-022-01618-6

Open AccessJournal Article10.1038/s41597-022-01618-6

Inflation of test accuracy due to data leakage in deep learning-based classification of OCT images

Iulian Emil Tampu, +2 more

- 21 Feb 2022

- Scientific Data

- Vol. 9, Iss: 1

51

TL;DR: In this paper , the effect of improper dataset splitting on model evaluation is demonstrated for three classification tasks using three OCT open-access datasets extensively used, Kermany's and Srinivasan's ophthalmology datasets, and AIIMS breast tissue dataset.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1007/s10964-023-01767-w

Predicting Adolescent Mental Health Outcomes Across Cultures: A Machine Learning Approach

W. Andrew Rothenberg, +22 more

- 19 Apr 2023

- Journal of Youth and Adolescence

TL;DR: This paper used machine learning models to identify the most important preadolescent risk factors in predicting adolescent mental health, including family context, parenting behaviors, individual child characteristics, and neighborhood and cultural variables.

...read moreread less

23

Journal Article•10.1093/petrology/egad050

Barometers behaving badly II: A critical evaluation of Cpx-only and Cpx-Liq thermobarometry in variably-hydrous arc magmas

Penny E. Wieser, +2 more

- 05 Jul 2023

- Journal of Petrology

TL;DR: In this paper , the average Clinopyroxene-Liquid (Cpx-Liq) compositions from N=543 variably-hydrous experiments at crustal conditions (1 bar to 17 kbar) were used to assess the performance of different thermobarometers, and identify the most accurate and precise expressions for application to subduction zone magmas.

...read moreread less

22

Journal Article•10.1016/j.jksuci.2023.101810

ConvAttenMixer: Brain Tumor Detection and Type Classification using Convolutional Mixer with External and Self-Attention Mechanisms

Salha Alzahrani

- 01 Oct 2023

- Journal of King Saud University - Comput...

TL;DR: ConvAttenMixer, a transformer model, combines convolutional layers with self-attention and external attention mechanisms to enhance brain tumor detection and classification in MRI images, outperforming state-of-the-art baselines with higher precision, recall, and accuracy (0.9794).

...read moreread less

20

•Journal Article•10.1371/journal.pdig.0000276

AutoPrognosis 2.0: Democratizing diagnostic and prognostic modeling in healthcare with automated machine learning

Fergus Imrie

- 22 Jun 2023

- PLOS digital health

TL;DR: Vandergaard et al. as discussed by the authors presented an open-source machine learning framework, AutoPrognosis 2.0, to facilitate the development of diagnostic and prognostic models.

...read moreread less

19

Journal Article•10.3390/app132111625

Vision Transformers and Transfer Learning Approaches for Arabic Sign Language Recognition

Nojood M. Alharthi, +1 more

- 24 Oct 2023

- Applied Sciences

TL;DR: This study aimed to create robust transfer learning models trained on a dataset of 54,049 images depicting 32 alphabets from an ArSL dataset and demonstrated the effectiveness and robustness of using transfer learning with vision transformers for sign language recognition for other low-resourced languages.

...read moreread less

18

...

Expand

References

•Journal Article•10.1016/J.MEDIA.2017.07.005

A survey on deep learning in medical image analysis

Geert Litjens, +8 more

- 01 Dec 2017

- Medical Image Analysis

TL;DR: This paper reviews the major deep learning concepts pertinent to medical image analysis and summarizes over 300 contributions to the field, most of which appeared in the last year, to survey the use of deep learning for image classification, object detection, segmentation, registration, and other tasks.

...read moreread less

12.5K

•Book

Applied Predictive Modeling

Max Kuhn, +1 more

- 17 May 2013

TL;DR: This research presents a novel and scalable approach called “Smartfitting” that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of designing and implementing statistical models for regression models.

...read moreread less

5.9K

Journal Article•10.1016/J.IPM.2009.03.002

A systematic analysis of performance measures for classification tasks

Marina Sokolova, +1 more

- 01 Jul 2009

- Information Processing and Management

TL;DR: This paper presents a systematic analysis of twenty four performance measures used in the complete spectrum of Machine Learning classification tasks, i.e., binary, multi-class,multi-labelled, and hierarchical, to produce a measure invariance taxonomy with respect to all relevant label distribution changes in a classification problem.

...read moreread less

5.4K

•Journal Article•10.1186/S12864-019-6413-7

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

Davide Chicco, +1 more

- 02 Jan 2020

- BMC Genomics

TL;DR: This article shows how MCC produces a more informative and truthful score in evaluating binary classifications than accuracy and F1 score, by first explaining the mathematical properties, and then the asset of MCC in six synthetic use cases and in a real genomics scenario.

...read moreread less

4.5K

•Journal Article•10.1016/J.CELL.2018.02.010

Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning

Daniel S. Kermany, +46 more

- 22 Feb 2018

- Cell

TL;DR: A diagnostic tool based on a deep-learning framework for the screening of patients with common treatable blinding retinal diseases, which demonstrates performance comparable to that of human experts in classifying age-related macular degeneration and diabetic macular edema.

...read moreread less

4.2K