An interpretable multiple-instance approach for the detection of referable diabetic retinopathy in fundus images.

doi:10.1038/S41598-021-93632-8

Open Access•Journal Article•10.1038/S41598-021-93632-8

Alexandros Papadopoulos, +2 more

- 12 Jul 2021

- Scientific Reports

- Vol. 11, Iss: 1, pp 14326-14326

TL;DR: This article proposes a machine learning system, based on the paradigm of multiple-instance learning, for the detection of referable diabetic retinopathy in fundus images.

Abstract: Diabetic retinopathy (DR) is one of the leading causes of vision loss across the world. Yet despite its wide prevalence, the majority of affected people lack access to the specialized ophthalmologists and equipment required for monitoring their condition. This can lead to delays in the start of treatment, thereby lowering their chances for a successful outcome. Machine learning systems that automatically detect the disease in eye fundus images have been proposed as a means of facilitating access to retinopathy severity estimates for patients in remote regions or even for complementing the human expert’s diagnosis. Here we propose a machine learning system for the detection of referable diabetic retinopathy in fundus images, which is based on the paradigm of multiple-instance learning. Our method extracts local information independently from multiple rectangular image patches and combines it efficiently through an attention mechanism that focuses on the abnormal regions of the eye (i.e. those that contain DR-induced lesions), thus resulting in a final image representation that is suitable for classification. Furthermore, by leveraging the attention mechanism our algorithm can seamlessly produce informative heatmaps that highlight the regions where the lesions are located. We evaluate our approach on the publicly available Kaggle, Messidor-2 and IDRiD retinal image datasets, on which it exhibits near state-of-the-art classification performance (AUC of 0.961 on Kaggle and 0.976 on Messidor-2), while also producing valid lesion heatmaps (AUPRC of 0.869 on the 81 images of IDRiD that contain pixel-level lesion annotations). Our results suggest that the proposed approach provides an efficient and interpretable solution to the problem of automated diabetic retinopathy grading.
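
The patch-and-attention pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the tanh-based attention scoring commonly used in attention-based multiple-instance learning, uses random vectors in place of learned CNN patch embeddings, and assumes non-overlapping square patches on a regular grid. The names `attention_mil_pool`, `attention_heatmap`, `V` and `w` are hypothetical.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - x.max())
    return e / e.sum()


def attention_mil_pool(patch_features, V, w):
    """Combine per-patch features into one image-level vector.

    patch_features: (n_patches, d) array, one row per image patch.
    V: (d, h) projection matrix (assumed attention parameter).
    w: (h,) scoring vector (assumed attention parameter).
    Returns the pooled vector (d,) and the attention weights (n_patches,).
    """
    scores = np.tanh(patch_features @ V) @ w  # one scalar score per patch
    weights = softmax(scores)                 # non-negative, sums to 1
    pooled = weights @ patch_features         # attention-weighted average
    return pooled, weights


def attention_heatmap(weights, grid_shape, patch_size):
    """Paint each patch's attention weight over the pixels it covers."""
    grid = weights.reshape(grid_shape)
    return np.kron(grid, np.ones((patch_size, patch_size)))


# Toy usage: a 4x4 grid of 32x32-pixel patches with 8-dimensional features.
rng = np.random.default_rng(0)
features = rng.normal(size=(16, 8))  # stand-in for CNN patch embeddings
V = rng.normal(size=(8, 4))
w = rng.normal(size=4)

pooled, weights = attention_mil_pool(features, V, w)
heatmap = attention_heatmap(weights, grid_shape=(4, 4), patch_size=32)
```

In a trained system, `pooled` would be passed to a classifier that outputs the referable-DR decision, and `heatmap` would be overlaid on the fundus image; patches with larger attention weights would then mark candidate lesion regions.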

Citations

•Journal Article•10.1109/access.2022.3157632

Deep Learning Techniques for Diabetic Retinopathy Classification: A Survey

01 Jan 2022

- IEEE Access

TL;DR: This paper surveys state-of-the-art deep learning methods in supervised, self-supervised, and Vision Transformer setups for retinal fundus image classification and detection.

10.1007/s40520-023-02552-2

A comprehensive review of machine learning algorithms and their application in geriatric medicine: present and future.

Richard J Woodman, +1 more

- 08 Sep 2023

- Aging Clinical and Experimental Research

TL;DR: A broad taxonomy of machine learning algorithms is provided, followed by a more detailed description of each algorithm class, their purpose and capabilities, and examples of their applications, particularly in geriatric medicine.

•Journal Article•10.1049/cit2.12155

A deep convolutional neural network for diabetic retinopathy detection via mining local and long‐range dependence

Xiaoling Luo, +6 more

- 24 Jan 2023

- CAAI Transactions on Intelligence Techno...

TL;DR: The authors incorporate correlations between long-range patches into the deep learning framework to improve diabetic retinopathy (DR) detection; patch-wise relationships are used to enhance the local patch features, since DR lesions usually appear as plaques.

•Journal Article•10.1038/s41598-022-16089-3

Fractal dimension of retinal vasculature as an image quality metric for automated fundus image analysis systems

Xingzheng Lyu, +3 more

- 13 Jul 2022

- Scientific Reports

TL;DR: In this article, the authors propose the fractal dimension of retinal vasculature as an easy, effective and explainable indicator of retinal image quality, validated on 30,644 images from four public databases.

•Journal Article•10.1109/access.2023.3326528

Vision Transformer Model for Predicting the Severity of Diabetic Retinopathy in Fundus Photography-Based Retina Images

Waleed Nazih, +3 more

- IEEE Access

TL;DR: A novel ViT-based deep learning pipeline for detecting the severity stages of DR in fundus photographs using the FGADR dataset, which was able to capture the crucial features of retinal images to better characterize DR severity.
