MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network

doi:10.1109/CVPR.2017.378

Open AccessProceedings Article10.1109/CVPR.2017.378

MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network

Zizhao Zhang, +4 more

- 08 Jul 2017

- pp 3549-3557

393

TL;DR: This paper proposes MDNet to establish a direct multimodal mapping between medical images and diagnostic reports that can read images, generate diagnostic reports, retrieve images by symptom descriptions, and visualize attention, to provide justifications of the network diagnosis process.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article

Cross-modal Contrastive Attention Model for Medical Report Generation

Xiao Song, +4 more

TL;DR: This paper proposes a novel Cross-modal Contrastive Attention (CMCA) model to capture both visual and semantic information from similar cases, with mainly two modules: a Visual Contrastive attention Module for refining the unique abnormal regions compared to the retrieved case images; a Cross- modal Attention Module for matching the positive semantic Information from the case reports.

...read moreread less

•Proceedings Article•10.1109/CVPR46437.2021.01628

Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles

Jevgenij Gamper, +1 more

- 01 Jun 2021

TL;DR: It is shown that ARCH is the only CP dataset to (ARCH-)rival its computer vision analog MS-COCO Captions, and conjecture that an encoder pre-trained on dense image captions learns transferable representations for most CP tasks.

...read moreread less

Journal Article•10.48550/arXiv.2306.17180

Replace and Report: NLP Assisted Radiology Report Generation

Kaveri Kale, +2 more

- 19 Jun 2023

- arXiv.org

TL;DR: In this article , a template-based approach was proposed to generate radiology reports from radiographs by first creating small sentences for abnormal findings and then replacing them in the normal report template.

...read moreread less

Journal Article•10.48550/arXiv.2304.04920

Advancing Medical Imaging with Language Models: A Journey from N-grams to ChatGPT

Mingzhe Hu, +3 more

- 11 Apr 2023

- arXiv.org

TL;DR: A review and tutorial for researchers in the field of medical imaging using language models to improve their tasks at hand is provided in this article , where the potential benefits of accurate and efficient language models for medical imaging analysis, including improving clinical workflow efficiency, reducing diagnostic errors and assisting healthcare professionals in providing timely and accurate diagnoses.

...read moreread less

Preprint•10.48550/arxiv.2405.17002

UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models

Quan Van Nguyen, +5 more

- 27 May 2024

TL;DR: Diagnostic captioning models for radiology images using transformer models achieved high performance, contributing to the team's third-place finish in the ImageCLEFmedical Caption 2024 competition.

...read moreread less

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

99K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 01 Jan 2015

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

51.9K

...

Expand

MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network

Chat with Paper

AI Agents for this Paper

Citations

Cross-modal Contrastive Attention Model for Medical Report Generation

Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles

Replace and Report: NLP Assisted Radiology Report Generation

Advancing Medical Imaging with Language Models: A Journey from N-grams to ChatGPT

UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Long short-term memory

Going deeper with convolutions

Very Deep Convolutional Networks for Large-Scale Image Recognition

Related Papers (5)

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

U-Net: Convolutional Networks for Biomedical Image Segmentation

ImageNet: A large-scale hierarchical image database

ImageNet Classification with Deep Convolutional Neural Networks