MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network

doi:10.1109/CVPR.2017.378

Open AccessProceedings Article10.1109/CVPR.2017.378

MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network

Zizhao Zhang, +4 more

- 08 Jul 2017

- pp 3549-3557

393

TL;DR: This paper proposes MDNet to establish a direct multimodal mapping between medical images and diagnostic reports that can read images, generate diagnostic reports, retrieve images by symptom descriptions, and visualize attention, to provide justifications of the network diagnosis process.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Preprint•10.2139/ssrn.4625427

Automatic Report Generation Method for Ultrasound Assisted Diagnosis of Cervical Lymph Nodes

Xuehai Ding, +6 more

- 01 Jan 2023

TL;DR: A novel ultrasound report generation method for the auxiliary diagnosis of cervical lymph nodes is proposed. The method includes a classification network for metastatic prediction and a descriptive sentence generation network for ultrasound characteristics of cervical lymph nodes. The method achieved high accuracy in metastatic finding and description text generation.

...read moreread less

•Posted Content

Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles

Jevgenij Gamper, +1 more

- 08 Mar 2021

- arXiv: Computer Vision and Pattern Recog...

TL;DR: ArCH as mentioned in this paper is a computational pathology (CP) multiple instance captioning dataset to facilitate dense supervision of CP tasks, which contains dense diagnostic and morphological descriptions for a range of stains, tissue types and pathologies.

...read moreread less

•Posted Content

Learning Efficient, Explainable and Discriminative Representations for Pulmonary Nodules Classification

Hanliang Jiang, +3 more

- 19 Jan 2021

- arXiv: Image and Video Processing

TL;DR: Wang et al. as mentioned in this paper used neural architecture search (NAS) to automatically search 3D network architectures with excellent accuracy/speed trade-off and used the convolutional block attention module (CBAM) in the networks, which helps us understand the reasoning process.

...read moreread less

Journal Article•10.48550/arXiv.2203.06458

Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation

Fuhai Chen, +5 more

- 12 Mar 2022

- arXiv.org

TL;DR: Both quantitative comparisons and qualitative analysis demonstrate the effectiveness and the superiority of FAE-Gen over seven commonly-used metrics.

...read moreread less

Journal Article•10.1088/1361-6560/ad1995

Intensive vision-guided network for radiology report generation.

Fudan Zheng, +7 more

- 29 Dec 2023

- Physics in Medicine and Biology

TL;DR: This work proposes a Globally-intensive Attention (GIA) module in the medical image encoder to simulate and integrate multi-view vision perception, and designs a Visual Knowledge-guided Decoder (VKGD), which can adaptively consider how much the model needs to rely on visual information and previously predicted text to assist next word prediction.

...read moreread less

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

99K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 01 Jan 2015

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

51.9K

...

Expand

MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network

Chat with Paper

AI Agents for this Paper

Citations

Automatic Report Generation Method for Ultrasound Assisted Diagnosis of Cervical Lymph Nodes

Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles

Learning Efficient, Explainable and Discriminative Representations for Pulmonary Nodules Classification

Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation

Intensive vision-guided network for radiology report generation.

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Long short-term memory

Going deeper with convolutions

Very Deep Convolutional Networks for Large-Scale Image Recognition

Related Papers (5)

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

U-Net: Convolutional Networks for Biomedical Image Segmentation

ImageNet: A large-scale hierarchical image database

ImageNet Classification with Deep Convolutional Neural Networks