Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles

doi:10.1109/CVPR46437.2021.01628

Open AccessProceedings Article10.1109/CVPR46437.2021.01628

Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles

Jevgenij Gamper, +1 more

- 01 Jun 2021

- pp 16549-16559

79

TL;DR: It is shown that ARCH is the only CP dataset to (ARCH-)rival its computer vision analog MS-COCO Captions, and conjecture that an encoder pre-trained on dense image captions learns transferable representations for most CP tasks.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arXiv.2206.06488

Multimodal Learning with Transformers: A Survey

Peng Xu, +2 more

- 13 Jun 2022

- IEEE Transactions on Pattern Analysis an...

TL;DR: A comprehensive survey of Transformer techniques oriented at multimodal data and a discussion of open problems and potential research directions for the community are presented.

...read moreread less

337

Journal Article•10.1038/s41591-023-02504-3

A visual–language foundation model for pathology image analysis using medical Twitter

Zhi Huang, +4 more

- 17 Aug 2023

- news@nature.com

TL;DR: This work develops pathology language–image pretraining (PLIP), a multimodal artificial intelligence with both image and text understanding, which is trained on OpenPath and enables users to retrieve similar cases by either image or natural language search, greatly facilitating knowledge sharing.

...read moreread less

308

•Journal Article•10.1109/tpami.2023.3275156

Multimodal Learning With Transformers: A Survey

01 Jan 2023

- IEEE Transactions on Pattern Analysis an...

TL;DR: Transformer is a promising neural network learner, and has achieved great success in various machine learning tasks as discussed by the authors , thanks to the recent prevalence of multimodal applications and Big Data, Transformer-based multimodAL learning has become a hot topic in AI research.

...read moreread less

278

•Journal Article•10.1038/s41746-023-00811-0

Self-supervised learning for medical image classification: a systematic review and implementation guidelines

Shih-Cheng Huang, +5 more

- 26 Apr 2023

- npj digital medicine

TL;DR: In this paper , the authors provide consistent descriptions of different self-supervised learning strategies and compose a systematic review of papers published between 2012 and 2022 on PubMed, Scopus, and ArXiv.

...read moreread less

174

Journal Article•10.1038/s41591-024-02856-4

A visual-language foundation model for computational pathology.

Ming Y. Lu, +12 more

- 01 Mar 2024

- news@nature.com

154

...

Expand

References

•Proceedings Article•10.1109/CVPR.2015.7299087

CIDEr: Consensus-based image description evaluation

Ramakrishna Vedantam, +2 more

- 07 Jun 2015

TL;DR: A novel paradigm for evaluating image descriptions that uses human consensus is proposed and a new automated metric that captures human judgment of consensus better than existing metrics across sentences generated by various sources is evaluated.

...read moreread less

5.6K

•Posted Content

Scaling Laws for Neural Language Models

Jared Kaplan, +9 more

- 23 Jan 2020

- arXiv: Learning

TL;DR: Larger models are significantly more sample-efficient, such that optimally compute-efficient training involves training very large models on a relatively modest amount of data and stopping significantly before convergence.

...read moreread less

3.3K

•Journal Article•10.1038/S41591-019-0508-1

Clinical-grade computational pathology using weakly supervised deep learning on whole slide images.

Gabriele Campanella, +11 more

- 15 Jul 2019

- Nature Medicine

TL;DR: A multiple instance learning-based deep learning system that uses only the reported diagnoses as labels for training, thereby avoiding expensive and time-consuming pixel-wise manual annotations, and has the ability to train accurate classification models at unprecedented scale.

...read moreread less

2.2K

•Proceedings Article•10.1109/ICCV.2019.00393

Digging Into Self-Supervised Monocular Depth Estimation

Clément Godard, +3 more

- 01 Oct 2019

TL;DR: In this paper, the authors propose a set of improvements, which together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods, and demonstrate the effectiveness of each component in isolation, and show high quality, state-of-theart results on the KITTI benchmark.

...read moreread less

1.8K

Proceedings Article•10.1109/CVPR.2019.00963

Panoptic Segmentation

Alexander Kirillov, +4 more

- 01 Jun 2019

TL;DR: A novel panoptic quality (PQ) metric is proposed that captures performance for all classes (stuff and things) in an interpretable and unified manner and is performed a rigorous study of both human and machine performance for PS on three existing datasets, revealing interesting insights about the task.

...read moreread less

1.8K