Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer.
Babak Ehteshami Bejnordi,Mitko Veta,Paul J. van Diest,Bram van Ginneken,Nico Karssemeijer,Geert Litjens,Jeroen van der Laak,Meyke Hermsen,Quirine F. Manson,Maschenka Balkenhol,Oscar Geessink,N. Stathonikos,Marcory C. R. F. van Dijk,Peter Bult,Francisco Beca,Andrew H. Beck,Dayong Wang,Aditya Khosla,Rishab Gargeya,Humayun Irshad,Aoxiao Zhong,Qi Dou,Qi Dou,Quanzheng Li,Hao Chen,Huangjing Lin,Pheng-Ann Heng,Christian Haß,Elia Bruni,Quincy Wong,Ugur Halici,Mustafa Umit Oner,Rengul Cetin-Atalay,Matt Berseth,Vitali Khvatkov,Alexei Vylegzhanin,Oren Kraus,Muhammad Shaban,Nasir M. Rajpoot,Nasir M. Rajpoot,Ruqayya Awan,Korsuk Sirinukunwattana,Talha Qaiser,Yee-Wah Tsang,David Tellez,Jonas Annuscheit,Peter Hufnagl,Mira Valkonen,Kimmo Kartasalo,Kimmo Kartasalo,Leena Latonen,Pekka Ruusuvuori,Pekka Ruusuvuori,Kaisa Liimatainen,Shadi Albarqouni,Bharti Mungal,Ami George,Stefanie Demirci,Nassir Navab,Seiryo Watanabe,Shigeto Seno,Yoichi Takenaka,Hideo Matsuda,Hady Ahmady Phoulady,Vassili Kovalev,A. Kalinovsky,Vitali Liauchuk,Gloria Bueno,M. Milagro Fernández-Carrobles,Ismael Serrano,Oscar Deniz,Daniel Racoceanu,Daniel Racoceanu,Rui Venâncio +73 more
TL;DR: In the setting of a challenge competition, some deep learning algorithms achieved better diagnostic performance than a panel of 11 pathologists participating in a simulation exercise designed to mimic routine pathology workflow; algorithm performance was comparable with an expert pathologist interpreting whole-slide images without time constraints.
read more
Abstract: Importance Application of deep learning algorithms to whole-slide pathology images can potentially improve diagnostic accuracy and efficiency. Objective Assess the performance of automated deep learning algorithms at detecting metastases in hematoxylin and eosin–stained tissue sections of lymph nodes of women with breast cancer and compare it with pathologists’ diagnoses in a diagnostic setting. Design, Setting, and Participants Researcher challenge competition (CAMELYON16) to develop automated solutions for detecting lymph node metastases (November 2015-November 2016). A training data set of whole-slide images from 2 centers in the Netherlands with (n = 110) and without (n = 160) nodal metastases verified by immunohistochemical staining were provided to challenge participants to build algorithms. Algorithm performance was evaluated in an independent test set of 129 whole-slide images (49 with and 80 without metastases). The same test set of corresponding glass slides was also evaluated by a panel of 11 pathologists with time constraint (WTC) from the Netherlands to ascertain likelihood of nodal metastases for each slide in a flexible 2-hour session, simulating routine pathology workflow, and by 1 pathologist without time constraint (WOTC). Exposures Deep learning algorithms submitted as part of a challenge competition or pathologist interpretation. Main Outcomes and Measures The presence of specific metastatic foci and the absence vs presence of lymph node metastasis in a slide or image using receiver operating characteristic curve analysis. The 11 pathologists participating in the simulation exercise rated their diagnostic confidence as definitely normal, probably normal, equivocal, probably tumor, or definitely tumor. Results The area under the receiver operating characteristic curve (AUC) for the algorithms ranged from 0.556 to 0.994. The top-performing algorithm achieved a lesion-level, true-positive fraction comparable with that of the pathologist WOTC (72.4% [95% CI, 64.3%-80.4%]) at a mean of 0.0125 false-positives per normal whole-slide image. For the whole-slide image classification task, the best algorithm (AUC, 0.994 [95% CI, 0.983-0.999]) performed significantly better than the pathologists WTC in a diagnostic simulation (mean AUC, 0.810 [range, 0.738-0.884];P Conclusions and Relevance In the setting of a challenge competition, some deep learning algorithms achieved better diagnostic performance than a panel of 11 pathologists participating in a simulation exercise designed to mimic routine pathology workflow; algorithm performance was comparable with an expert pathologist interpreting whole-slide images without time constraints. Whether this approach has clinical utility will require evaluation in a clinical setting.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Identification of misdiagnosis by deep neural networks on a histopathologic review of breast cancer lymph node metastases
Cancan Chen,Shan Zheng,Lei Guo,Xuebing Yang,Yan Song,Zhuo Li,Yanwu Zhu,Xiaoqi Liu,Qingzhuang Li,Huijuan Zhang,Ning Feng,Zuxuan Zhao,Tinglin Qiu,J. Du,Qiang Guo,Wensheng Zhang,Wenzhao Shi,Jianhui Ma,Fenglong Sun +18 more
TL;DR: Wang et al. as discussed by the authors presented a relative risk classification and regression tree (RRCART) to identify the misdiagnosed whole-slide images (WSIs) and recommend them to be reviewed by pathologists.
Machine learning and natural language processing (NLP) approach to predict early progression to first-line treatment in real-world hormone receptor-positive (HR+)/HER2-negative advanced breast cancer patients.
Nuria Ribelles,José M. Jerez,Pablo Rodriguez-Brazzarola,Begoña Jimenez,Tamara Diaz-Redondo,Héctor Mesa,Antonia Márquez,Alfonso Sánchez-Muñoz,Bella Pajares,Francisco Carabantes,María José Bermejo,Ester Villar,Maria E. Dominguez-Recio,Enrique Saez,Laura Galvez,Ana Godoy,Leo Franco,Sofia Ruiz-Medina,Irene López,Emilio Alba +19 more
TL;DR: In this paper, the authors developed predictive models for early and late progression to first-line treatment of HR+/HER2-negative metastatic breast cancer, also finding that NLP-based machine learning models are slightly better than predictive models based on manually obtained data.
14
•Book
Evidence-Based Diagnosis: An Introduction to Clinical Epidemiology
Thomas B. Newman,Michael Kohn +1 more
- 25 Jun 2020
TL;DR: Based on decades of experience teaching in a clinical research training program, this fully updated second edition is an essential guide for anyone looking to select, develop, or market medical tests as mentioned in this paper, which includes the diagnostic process, reliability and accuracy of tests, and quantifying treatment benefits using randomized trials.
14
Applications of Artificial Intelligence in Healthcare
TL;DR: The goal of this study is to keep track of current advancements in science, understand technological availability, recognize the enormous power of AI in healthcare, and encourage scientists to use AI in their related fields of research.
Monkeypox Disease Diagnosis using Machine Learning Approach
01 Dec 2022
TL;DR: In this paper , a machine learning-based method for analyzing RGB images for signs of monkey pox was presented, which employed 3 convolutional neural network (CNN) models and 6 machine learning classifiers (MLCs).
14
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
•Posted Content
Deep Residual Learning for Image Recognition
TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
117.9K
Random Forests
Leo Breiman
- 01 Oct 2001
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
•Proceedings Article
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan,Andrew Zisserman +1 more
- 04 Sep 2014
TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
102.6K
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
[...]
Karen Simonyan,Andrew Zisserman +1 more
- 04 Sep 2014