Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images

Campanella, Gabriele; Kwan, Ricky; Fluder, Eugene; Zeng, Jennifer; Stock, Aryeh; Veremis, Brandon; Polydorides, Alexandros D.; Hedvat, Cyrus; Schoenfeld, Adam; Vanderbilt, Chad; Kovatch, Patricia; Cordon-Cardo, Carlos; Fuchs, Thomas J.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.07033 (cs)

[Submitted on 10 Oct 2023]

Title:Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images

Authors:Gabriele Campanella, Ricky Kwan, Eugene Fluder, Jennifer Zeng, Aryeh Stock, Brandon Veremis, Alexandros D. Polydorides, Cyrus Hedvat, Adam Schoenfeld, Chad Vanderbilt, Patricia Kovatch, Carlos Cordon-Cardo, Thomas J. Fuchs

View PDF

Abstract:Recent breakthroughs in self-supervised learning have enabled the use of large unlabeled datasets to train visual foundation models that can generalize to a variety of downstream tasks. While this training paradigm is well suited for the medical domain where annotations are scarce, large-scale pre-training in the medical domain, and in particular pathology, has not been extensively studied. Previous work in self-supervised learning in pathology has leveraged smaller datasets for both pre-training and evaluating downstream performance. The aim of this project is to train the largest academic foundation model and benchmark the most prominent self-supervised learning algorithms by pre-training and evaluating downstream performance on large clinical pathology datasets. We collected the largest pathology dataset to date, consisting of over 3 billion images from over 423 thousand microscopy slides. We compared pre-training of visual transformer models using the masked autoencoder (MAE) and DINO algorithms. We evaluated performance on six clinically relevant tasks from three anatomic sites and two institutions: breast cancer detection, inflammatory bowel disease detection, breast cancer estrogen receptor prediction, lung adenocarcinoma EGFR mutation prediction, and lung cancer immunotherapy response prediction. Our results demonstrate that pre-training on pathology data is beneficial for downstream performance compared to pre-training on natural images. Additionally, the DINO algorithm achieved better generalization performance across all tasks tested. The presented results signify a phase change in computational pathology research, paving the way into a new era of more performant models based on large-scale, parallel pre-training at the billion-image scale.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2310.07033 [cs.CV]
	(or arXiv:2310.07033v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.07033

Submission history

From: Gabriele Campanella [view email]
[v1] Tue, 10 Oct 2023 21:40:19 UTC (618 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators