Publication | Closed Access
Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis
691
Citations
43
References
2022
Year
Convolutional Neural NetworkMedical Image SegmentationEngineeringMachine LearningBiomedical EngineeringVision TransformersImage AnalysisData SciencePattern RecognitionImage-based ModelingSelf-supervised Learning3D ImagingComputational ImagingRadiologySwin TransformersHealth SciencesMachine VisionComputer-assisted SurgeryMedical ImagingComputational PathologyImage GuidanceComputer ScienceDeep LearningMedical Image ComputingSegmentation TasksDeformation ReconstructionSegmentation ChallengeComputer VisionBiomedical ImagingMedical Image AnalysisImage SegmentationFoundation Models
Vision Transformers (ViT)s have shown great performance in self-supervised learning of global and local representations that can be transferred to downstream applications. Inspired by these results, we introduce a novel self-supervised learning framework with tailored proxy tasks for medical image analysis. Specifically, we propose: (i) a new 3D transformer-based model, dubbed Swin UNEt TRansformers (Swin UNETR), with a hierarchical encoder for self-supervised pretraining; (ii) tailored proxy tasks for learning the underlying pattern of human anatomy. We demonstrate successful pre-training of the proposed model on 5,050 publicly available computed tomography (CT) images from various body organs. The effectiveness of our approach is validated by fine-tuning the pre-trained models on the Beyond the Cranial Vault (BTCV) Segmentation Challenge with 13 abdominal organs and segmentation tasks from the Medical Segmentation Decathlon (MSD) dataset. Our model is currently the state-of-the-art on the public test leaderboards of both MSD <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> https://decathlon-10.grand-challenge.org/evaluation/challenge/leaderboard/ and BTCV <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> https://www.synapse.org/#!Synapse:syn3193805/wiki/217785/ datasets. Code: https://monai.io/research/swin-unetr.
| Year | Citations | |
|---|---|---|
Page 1
Page 1