Segment anything model for medical images?

Med Image Anal

National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen, China; Medical UltraSound Image Computing (MUSIC) Lab, Shenzhen University, Shenzhen, China; Marshall Laboratory of Biomedical Engineering, Shenzhen University, Shenzhen, China. Electronic address:

Published: February 2024

The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging because of the complex modalities, fine anatomical structures, uncertain and complex object boundaries, and wide-range object scales. To fully validate SAM's performance on medical data, we collected and sorted 53 open-source datasets and built a large medical segmentation dataset with 18 modalities, 84 objects, 125 object-modality paired targets, 1050K 2D images, and 6033K masks. We comprehensively analyzed different models and strategies on the so-called COSMOS 1050K dataset. Our findings mainly include the following: (1) SAM showed remarkable performance in some specific objects but was unstable, imperfect, or even totally failed in other situations. (2) SAM with the large ViT-H showed better overall performance than that with the small ViT-B. (3) SAM performed better with manual hints, especially box, than the Everything mode. (4) SAM could help human annotation with high labeling quality and less time. (5) SAM was sensitive to the randomness in the center point and tight box prompts, and may suffer from a serious performance drop. (6) SAM performed better than interactive methods with one or a few points, but will be outpaced as the number of points increases. (7) SAM's performance correlated to different factors, including boundary complexity, intensity differences, etc. (8) Finetuning the SAM on specific medical tasks could improve its average DICE performance by 4.39% and 6.68% for ViT-B and ViT-H, respectively. Codes and models are available at: https://github.com/yuhoo0302/Segment-Anything-Model-for-Medical-Images. We hope that this comprehensive report can help researchers explore the potential of SAM applications in MIS, and guide how to appropriately use and develop SAM.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.media.2023.103061DOI Listing

Publication Analysis

Top Keywords

image segmentation
12
sam
10
segment model
8
sam's performance
8
sam performed
8
performed better
8
performance
6
medical
5
model medical
4
medical images?
4

Similar Publications

Background: The presence of significant tortuosity in access routes to aneurysms can interfere with catheter guidance and manipulation and significantly impact treatment strategies.

Observations: In this report, the authors combined intentional staged aneurysm embolization with the construction of a new direct access route, which they call a "highway bypass," for a symptomatic posterior circulation cerebral aneurysm that was difficult to access with a catheter. Notably, the highway bypass is used for catheter passage, and technical tips should be considered.

View Article and Find Full Text PDF

Background: External ventricular drains (EVDs) provide an invaluable diagnostic method for accessing cerebrospinal fluid and therapeutically treating elevated intracranial pressure. Although complications including hemorrhage and infection have been well documented, the formation of iatrogenic pseudoaneurysms following EVD placement has rarely been reported. The authors present a case of this exceedingly rare complication of iatrogenic pseudoaneurysm formation following EVD placement.

View Article and Find Full Text PDF

Objectives: This study aimed to develop an automated method for generating clearer, well-aligned panoramic views by creating an optimized three-dimensional (3D) reconstruction zone centered on the teeth. The approach focused on achieving high contrast and clarity in key dental features, including tooth roots, morphology, and periapical lesions, by applying a 3D U-Net deep learning model to generate an arch surface and align the panoramic view.

Methods: This retrospective study analyzed anonymized cone-beam CT (CBCT) scans from 312 patients (mean age 40 years; range 10-78; 41.

View Article and Find Full Text PDF

Bruises can affect the appearance and nutritional value of apples and cause economic losses. Therefore, the accurate detection of bruise levels and bruise time of apples is crucial. In this paper, we proposed a method that combines a self-designed multispectral imaging system with deep learning to accurately detect the level and time of bruising on apples.

View Article and Find Full Text PDF

Purpose: The impact of ventriculomegaly (VM) on cortical development and brain functionality has been extensively explored in existing literature. VM has been associated with higher risks of attention-deficit and hyperactivity disorders, as well as cognitive, language, and behavior deficits. Some studies have also shown a relationship between VM and cortical overgrowth, along with reduced cortical folding, both in fetuses and neonates.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!