Medical Image Segmentation Using Transformer Networks.

IEEE Access

Department of Radiology, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA.

Published: March 2022

Deep learning models represent the state of the art in medical image segmentation. Most of these models are fully-convolutional networks (FCNs), namely each layer processes the output of the preceding layer with convolution operations. The convolution operation enjoys several important properties such as sparse interactions, parameter sharing, and translation equivariance. Because of these properties, FCNs possess a strong and useful inductive bias for image modeling and analysis. However, they also have certain important shortcomings, such as performing a fixed and pre-determined operation on a test image regardless of its content and difficulty in modeling long-range interactions. In this work we show that a different deep neural network architecture, based entirely on self-attention between neighboring image patches and without any convolution operations, can achieve more accurate segmentations than FCNs. Our proposed model is based directly on the transformer network architecture. Given a 3D image block, our network divides it into non-overlapping 3D patches and computes a 1D embedding for each patch. The network predicts the segmentation map for the block based on the self-attention between these patch embeddings. Furthermore, in order to address the common problem of scarcity of labeled medical images, we propose methods for pre-training this model on large corpora of unlabeled images. Our experiments show that the proposed model can achieve segmentation accuracies that are better than several state of the art FCN architectures on two datasets. Our proposed network can be trained using only tens of labeled images. Moreover, with the proposed pre-training strategies, our network outperforms FCNs when labeled training data is small.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9159704PMC
http://dx.doi.org/10.1109/access.2022.3156894DOI Listing

Publication Analysis

Top Keywords

medical image
8
image segmentation
8
state art
8
convolution operations
8
network architecture
8
proposed model
8
network
6
image
5
segmentation
4
segmentation transformer
4

Similar Publications

We sought to evaluate the intracardiac morphology and associated cardiovascular anomalies in patients with double inlet right ventricle (DIRV) on multidetector CT angiography. A retrospective search of our departmental database was conducted from January 2014 to January 2023 to identify patients with a diagnosis of DIRV on CT angiography. The intracardiac anatomy and associated cardiovascular abnormalities were systematically evaluated.

View Article and Find Full Text PDF

Background: Knee injuries resulting in purely cartilaginous defects are rare, and controversy remains regarding the reliability of chondral-only fixation.

Purpose: To systematically review the literature for fixation methods and outcomes after primary fixation of chondral-only defects within the knee.

Study Design: Systematic review; Level of evidence, 5.

View Article and Find Full Text PDF

Background: Studies are still limited on the isolated effect of retear after arthroscopic rotator cuff repair (ARCR) on functional outcomes after the midterm period.

Purpose: To assess the effect of retear at midterm follow-up after ARCR and to identify factors associated with the need for revision surgery.

Study Design: Cohort study; Level of evidence, 3.

View Article and Find Full Text PDF

Comparative Analysis of Gelatin/Polylactic Acid and Commercial PLA Membranes for Guided Bone Regeneration: A Randomized Clinical Trial.

Med Sci Monit

January 2025

Department of Oral Implantology, The Affiliated Stomatology Hospital, Jiangxi Medical College, Nanchang University, Jiangxi Province Key Laboratory of Oral Biomedicine, Jiangxi Province Clinical Research Center for Oral Disease, Nanchang, Jiangxi, China.

BACKGROUND This study included 32 patients with single missing teeth and alveolar bone defects and aimed to compare outcomes from guided bone regeneration with a gelatin/polylactic acid (GT/PLA) barrier membrane and a Guidor® bioresorbable matrix barrier dental membrane. MATERIAL AND METHODS A total of 32 participants were recruited in the clinical study, with single missing teeth and alveolar bone defects, requiring guided bone regeneration (32 missing teeth in total). They were randomly divided into the GT/PLA membrane group (experimental) and Guidor® membrane group (control) by the envelope method (n=16).

View Article and Find Full Text PDF

Adequate intraoperative visualization is mandatory for implant application in pelvic ring injuries. Several fluoroscopic X-ray views are in practical use. The gold standard primary X-ray is the anteroposterior view of the pelvis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!