Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation.

Sensors (Basel)

Department of Computer Science and Engineering, University of Bologna, 40126 Bologna, Italy.

Published: May 2023

In the realm of computer vision, semantic segmentation is the task of recognizing objects in images at the pixel level. This is done by performing a classification of each pixel. The task is complex and requires sophisticated skills and knowledge about the context to identify objects' boundaries. The importance of semantic segmentation in many domains is undisputed. In medical diagnostics, it simplifies the early detection of pathologies, thus mitigating the possible consequences. In this work, we provide a review of the literature on deep ensemble learning models for polyp segmentation and develop new ensembles based on convolutional neural networks and transformers. The development of an effective ensemble entails ensuring diversity between its components. To this end, we combined different models (HarDNet-MSEG, Polyp-PVT, and HSNet) trained with different data augmentation techniques, optimization methods, and learning rates, which we experimentally demonstrate to be useful to form a better ensemble. Most importantly, we introduce a new method to obtain the segmentation mask by averaging intermediate masks after the sigmoid layer. In our extensive experimental evaluation, the average performance of the proposed ensembles over five prominent datasets beat any other solution that we know of. Furthermore, the ensembles also performed better than the state-of-the-art on two of the five datasets, when individually considered, without having been specifically trained for them.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10224477PMC
http://dx.doi.org/10.3390/s23104688DOI Listing

Publication Analysis

Top Keywords

convolutional neural
8
neural networks
8
networks transformers
8
polyp segmentation
8
semantic segmentation
8
segmentation
5
ensembles
4
ensembles convolutional
4
transformers polyp
4
segmentation realm
4

Similar Publications

Coronary artery calcification (CAC) is a key marker of coronary artery disease (CAD) but is often underreported in cancer patients undergoing non-gated CT or PET/CT scans. Traditional CAC assessment requires gated CT scans, leading to increased radiation exposure and the need for specialized personnel. This study aims to develop an artificial intelligence (AI) method to automatically detect CAC from non-gated, freely-breathing, low-dose CT images obtained from positron emission tomography/computed tomography scans.

View Article and Find Full Text PDF

Objectives: To investigate the performance of a deep learning (DL) model for segmenting cone-beam computed tomography (CBCT) scans taken before and after mandibular horizontal guided bone regeneration (GBR) to evaluate hard tissue changes.

Materials And Methods: The proposed SegResNet-based DL model was trained on 70 CBCT scans. It was tested on 10 pairs of pre- and post-operative CBCT scans of patients who underwent mandibular horizontal GBR.

View Article and Find Full Text PDF

Background: Effective pain recognition and treatment in perioperative environments reduce length of stay and decrease risk of delirium and chronic pain. We sought to develop and validate preliminary computer vision-based approaches for nociception detection in hospitalized patients.

Methods: Prospective observational cohort study using red-green-blue camera detection of perioperative patients.

View Article and Find Full Text PDF

Objectives: To automatically identify and diagnose bladder outflow obstruction (BOO) and detrusor underactivity (DUA) in male patients with lower urinary tract symptoms through urodynamics exam.

Patients And Methods: We performed a retrospective review of 1949 male patients who underwent a urodynamic study at two institutions. Deep Convolutional Neural Networks scheme combined with a short-time Fourier transform algorithm was trained to perform an accurate diagnosis of BOO and DUA, utilizing five-channel urodynamic data (consisting of uroflowmetry, urine volume, intravesical pressure, abdominal pressure, and detrusor pressure).

View Article and Find Full Text PDF

Background And Purpose: Endovascular thrombectomy (EVT) is the standard for acute ischemic stroke from large vessel occlusion, but post-EVT functional independence varies. Brain atrophy, linked to higher cerebrospinal fluid volume (CSFV), may affect outcomes. Baseline CSFV could predict EVT benefit by assessing brain health.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!