Purpose: Two-dimensional radiotherapy is often used to treat cervical cancer in low- and middle-income countries, but treatment planning can be challenging and time-consuming. Neural networks offer the potential to greatly decrease planning time through automation, but the impact of the wide range of hyperparameters to be set during training on model accuracy has not been exhaustively investigated. In the current study, we evaluated the effect of several convolutional neural network architectures and hyperparameters on 2D radiotherapy treatment field delineation.

Methods: Six commonly used deep learning architectures were trained to delineate four-field box apertures on digitally reconstructed radiographs for cervical cancer radiotherapy. A comprehensive search of optimal hyperparameters for all models was conducted by varying the initial learning rate, image normalization methods, and (when appropriate) convolutional kernel size, the number of learnable parameters via network depth and the number of feature maps per convolution, and nonlinear activation functions. This yielded over 1700 unique models, which were all trained until performance converged and then tested on a separate dataset.

Results: Of all hyperparameters, the choice of initial learning rate was most consistently significant for improved performance on the test set, with all top-performing models using learning rates of 0.0001. The optimal image normalization was not consistent across architectures. High overlap (mean Dice similarity coefficient = 0.98) and surface distance agreement (mean surface distance < 2 mm) were achieved between the treatment field apertures for all architectures using the identified best hyperparameters. Overlap Dice similarity coefficient (DSC) and distance metrics (mean surface distance and Hausdorff distance) indicated that DeepLabv3+ and D-LinkNet architectures were least sensitive to initial hyperparameter selection.

Conclusion: DeepLabv3+ and D-LinkNet are most robust to initial hyperparameter selection. Learning rate, nonlinear activation function, and kernel size are also important hyperparameters for improving performance.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10691634PMC
http://dx.doi.org/10.1002/acm2.14131DOI Listing

Publication Analysis

Top Keywords

deep learning
8
cervical cancer
8
initial learning
8
learning rate
8
image normalization
8
learning
5
identifying optimal
4
optimal deep
4
learning architecture
4
architecture parameters
4

Similar Publications

Objectives: To evaluate the performance of a 3D V-Net-based segmentation model of adrenal lesions in characterizing adrenal glands as normal or abnormal.

Methods: A total of 1086 CT image series with focal adrenal lesions were retrospectively collected, annotated, and used for the training of the adrenal lesion segmentation model. The dice similarity coefficient (DSC) of the test set was used to evaluate the segmentation performance.

View Article and Find Full Text PDF

Improved deep canonical correlation fusion approach for detection of early mild cognitive impairment.

Med Biol Eng Comput

January 2025

Non-Invasive Imaging and Diagnostic Laboratory, Department of Applied Mechanics and Biomedical Engineering, Indian Institute of Technology Madras, Chennai, India.

Detection of early mild cognitive impairment (EMCI) is clinically challenging as it involves subtle alterations in multiple brain sub-anatomic regions. Among different brain regions, the corpus callosum and lateral ventricles are primarily affected due to EMCI. In this study, an improved deep canonical correlation analysis (CCA) based framework is proposed to fuse magnetic resonance (MR) image features from lateral ventricular and corpus callosal structures for the detection of EMCI condition.

View Article and Find Full Text PDF

Cu-Ni Oxidation Mechanism Unveiled: A Machine Learning-Accelerated First-Principles and TEM Study.

Nano Lett

January 2025

Department of Mechanical Engineering & Materials Science, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States.

The development of accurate methods for determining how alloy surfaces spontaneously restructure under reactive and corrosive environments is a key, long-standing, grand challenge in materials science. Using machine learning-accelerated density functional theory and rare-event methods, in conjunction with environmental transmission electron microscopy (ETEM), we examine the interplay between surface reconstructions and preferential segregation tendencies of CuNi(100) surfaces under oxidation conditions. Our modeling approach predicts that oxygen-induced Ni segregation in CuNi alloys favors Cu(100)-O c(2 × 2) reconstruction and destabilizes the Cu(100)-O (2√2 × √2)45° missing row reconstruction (MRR).

View Article and Find Full Text PDF

VirDetect-AI: a residual and convolutional neural network-based metagenomic tool for eukaryotic viral protein identification.

Brief Bioinform

November 2024

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos 62210, México.

This study addresses the challenging task of identifying viruses within metagenomic data, which encompasses a broad array of biological samples, including animal reservoirs, environmental sources, and the human body. Traditional methods for virus identification often face limitations due to the diversity and rapid evolution of viral genomes. In response, recent efforts have focused on leveraging artificial intelligence (AI) techniques to enhance accuracy and efficiency in virus detection.

View Article and Find Full Text PDF

Deep Learning to Simulate Contrast-Enhanced MRI for Evaluating Suspected Prostate Cancer.

Radiology

January 2025

From the Department of Radiology, Shenzhen Nanshan People's Hospital, Shenzhen University, Taoyuan Rd No. 89, Nanshan District, Shenzhen 518000, Guangdong, China (H.H., Z.D., Y.Q.); Medical AI Laboratory and Guangdong Key Laboratory of Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen, China (J.M., R.L., B.H.); Department of Medical Imaging, People's Hospital of Longhua, Shenzhen, Guangdong, China (X.P., Y.Z.); and Department of Radiology, Shenzhen People's Hospital, Shenzhen, Guangdong, China (D.Z., G.H.).

Background Multiparametric MRI, including contrast-enhanced sequences, is recommended for evaluating suspected prostate cancer, but concerns have been raised regarding potential contrast agent accumulation and toxicity. Purpose To evaluate the feasibility of generating simulated contrast-enhanced MRI from noncontrast MRI sequences using deep learning and to explore their potential value for assessing clinically significant prostate cancer using Prostate Imaging Reporting and Data System (PI-RADS) version 2.1.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!