Endoscopic high-speed video (HSV) systems for visualization and assessment of vocal fold dynamics in the larynx are diverse and technically advancing. To consider resulting "concepts shifts" for neural network (NN)-based image processing, re-training of already trained and used NNs is necessary to allow for sufficiently accurate image processing for new recording modalities. We propose and discuss several re-training approaches for convolutional neural networks (CNN) being used for HSV image segmentation. Our baseline CNN was trained on the BAGLS data set (58,750 images). The new BAGLS-RT data set consists of additional 21,050 images from previously unused HSV systems, light sources, and different spatial resolutions. Results showed that increasing data diversity by means of preprocessing already improves the segmentation accuracy (mIoU + 6.35%). Subsequent re-training further increases segmentation performance (mIoU + 2.81%). For re-training, finetuning with dynamic knowledge distillation showed the most promising results. Data variety for training and additional re-training is a helpful tool to boost HSV image segmentation quality. However, when performing re-training, the phenomenon of catastrophic forgetting should be kept in mind, i.e., adaption to new data while forgetting already learned knowledge.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10427138 | PMC |
http://dx.doi.org/10.3390/app12199791 | DOI Listing |
Acad Radiol
January 2025
Department of Radiology and Nuclear Medicine, German Heart Center Munich, Lazarettstraße 36, 80636 Munich, Germany (K.K.B.).
Rationale And Objectives: Training Convolutional Neural Networks (CNN) requires large datasets with labeled data, which can be very labor-intensive to prepare. Radiology reports contain a lot of potentially useful information for such tasks. However, they are often unstructured and cannot be directly used for training.
View Article and Find Full Text PDFSci Total Environ
January 2025
Interdisciplinary Lab for Mathematical Ecology and Epidemiology & Department of Mathematical and Statistical Sciences, University of Alberta, Canada. Electronic address:
Prompt and accurate monitoring of cyanobacterial blooms is essential for public health management and understanding aquatic ecosystem dynamics. Remote sensing, in particular satellite observations, presents a good alternative for continuous monitoring. This study employs multispectral images from the Sentinel-2 constellation alongside ERA5-Land to enable broad-scale data acquisition.
View Article and Find Full Text PDFComput Biol Med
January 2025
Department of Computing Technologies, School of Computing, SRM Institute of Science and Technology, Kattankulathur, Tamil Nadu, 603203, India. Electronic address:
This research work focuses on developing an advanced diagnostic method for thyroid nodules using ultrasonography images. The core idea revolves around the observation that the presence and amount of calcium flecks in thyroid nodules can indicate their severity, potentially leading to severe thyroid cancer. A novel technique, named Bilateral Mean Clustering Strategy (Bi-MCS), is proposed, combining the strengths of Fuzzy C mean and K-mean clustering approaches.
View Article and Find Full Text PDFNeural Netw
December 2024
College of Computer and Data Science, Fuzhou University, Fuzhou, 350108, China; Key Laboratory of Intelligent Metro, Fujian Province University, Fuzhou, 350108, China. Electronic address:
Graph convolutional networks have achieved remarkable success in the field of multi-view learning. Unfortunately, most graph convolutional network-based multi-view learning methods fail to capture long-range dependencies due to the over-smoothing problem. Many studies have attempted to mitigate this issue by decoupling graph convolution operations.
View Article and Find Full Text PDFCurrent neural network models of primate vision focus on replicating overall levels of behavioral accuracy, often neglecting perceptual decisions' rich, dynamic nature. Here, we introduce a novel computational framework to model the dynamics of human behavioral choices by learning to align the temporal dynamics of a recurrent neural network (RNN) to human reaction times (RTs). We describe an approximation that allows us to constrain the number of time steps an RNN takes to solve a task with human RTs.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!