State-of-the-art object detection models need large and diverse datasets for training. As these are hard to acquire for many practical applications, training images from simulation environments gain more and more attention. A problem arises as deep learning models trained on simulation images usually have problems generalizing to real-world images shown by a sharp performance drop. Definite reasons and influences for this performance drop are not yet found. While previous work mostly investigated the influence of the data as well as the use of domain adaptation, this work provides a novel perspective by investigating the influence of the object detection model itself. Against this background, first, a corresponding measure called is defined, comprising the capability of an object detection model to generalize from simulation training images to real-world evaluation images. Second, 12 different deep learning-based object detection models are trained and their sim-to-real generalizability is evaluated. The models are trained with a variation of hyperparameters resulting in a total of 144 trained and evaluated versions. The results show a clear influence of the feature extractor and offer further insights and correlations. They open up future research on investigating influences on the sim-to-real generalizability of deep learning-based object detection models as well as on developing feature extractors that have better sim-to-real generalizability capabilities.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11509078PMC
http://dx.doi.org/10.3390/jimaging10100259DOI Listing

Publication Analysis

Top Keywords

object detection
24
sim-to-real generalizability
16
detection models
16
models trained
12
generalizability deep
8
deep learning
8
training images
8
performance drop
8
detection model
8
deep learning-based
8

Similar Publications

Deep Learning for Staging Periodontitis Using Panoramic Radiographs.

Oral Dis

January 2025

Salivary Gland Disease Center and Beijing Key Laboratory of Tooth Regeneration and Function Reconstruction, Beijing Laboratory of Oral Health and Beijing Stomatological Hospital, Capital Medical University, Beijing, China.

Objectives: Utilizing a deep learning approach is an emerging trend to improve the efficiency of periodontitis diagnosis and classification. This study aimed to use an object detection model to automatically annotate the anatomic structure and subsequently classify the stages of radiographic bone loss (RBL).

Materials And Methods: In all, 558 panoramic radiographs were cropped to 7359 pieces of individual teeth.

View Article and Find Full Text PDF

Ensuring the integrity of shipping containers is crucial for maintaining product quality, logistics efficiency, and safety in the global supply chain. Damaged containers can lead to significant economic losses, delays, and safety hazards. Traditionally, container inspections have been manual, which are labor-intensive, time-consuming, and error-prone, especially in busy port environments.

View Article and Find Full Text PDF

One of the promising sources for creating specialized foods is the biomass of Arthrospira platensis food microalgae. Biomass of A. platensis and its aqueous extracts are used as a source of bioactive compounds, primarily phycocyanins which are protein macromolecules that largely determine the antioxidant, immunomodulatory and anti-inflammatory properties of this cyanobacterium.

View Article and Find Full Text PDF

Bioimaging probes based on carbon dots (CDs) can become a useful replacement for existing commercial probes, benefiting clinical diagnostics. While the development of dual-mode CD-based probes for magnetic resonance imaging (MRI), which provides the ability for photoluminescence (PL) detection at the same time, is ongoing, several challenges have to be addressed. First, most of the CD-based probes still emit at shorter wavelengths (blue/green spectral range), which is harmful to biological objects or have very low PL intensity in the biological window of tissue transparency (red/near-infrared spectral range).

View Article and Find Full Text PDF

Brain-controlled robotic arm systems are designed to provide a method of communication and control for individuals with limited mobility or communication abilities. These systems can be beneficial for people who have suffered from a spinal cord injury, stroke, or neurological disease that affects their motor abilities. The ability of a person to control a robotic arm to reach and grasp multiple objects using their brain signals.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!