Among the various types of data augmentation strategies, the mixup-based approach has been particularly studied. However, in existing mixup-based approaches, object loss and label mismatching can occur if random patches are utilized when constructing augmented images, and additionally, patches that do not contain objects might be included, which degrades performance. In this paper, we propose a novel augmentation method that mixes patches in a non-overlapping manner after they are extracted from the salient regions in an image. The suggested method can make effective use of object characteristics, because the constructed image consists only of visually important regions and is robust to noise. Since the patches do not occlude each other, the semantically meaningful information in the salient regions can be fully utilized. Additionally, our method is more robust to adversarial attack than the conventional augmentation method. In the experimental results, when Wide ResNet was trained on the public datasets, CIFAR-10, CIFAR-100 and STL-10, the top-1 accuracy was 97.26%, 83.99% and 82.40% respectively, which surpasses other augmentation methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9560504PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0274767PLOS

Publication Analysis

Top Keywords

data augmentation
8
augmentation method
8
salient regions
8
augmentation
5
saliency guided
4
guided data
4
augmentation strategy
4
strategy maximally
4
maximally utilizing
4
utilizing object's
4

Similar Publications

Background: Recent advances in artificial intelligence have facilitated the automatic diagnosis of middle ear diseases using endoscopic tympanic membrane imaging.

Aim: We aimed to develop an automated diagnostic system for middle ear diseases by applying deep learning techniques to tympanic membrane images obtained during routine clinical practice.

Material And Methods: To augment the training dataset, we explored the use of generative adversarial networks (GANs) to produce high-quality synthetic tympanic images that were subsequently added to the training data.

View Article and Find Full Text PDF

: Hip fractures are prevalent among the elderly and impose a significant burden on healthcare systems due to the associated high morbidity and costs. The increasing use of intramedullary nails for hip fracture fixation has inadvertently introduced risks; these implants can alter bone elasticity and create stress concentrations, leading to peri-implant fractures. The aim of this study is to investigate the outcomes of peri-implant hip fractures, evaluate the potential causes of such fractures, determine the type of treatment provided, assess the outcomes of said treatments, and establish possible improvement strategies.

View Article and Find Full Text PDF

: Sinus lifting, a procedure to augment bone in the maxilla, may cause complications such as sinusitis due to impaired drainage. This study aimed to assess how sinus lifting impacts airflow in the sinus cavity, which is essential for patients undergoing dental implants. Using computational fluid dynamics (CFD), this research analyzed airflow changes after sinus floor elevation, offering insights into the aerodynamic consequences of the procedure.

View Article and Find Full Text PDF

To address the problems that exist in the target detection of vehicle-mounted visual sensors in foggy environments, a vehicle target detection method based on an improved YOLOX network is proposed. Firstly, to address the issue of vehicle target feature loss in foggy traffic scene images, specific characteristics of fog-affected imagery are integrated into the network training process. This not only augments the training data but also improves the robustness of the network in foggy environments.

View Article and Find Full Text PDF

Advanced Brain Tumor Classification in MR Images Using Transfer Learning and Pre-Trained Deep CNN Models.

Cancers (Basel)

January 2025

Department of Computer Science, Faculty of Information Technology and Electrical Engineering, Norwegian University of Science and Technology, 2815 Gjøvik, Norway.

Background/objectives: Brain tumor classification is a crucial task in medical diagnostics, as early and accurate detection can significantly improve patient outcomes. This study investigates the effectiveness of pre-trained deep learning models in classifying brain MRI images into four categories: Glioma, Meningioma, Pituitary, and No Tumor, aiming to enhance the diagnostic process through automation.

Methods: A publicly available Brain Tumor MRI dataset containing 7023 images was used in this research.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!