Saliency guided data augmentation strategy for maximally utilizing an object's visual information.

Junhyeok An Soojin Jang Junehyoung Kwon Kyohoon Jin YoungBin Kim

PLoS One

Department of Image Science and Arts, Chung-Ang University, Dongjak, Seoul, Korea.

Published: October 2022

Among the various types of data augmentation strategies, the mixup-based approach has been particularly studied. However, in existing mixup-based approaches, object loss and label mismatching can occur if random patches are utilized when constructing augmented images, and additionally, patches that do not contain objects might be included, which degrades performance. In this paper, we propose a novel augmentation method that mixes patches in a non-overlapping manner after they are extracted from the salient regions in an image. The suggested method can make effective use of object characteristics, because the constructed image consists only of visually important regions and is robust to noise. Since the patches do not occlude each other, the semantically meaningful information in the salient regions can be fully utilized. Additionally, our method is more robust to adversarial attack than the conventional augmentation method. In the experimental results, when Wide ResNet was trained on the public datasets, CIFAR-10, CIFAR-100 and STL-10, the top-1 accuracy was 97.26%, 83.99% and 82.40% respectively, which surpasses other augmentation methods.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9560504	PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0274767	PLOS

Publication Analysis

Top Keywords

data augmentation

augmentation method

salient regions

augmentation

saliency guided

guided data

augmentation strategy

strategy maximally

maximally utilizing

utilizing object's

Similar Publications

Deep learning multi-classification of middle ear diseases using synthetic tympanic images.

Acta Otolaryngol

January 2025

Department of Otorhinolaryngology, Institute of Science Tokyo, Tokyo, Japan.

Yoshimaru Mizoguchi Taku Ito Masato Yamada Takeshi Tsutsumi

Background: Recent advances in artificial intelligence have facilitated the automatic diagnosis of middle ear diseases using endoscopic tympanic membrane imaging.

Aim: We aimed to develop an automated diagnostic system for middle ear diseases by applying deep learning techniques to tympanic membrane images obtained during routine clinical practice.

Material And Methods: To augment the training dataset, we explored the use of generative adversarial networks (GANs) to produce high-quality synthetic tympanic images that were subsequently added to the training data.

View Article and Find Full Text PDF

Similar Publications

Clinical and Functional Outcomes of Peri-Implant Fractures Associated with Short Proximal Femur Nails: Prevention Strategies and Key Insights.

J Clin Med

January 2025

Discipline of Physiotherapy, Faculty of Health Sciences, European University Miguel de Cervantes, C del Padre Julio Chevalier 2, 47012 Valladolid, Spain.

Ignacio Aguado-Maestro Sergio Valle-López Clarisa Simón-Pérez Emilio-Javier Frutos-Reoyo Ignacio García-Cepeda

: Hip fractures are prevalent among the elderly and impose a significant burden on healthcare systems due to the associated high morbidity and costs. The increasing use of intramedullary nails for hip fracture fixation has inadvertently introduced risks; these implants can alter bone elasticity and create stress concentrations, leading to peri-implant fractures. The aim of this study is to investigate the outcomes of peri-implant hip fractures, evaluate the potential causes of such fractures, determine the type of treatment provided, assess the outcomes of said treatments, and establish possible improvement strategies.

View Article and Find Full Text PDF

Similar Publications

Modelling Intra-Sinus Fluid Movements and Drainage Through Computational Fluid Dynamics Before and After Maxillary Sinus Augmentation: A Simulation-Based Pilot Study.

J Clin Med

December 2024

Clinic for Masticatory Disorders and Dental Biomaterials, Center for Dental Medicine, University of Zurich, 8006 Zurich, Switzerland.

İpek Necla Güldiken Sarıkaya Alperen Tekin Fatih Suda Zeynep Gülen Çukurova Yilmaz Mutlu Özcan

: Sinus lifting, a procedure to augment bone in the maxilla, may cause complications such as sinusitis due to impaired drainage. This study aimed to assess how sinus lifting impacts airflow in the sinus cavity, which is essential for patients undergoing dental implants. Using computational fluid dynamics (CFD), this research analyzed airflow changes after sinus floor elevation, offering insights into the aerodynamic consequences of the procedure.

View Article and Find Full Text PDF

Similar Publications

Vehicle Target Detection of Autonomous Driving Vehicles in Foggy Environments Based on an Improved YOLOX Network.

Sensors (Basel)

January 2025

College of Transportation, Shandong University of Science and Technology, Qingdao 266590, China.

Zhaohui Liu Huiru Zhang Lifei Lin

To address the problems that exist in the target detection of vehicle-mounted visual sensors in foggy environments, a vehicle target detection method based on an improved YOLOX network is proposed. Firstly, to address the issue of vehicle target feature loss in foggy traffic scene images, specific characteristics of fog-affected imagery are integrated into the network training process. This not only augments the training data but also improves the robustness of the network in foggy environments.

View Article and Find Full Text PDF

Similar Publications

Advanced Brain Tumor Classification in MR Images Using Transfer Learning and Pre-Trained Deep CNN Models.

Cancers (Basel)

January 2025

Department of Computer Science, Faculty of Information Technology and Electrical Engineering, Norwegian University of Science and Technology, 2815 Gjøvik, Norway.

Rukiye Disci Fatih Gurcan Ahmet Soylu

Background/objectives: Brain tumor classification is a crucial task in medical diagnostics, as early and accurate detection can significantly improve patient outcomes. This study investigates the effectiveness of pre-trained deep learning models in classifying brain MRI images into four categories: Glioma, Meningioma, Pituitary, and No Tumor, aiming to enhance the diagnostic process through automation.

Methods: A publicly available Brain Tumor MRI dataset containing 7023 images was used in this research.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!