Among the various types of data augmentation strategies, the mixup-based approach has been particularly studied. However, in existing mixup-based approaches, object loss and label mismatching can occur if random patches are utilized when constructing augmented images, and additionally, patches that do not contain objects might be included, which degrades performance. In this paper, we propose a novel augmentation method that mixes patches in a non-overlapping manner after they are extracted from the salient regions in an image. The suggested method can make effective use of object characteristics, because the constructed image consists only of visually important regions and is robust to noise. Since the patches do not occlude each other, the semantically meaningful information in the salient regions can be fully utilized. Additionally, our method is more robust to adversarial attack than the conventional augmentation method. In the experimental results, when Wide ResNet was trained on the public datasets, CIFAR-10, CIFAR-100 and STL-10, the top-1 accuracy was 97.26%, 83.99% and 82.40% respectively, which surpasses other augmentation methods.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9560504 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0274767 | PLOS |
Acta Otolaryngol
January 2025
Department of Otorhinolaryngology, Institute of Science Tokyo, Tokyo, Japan.
Background: Recent advances in artificial intelligence have facilitated the automatic diagnosis of middle ear diseases using endoscopic tympanic membrane imaging.
Aim: We aimed to develop an automated diagnostic system for middle ear diseases by applying deep learning techniques to tympanic membrane images obtained during routine clinical practice.
Material And Methods: To augment the training dataset, we explored the use of generative adversarial networks (GANs) to produce high-quality synthetic tympanic images that were subsequently added to the training data.
J Clin Med
January 2025
Discipline of Physiotherapy, Faculty of Health Sciences, European University Miguel de Cervantes, C del Padre Julio Chevalier 2, 47012 Valladolid, Spain.
: Hip fractures are prevalent among the elderly and impose a significant burden on healthcare systems due to the associated high morbidity and costs. The increasing use of intramedullary nails for hip fracture fixation has inadvertently introduced risks; these implants can alter bone elasticity and create stress concentrations, leading to peri-implant fractures. The aim of this study is to investigate the outcomes of peri-implant hip fractures, evaluate the potential causes of such fractures, determine the type of treatment provided, assess the outcomes of said treatments, and establish possible improvement strategies.
View Article and Find Full Text PDFJ Clin Med
December 2024
Clinic for Masticatory Disorders and Dental Biomaterials, Center for Dental Medicine, University of Zurich, 8006 Zurich, Switzerland.
: Sinus lifting, a procedure to augment bone in the maxilla, may cause complications such as sinusitis due to impaired drainage. This study aimed to assess how sinus lifting impacts airflow in the sinus cavity, which is essential for patients undergoing dental implants. Using computational fluid dynamics (CFD), this research analyzed airflow changes after sinus floor elevation, offering insights into the aerodynamic consequences of the procedure.
View Article and Find Full Text PDFSensors (Basel)
January 2025
College of Transportation, Shandong University of Science and Technology, Qingdao 266590, China.
To address the problems that exist in the target detection of vehicle-mounted visual sensors in foggy environments, a vehicle target detection method based on an improved YOLOX network is proposed. Firstly, to address the issue of vehicle target feature loss in foggy traffic scene images, specific characteristics of fog-affected imagery are integrated into the network training process. This not only augments the training data but also improves the robustness of the network in foggy environments.
View Article and Find Full Text PDFCancers (Basel)
January 2025
Department of Computer Science, Faculty of Information Technology and Electrical Engineering, Norwegian University of Science and Technology, 2815 Gjøvik, Norway.
Background/objectives: Brain tumor classification is a crucial task in medical diagnostics, as early and accurate detection can significantly improve patient outcomes. This study investigates the effectiveness of pre-trained deep learning models in classifying brain MRI images into four categories: Glioma, Meningioma, Pituitary, and No Tumor, aiming to enhance the diagnostic process through automation.
Methods: A publicly available Brain Tumor MRI dataset containing 7023 images was used in this research.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!