We consider synchronous data-parallel neural network training with a fixed large batch size. While the large batch size provides a high degree of parallelism, it degrades the generalization performance due to the low gradient noise scale. We propose a general learning rate adjustment framework and three critical heuristics that tackle the poor generalization issue. The key idea is to adjust the learning rate based on geometric information of loss landscape and encourage the model to converge into a flat minimum that is known to better generalize to the unknown data. Our empirical study demonstrates that the Hessian-aware learning rate schedule remarkably improves the generalization performance in large-batch training. For CIFAR-10 classification with ResNet20, our method achieves 92.31% accuracy using 16,384 batch size, which is close to 92.83% achieved using 128 batch size, at a negligible extra computational cost.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2022.11.007DOI Listing

Publication Analysis

Top Keywords

learning rate
16
batch size
16
hessian-aware learning
8
rate adjustment
8
large batch
8
generalization performance
8
achieving small-batch
4
small-batch accuracy
4
accuracy large-batch
4
large-batch scalability
4

Similar Publications

Protective effects of anthocyanins on the nervous system injury caused by fluoride-induced endoplasmic reticulum stress in rats.

Food Chem Toxicol

March 2025

Center for Endemic Disease Control, Chinese Center for Disease Control and Prevention, Harbin Medical University, Harbin, Heilongjiang 150081, China; Key Lab of Etiology and Epidemiology, Education Bureau of Heilongjiang Province & Ministry of Health (23618504), Harbin Medical University, Harbin 150081, China; Heilongjiang Provincial Key Lab of Trace Elements and Human Health Harbin Medical University, Harbin 150081, China. Electronic address:

Long-term fluoride exposure can produce neurotoxicity. Anthocyanins, as antioxidants, have a certain protective effect in nerve damage. This study aimed to investigate the protective role of anthocyanins in fluoride-induced neurological damage due to endoplasmic reticulum stress (ERS).

View Article and Find Full Text PDF

Skeletal fluorosis caused by coal-burning type endemic fluorosis greatly affects the health of the population in the affected areas, but large-scale diagnostic work is limited by human and material resources, making it difficult to implement comprehensively. In this study, we investigate adults in coal-burning type endemic skeletal fluorosis areas in Guizhou. The study areas are selected by a comprehensive analysis of the detection rate of dental fluorosis in children aged 8-12 years in Guizhou for the year 2023.

View Article and Find Full Text PDF

Background: Timely and accurate outcome prediction is essential for clinical decision-making for ischemic stroke patients in the intensive care unit (ICU). However, the interpretation and translation of predictive models into clinical applications are equally crucial. This study aims to develop an interpretable machine learning (IML) model that effectively predicts in-hospital mortality for ischemic stroke patients.

View Article and Find Full Text PDF

Background: Processing data from electronic health records (EHRs) to build research-grade databases is a lengthy and expensive process. Modern arthroplasty practice commonly uses multiple sites of care, including clinics and ambulatory care centers. However, most private data systems prevent obtaining usable insights for clinical practice.

View Article and Find Full Text PDF

Background: Amyotrophic lateral sclerosis (ALS) leads to rapid physiological and functional decline before causing untimely death. Current best-practice approaches to interdisciplinary care are unable to provide adequate monitoring of patients' health. Passive in-home sensor systems enable 24×7 health monitoring.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!